Skip to content

2024-04-23

cs.AR - Architecture

标题作者发布日期PDF摘要
NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial AcceleratorKaustubh Shivdikar, Nicolas Bohm Agostini, Malith Jayaweera, Gilbert Jonatan, Jose L. Abellan, Ajay Joshi, John Kim, David Kaeli2024-04-23下载Graph Neural Networks (GNNs) are emerging as a formidable tool for processing non-euclidean data across various domains, ranging from social network analysis to bioinformatics.
Evaluating LLMs for Hardware Design and TestJason Blocklove, Siddharth Garg, Ramesh Karri, Hammond Pearce2024-04-23下载Large Language Models (LLMs) have demonstrated capabilities for producing code in Hardware Description Languages (HDLs). However, most of the focus remains on their abilities to write functional code,...
Distributed Architecture for FPGA-based Superconducting Qubit ControlNeelay Fruitwala, Gang Huang, Yilun Xu, Abhi Rajagopala, Akel Hashim, Ravi K. Naik, Kasra Nowrouzi, David I. Santiago, Irfan Siddiqi2024-04-23下载Quantum circuits utilizing real time feedback techniques (such as active reset and mid-circuit measurement) are a powerful tool for NISQ-era quantum computing.
Apparate: Evading Memory Hierarchy with GodSpeed Wireless-on-ChipNitesh Narayana GS, Abhijit Das2024-04-23下载The rapid advancements in memory systems, CPU technology, and emerging technologies herald a transformative potential in computing, promising to revolutionize memory hierarchies.
A high-level synthesis approach for precisely-timed, energy-efficient embedded systemsYuchao Liao, Tosiron Adegbija, Roman Lysecky2024-04-23下载Embedded systems continue to rapidly proliferate in diverse fields, including medical devices, autonomous vehicles, and more generally, the Internet of Things (IoT).
Skip the Benchmark: Generating System-Level High-Level Synthesis Data using Generative Machine LearningYuchao Liao, Tosiron Adegbija, Roman Lysecky, Ravi Tandon2024-04-23下载High-Level Synthesis (HLS) Design Space Exploration (DSE) is a widely accepted approach for efficiently exploring Pareto-optimal and optimal hardware solutions during the HLS process.
Workload-Aware Hardware Accelerator Mining for Distributed Deep Learning TrainingMuhammad Adnan, Amar Phanishayee, Janardhan Kulkarni, Prashant J. Nair, Divya Mahajan2024-04-23下载In this paper, we present a novel technique to search for hardware architectures of accelerators optimized for end-to-end training of deep neural networks (DNNs).

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Low-Bandwidth Matrix Multiplication: Faster Algorithms and More General Forms of SparsityChetan Gupta, Janne H. Korhonen, Jan Studený, Jukka Suomela, Hossein Vahidi2024-04-23下载In prior work, Gupta et al. (SPAA 2022) presented a distributed algorithm for multiplying sparse n×nn \times n matrices, using nn computers. They assumed that the input matrices are uniformly sparse--...
NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial AcceleratorKaustubh Shivdikar, Nicolas Bohm Agostini, Malith Jayaweera, Gilbert Jonatan, Jose L. Abellan, Ajay Joshi, John Kim, David Kaeli2024-04-23下载Graph Neural Networks (GNNs) are emerging as a formidable tool for processing non-euclidean data across various domains, ranging from social network analysis to bioinformatics.
FedGreen: Carbon-aware Federated Learning with Model Size AdaptationAli Abbasi, Fan Dong, Xin Wang, Henry Leung, Jiayu Zhou, Steve Drew2024-04-23下载Federated learning (FL) provides a promising collaborative framework to build a model from distributed clients, and this work investigates the carbon emission of the FL process.
A Review on Message Complexity of the Algorithms for Clock Synchronization in Distributed SystemsChandeepa Dissanayake, Chanuka Algama2024-04-23下载In this work, we present an extensive analysis of clock synchronization algorithms, with a specific focus on message complexity. We begin by introducing fundamental concepts in clock synchronization, ...
Estimation Network Design framework for efficient distributed optimizationMattia Bianchi, Sergio Grammatico2024-04-23下载Distributed decision problems features a group of agents that can only communicate over a peer-to-peer network, without a central memory. In applications such as network control and data ranking, each...
Efficient Multi-Processor Scheduling in Increasingly Realistic ModelsPál András Papp, Georg Anegg, Aikaterini Karanasiou, A. N. Yzelman2024-04-23下载We study the problem of efficiently scheduling a computational DAG on multiple processors. The majority of previous works have developed and compared algorithms for this problem in relatively simple m...
Black Hole Search by Scattered Agents in Dynamic RingsGiuseppe Antonio Di Luna, Paola Flocchini, Giuseppe Prencipe, Nicola Santoro2024-04-23下载In this paper, we address the challenge of locating a black hole within a dynamic graph using a set of scattered agents, which start from arbitrary positions in the graph.
Mapping Parallel Matrix Multiplication in GotoBLAS2 to the AMD Versal ACAP for Deep LearningJie Lei, Enrique S. Quintana-Ortí2024-04-23下载This paper investigates the design of parallel general matrix multiplication (GEMM) for a Versal Adaptive Compute Accelerated Platform (ACAP) equipped with a VC1902 system-on-chip and multiple Artific...
Graph Neural Networks and Reinforcement Learning for Proactive Application Image PlacementAntonios Makris, Theodoros Theodoropoulos, Evangelos Psomakelis, Emanuele Carlini, Matteo Mordacchini, Patrizio Dazzi, Konstantinos Tserpes2024-04-23下载The shift from Cloud Computing to a Cloud-Edge continuum presents new opportunities and challenges for data-intensive and interactive applications.
Channel Access Methods for RF-Powered IoT Networks: A SurveyHang Yu, Lei Zhang, Yiwei Li, Kwan-Wu Chin, Changlin Yang2024-04-23下载Many Internet of Things (IoT) networks with Radio Frequency (RF) powered devices operate over a shared medium. They thus require a channel access protocol.
It's Hard to HAC with Average Linkage!MohammadHossein Bateni, Laxman Dhulipala, Kishen N Gowda, D Ellis Hershkowitz, Rajesh Jayaram, Jakub Łącki2024-04-23下载Average linkage Hierarchical Agglomerative Clustering (HAC) is an extensively studied and applied method for hierarchical clustering. Recent applications to massive datasets have driven significant in...
ORBIT: Oak Ridge Base Foundation Model for Earth System PredictabilityXiao Wang, Siyan Liu, Aristeidis Tsaris, Jong-Youl Choi, Ashwin Aji, Ming Fan, Wei Zhang, Junqi Yin, Moetasim Ashfaq, Dan Lu, Prasanna Balaprakash2024-04-23下载Earth system predictability is challenged by the complexity of environmental dynamics and the multitude of variables involved. Current AI foundation models, although advanced by leveraging large and h...
Towards Fast Setup and High Throughput of GPU Serverless ComputingHan Zhao, Weihao Cui, Quan Chen, Shulai Zhang, Zijun Li, Jingwen Leng, Chao Li, Deze Zeng, Minyi Guo2024-04-23下载Integrating GPUs into serverless computing platforms is crucial for improving efficiency. However, existing solutions for GPU-enabled serverless computing platforms face two significant problems due t...
Workload-Aware Hardware Accelerator Mining for Distributed Deep Learning TrainingMuhammad Adnan, Amar Phanishayee, Janardhan Kulkarni, Prashant J. Nair, Divya Mahajan2024-04-23下载In this paper, we present a novel technique to search for hardware architectures of accelerators optimized for end-to-end training of deep neural networks (DNNs).

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Multi-Tier Non-Terrestrial Networking for Disaster Communications: A Layered Clustering ApproachMetin Ozturk, Berk Çiloğlu, Görkem Berkay Koç, Halim Yanikomeroglu2024-04-23下载It is crucial to deploy temporary non-terrestrial networks (NTN) in disaster situations where terrestrial networks are no longer operable. Deploying uncrewed aerial vehicle base stations (UAV-BSs) can...
Predictive Intent Maintenance with Intent Drift Detection in Next Generation NetworkChukwuemeka Muonagor, Mounir Bensalem, Admela Jukan2024-04-23下载Intent-Based Networking (IBN) is a known concept for enabling the autonomous configuration and self-adaptation of networks. One of the major issues in IBN is maintaining the applied intent due the eff...
Securing O-RAN Open InterfacesJoshua Groen, Salvatore D'Oro, Utku Demir, Leonardo Bonati, Davide Villa, Michele Polese, Tommaso Melodia, Kaushik Chowdhury2024-04-23下载The next generation of cellular networks will be characterized by openness, intelligence, virtualization, and distributed computing. The Open Radio Access Network (Open RAN) framework represents a sig...
Outage Probability Analysis of Wireless Paths with Faulty Reconfigurable Intelligent SurfacesMounir Bensalem, Admela Jukan2024-04-23下载We consider a next generation wireless network incorporating a base station a set of typically low-cost and faulty Reconfigurable Intelligent Surfaces (RISs).
Understanding IoT Domain Names: Analysis and Classification Using Machine LearningIbrahim Ayoub, Martine S. Lenders, Benoît Ampeau, Sandoche Balakrichenan, Kinda Khawam, Thomas C. Schmidt, Matthias Wählisch2024-04-23下载In this paper, we investigate the domain names of servers on the Internet that are accessed by IoT devices performing machine-to-machine communications.
A Data-Driven Analysis of Vulnerable Road User Safety in Interaction with Connected Automated VehiclesEdmir Xhoxhi, Vincent Albert Wolff2024-04-23下载According to the World Health Organization, the involvement of Vulnerable Road Users (VRUs) in traffic accidents remains a significant concern, with VRUs accounting for over half of traffic fatalities...
Vulnerable Road User Clustering for Collective Perception Messages: Efficient Representation Through Geometric ShapesEdmir Xhoxhi, Vincent Albert Wolff, Yao Li, Florian Alexander Schiegg2024-04-23下载Ensuring the safety of Vulnerable Road Users (VRUs) is a critical concern in transportation, demanding significant attention from researchers and engineers.
Feature Distribution Shift Mitigation with Contrastive Pretraining for Intrusion DetectionWeixing Wang, Haojin Yang, Christoph Meinel, Hasan Yagiz Özkan, Cristian Bermudez Serna, Carmen Mas-Machuca2024-04-23下载In recent years, there has been a growing interest in using Machine Learning (ML), especially Deep Learning (DL) to solve Network Intrusion Detection (NID) problems.
Channel Access Methods for RF-Powered IoT Networks: A SurveyHang Yu, Lei Zhang, Yiwei Li, Kwan-Wu Chin, Changlin Yang2024-04-23下载Many Internet of Things (IoT) networks with Radio Frequency (RF) powered devices operate over a shared medium. They thus require a channel access protocol.
Uncrewed Vehicles in 6G Networks: A Unifying Treatment of Problems, Formulations, and ToolsWinston Hurst, Spilios Evmorfos, Athina Petropulu, Yasamin Mostofi2024-04-23下载Uncrewed Vehicles (UVs) functioning as autonomous agents are anticipated to play a crucial role in the 6th Generation of wireless networks. Their seamless integration, cost-effectiveness, and the addi...
Teaching Network Traffic Matrices in an Interactive Game EnvironmentChasen Milner, Hayden Jananthan, Jeremy Kepner, Vijay Gadepally, Michael Jones, Peter Michaleas, Ritesh Patel, Sandeep Pisharody, Gabriel Wachman, Alex Pentland2024-04-23下载The Internet has become a critical domain for modern society that requires ongoing efforts for its improvement and protection. Network traffic matrices are a powerful tool for understanding and analyz...

cs.PF - Performance

标题作者发布日期PDF摘要
Towards self-optimization of publish/subscribe IoT systems using continuous performance monitoringMohammed Djahafi, Nabila Salmi2024-04-23下载Today, more and more embedded devices are being connected through a network, generally Internet, offering users different services. This concept refers to Internet of Things (IoT), bringing informatio...

基于 VitePress 构建