Skip to content

2024-10-22

cs.AR - Architecture

标题作者发布日期PDF摘要
A 10.60 μW 150 GOPS Mixed-Bit-Width Sparse CNN Accelerator for Life-Threatening Ventricular Arrhythmia DetectionYifan Qin, Zhenge Jia, Zheyu Yan, Jay Mok, Manto Yung, Yu Liu, Xuejiao Liu, Wujie Wen, Luhong Liang, Kwang-Ting Tim Cheng, X. Sharon Hu, Yiyu Shi2024-10-22下载This paper proposes an ultra-low power, mixed-bit-width sparse convolutional neural network (CNN) accelerator to accelerate ventricular arrhythmia (VA) detection.
Towards Efficient IMC Accelerator Design Through Joint Hardware-Workload Co-optimizationOlga Krestinskaya, Mohammed E. Fouda, Ahmed Eltawil, Khaled N. Salama2024-10-22下载Designing generalized in-memory computing (IMC) hardware that efficiently supports a variety of workloads requires extensive design space exploration, which is infeasible to perform manually.
BETA: Automated Black-box Exploration for Timing Attacks in ProcessorsCongcong Chen, Jinhua Cui, Jiliang Zhang2024-10-22下载Modern processor advancements have introduced security risks, particularly in the form of microarchitectural timing attacks. High-profile attacks such as Meltdown and Spectre have revealed critical fl...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
AI-focused HPC Data Centers Can Provide More Power Grid Flexibility and at Lower CostYihong Zhou, Angel Paredes, Chaimaa Essayeh, Thomas Morstyn2024-10-22下载The recent growth of Artificial Intelligence (AI), particularly large language models, requires energy-demanding high-performance computing (HPC) data centers, which poses a significant burden on powe...
AMUSD: Asynchronous Multi-Device Speculative Decoding for LLM AccelerationBradley McDanel2024-10-22下载Large language models typically generate tokens autoregressively, using each token as input for the next. Recent work on Speculative Decoding has sought to accelerate this process by employing a small...
Parallel Cluster-BFS and Applications to Shortest PathsLetong Wang, Guy Blelloch, Yan Gu, Yihan Sun2024-10-22下载Breadth-first Search (BFS) is one of the most important graph processing subroutines, especially for computing the unweighted distance. Many applications may require running BFS from multiple sources.
Security and RAS in the Computing ContinuumMartí Alonso, David Andreu, Ramon Canal, Stefano Di Carlo, Odysseas Chatzopoulos, Cristiano Chenet, Juanjo Costa, Andreu Girones, Dimitris Gizopoulos, George Papadimitriou, Enric Morancho, Beatriz Otero, Alessandro Savino2024-10-22下载Security and RAS are two non-functional requirements under focus for current systems developed for the computing continuum. Due to the increased number of interconnected computer systems across the co...
FlowTracer: A Tool for Uncovering Network Path Usage Imbalance in AI Training ClustersHasibul Jamil, Abdul Alim, Laurent Schares, Pavlos Maniotis, Liran Schour, Ali Sydney, Abdullah Kayi, Tevfik Kosar, Bengi Karacali2024-10-22下载The increasing complexity of AI workloads, especially distributed Large Language Model (LLM) training, places significant strain on the networking infrastructure of parallel data centers and supercomp...
LoRA-C: Parameter-Efficient Fine-Tuning of Robust CNN for IoT DevicesChuntao Ding, Xu Cao, Jianhang Xie, Linlin Fan, Shangguang Wang, Zhichao Lu2024-10-22下载Efficient fine-tuning of pre-trained convolutional neural network (CNN) models using local data is essential for providing high-quality services to users using ubiquitous and resource-limited Internet...
Efficient Scheduling of Vehicular Tasks on Edge Systems with Green Energy and Battery StorageSuvarthi Sarkar, Abinash Kumar Ray, Aryabartta Sahu2024-10-22下载The autonomous vehicle industry is rapidly expanding, requiring significant computational resources for tasks like perception and decision-making.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Characterizing Robocalls with Multiple Vantage PointsSathvik Prasad, Aleksandr Nahapetyan, Bradley Reaves2024-10-22下载Telephone spam has been among the highest network security concerns for users for many years. In response, industry and government have deployed new technologies and regulations to curb the problem, a...
Technical Report: Toward Applying Quantum Computing to Network VerificationKahlil Dozier, Justin Beltran, Kylie Berg, Hugo Matousek, Loqman Salamatian, Ethan Katz-Bassett, Dan Rubenstein2024-10-22下载Network verification (NWV), broadly defined as the verification of properties of distributed protocols used in network systems, cannot be efficiently solved on classical hardware via brute force.
FlowTracer: A Tool for Uncovering Network Path Usage Imbalance in AI Training ClustersHasibul Jamil, Abdul Alim, Laurent Schares, Pavlos Maniotis, Liran Schour, Ali Sydney, Abdullah Kayi, Tevfik Kosar, Bengi Karacali2024-10-22下载The increasing complexity of AI workloads, especially distributed Large Language Model (LLM) training, places significant strain on the networking infrastructure of parallel data centers and supercomp...
Optimizing Mixture-of-Experts Inference Time Combining Model Deployment and Communication SchedulingJialong Li, Shreyansh Tripathi, Lakshay Rastogi, Yiming Lei, Rui Pan, Yiting Xia2024-10-22下载As machine learning models scale in size and complexity, their computational requirements become a significant barrier. Mixture-of-Experts (MoE) models alleviate this issue by selectively activating r...
Nanosecond Precision Time Synchronization for Optical Data Center NetworksYiming Lei, Jialong Li, Zhengqing Liu, Raj Joshi, Yiting Xia2024-10-22下载Optical data center networks (DCNs) are renovating the infrastructure design for the cloud in the post Moore's law era. The fact that optical DCNs rely on optical circuits of microsecond-scale duratio...
Downtime Required for Bitcoin Quantum-SafetyJamie J. Pont, Joseph J. Kearney, Jack Moyler, Carlos A. Perez-Delgado2024-10-22下载Quantum devices capable of breaking the public-key cryptosystems that Bitcoin relies on to secure its transactions are expected with reasonable probability within a decade.
Safe Load Balancing in Software-Defined-NetworkingLam Dinh, Pham Tran Anh Quang, Jérémie Leguay2024-10-22下载High performance, reliability and safety are crucial properties of any Software-Defined-Networking (SDN) system. Although the use of Deep Reinforcement Learning (DRL) algorithms has been widely studie...
xApp-Level Conflict Mitigation in O-RAN, a Mobility Driven Energy Saving CaseAbdul Wadud, Fatemeh Golpayegani, Nima Afraz2024-10-22下载This paper investigates the emerging challenges of conflict detection and mitigation in Open Radio Access Network (O-RAN). Conflicts between xApps can arise that affect network performance and stabili...
Resource-Efficient Sensor Fusion via System-Wide Dynamic Gated Neural NetworksChetna Singhal, Yashuo Wu, Francesco Malandrino, Sharon Ladron de Guevara Contreras, Marco Levorato, Carla Fabiana Chiasserini2024-10-22下载Mobile systems will have to support multiple AI-based applications, each leveraging heterogeneous data sources through DNN architectures collaboratively executed within the network.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Bauplan: zero-copy, scale-up FaaS for data pipelinesJacopo Tagliabue, Tyler Caraza-Harter, Ciro Greco2024-10-22下载Chaining functions for longer workloads is a key use case for FaaS platforms in data applications. However, modern data pipelines differ significantly from typical serverless use cases (e.g.

基于 VitePress 构建