2024-12-17

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Algorithmic Strategies for Sustainable Reuse of Neural Network Accelerators with Permanent Faults	Youssef A. Ait Alama, Sampada Sakpal, Ke Wang, Razvan Bunescu, Avinash Karanth, Ahmed Louri	2024-12-17	下载	Hardware failures are a growing challenge for machine learning accelerators, many of which are based on systolic arrays. When a permanent hardware failure occurs in a systolic array, existing solution...
AnalogXpert: Automating Analog Topology Synthesis by Incorporating Circuit Design Expertise into Large Language Models	Haoyi Zhang, Shizhao Sun, Yibo Lin, Runsheng Wang, Jiang Bian	2024-12-17	下载	Analog circuits are crucial in modern electronic systems, and automating their design has attracted significant research interest. One of major challenges is topology synthesis, which determines circu...
Design of an AI-Enhanced Digital Stethoscope: Advancing Cardiovascular Diagnostics Through Smart Auscultation	Abraham G. Taye, Sador Yemane, Eshetu Negash, Yared Minwuyelet, Nebiha Tofik	2024-12-17	下载	In the ever-evolving landscape of medical diagnostics, this study details the systematic design process and concept selection methodology for developing an advanced digital stethoscope, demonstrating ...
if-ZKP: Intel FPGA-Based Acceleration of Zero Knowledge Proofs	Shahzad Ahmad Butt, Benjamin Reynolds, Veeraraghavan Ramamurthy, Xiao Xiao, Pohrong Chu, Setareh Sharifian, Sergey Gribok, Bogdan Pasca	2024-12-17	下载	Zero-Knowledge Proofs (ZKPs) have emerged as an important cryptographic technique allowing one party (prover) to prove the correctness of a statement to some other party (verifier) and nothing else.
Investigating the Effect of Electrical and Thermal Transport Properties on Oxide-Based Memristors Performance and Reliability	Armin Gooran-Shoorakchaly, Sarah Sharif, Yaser Banad	2024-12-17	下载	Achieving reliable resistive switching in oxide-based memristive devices requires precise control over conductive filament (CF) formation and behavior, yet the fundamental relationship between oxide m...
Design and Performance Analysis of an Ultra-Low Power Integrate-and-Fire Neuron Circuit Using Nanoscale Side-contacted Field Effect Diode Technology	Seyedmohamadjavad Motaman, Sarah Sharif, Yaser Banad	2024-12-17	下载	Enhancing power efficiency and performance in neuromorphic computing systems is critical for next-generation artificial intelligence applications.
FinGraV: Methodology for Fine-Grain GPU Power Visibility and Insights	Varsha Singhania, Shaizeen Aga, Mohamed Assem Ibrahim	2024-12-17	下载	Ubiquity of AI makes optimizing GPU power a priority as large GPU-based clusters are often employed to train and serve AI models. An important first step in optimizing GPU power consumption is high-fi...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Accelerating the Operation of Complex Workflows through Standard Data Interfaces	Taylor Paul, William Regli	2024-12-17	下载	In this position paper we argue for standardizing how we share and process data in scientific workflows at the network-level to maximize step re-use and workflow portability across platforms and netwo...
Distributed Speculative Execution for Resilient Cloud Applications	Tianyu Li, Badrish Chandramouli, Philip A. Bernstein, Samuel Madden	2024-12-17	下载	Fault-tolerance is critically important in highly-distributed modern cloud applications. Solutions such as Temporal, Azure Durable Functions, and Beldi hide fault-tolerance complexity from developers ...
C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System	Parker Addison, Minh-Tuan H. Nguyen, Tomislav Medan, Jinali Shah, Mohammad T. Manzari, Brendan McElrone, Laksh Lalwani, Aboli More, Smita Sharma, Holger R. Roth, Isaac Yang, Chester Chen, Daguang Xu, Yan Cheng, Andrew Feng, Ziyue Xu	2024-12-17	下载	Organizations seeking to utilize Large Language Models (LLMs) for knowledge querying and analysis often encounter challenges in maintaining an LLM fine-tuned on targeted, up-to-date information that k...
Cuckoo Heavy Keeper and the balancing act of maintaining heavy hitters in stream processing	Vinh Quang Ngo, Marina Papatriantafilou	2024-12-17	下载	Finding heavy hitters in databases and data streams is a fundamental problem with applications ranging from network monitoring to database query optimization, machine learning, and more.
Exposing the Vulnerability of Decentralized Learning to Membership Inference Attacks Through the Lens of Graph Mixing	Ousmane Touat, Jezekael Brunon, Yacine Belal, Julien Nicolas, César Sabater, Mohamed Maouche, Sonia Ben Mokhtar	2024-12-17	下载	The primary promise of decentralized learning is to allow users to engage in the training of machine learning models in a collaborative manner while keeping their data on their premises and without re...
AsyncSC: An Asynchronous Sidechain for Multi-Domain Data Exchange in Internet of Things	Lingxiao Yang, Xuewen Dong, Zhiguo Wan, Sheng Gao, Wei Tong, Di Lu, Yulong Shen, Xiaojiang Du	2024-12-17	下载	Sidechain techniques improve blockchain scalability and interoperability, providing decentralized exchange and cross-chain collaboration solutions for Internet of Things (IoT) data across various doma...
Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models	Seungeun Oh, Jinhyuk Kim, Jihong Park, Seung-Woo Ko, Tony Q. S. Quek, Seong-Lyun Kim	2024-12-17	下载	This paper studies a hybrid language model (HLM) architecture that integrates a small language model (SLM) operating on a mobile device with a large language model (LLM) hosted at the base station (BS...
Exploring AI-Enabled Cybersecurity Frameworks: Deep-Learning Techniques, GPU Support, and Future Enhancements	Tobias Becher, Simon Torka	2024-12-17	下载	Traditional rule-based cybersecurity systems have proven highly effective against known malware threats. However, they face challenges in detecting novel threats.
TrainMover: An Interruption-Resilient and Reliable ML Training Runtime	ChonLam Lao, Minlan Yu, Aditya Akella, Jiamin Cao, Yu Guan, Pengcheng Zhang, Zhilong Zheng, Yichi Xu, Ennan Zhai, Dennis Cai, Jiaqi Gao	2024-12-17	下载	Large-scale ML training jobs are frequently interrupted by hardware and software anomalies, failures, and management events. Existing solutions like checkpointing or runtime reconfiguration suffer fro...
Round and Communication Efficient Graph Coloring	Yi-Jun Chang, Gopinath Mishra, Hung Thuan Nguyen, Farrel D Salim	2024-12-17	下载	In the context of communication complexity, we explore protocols for graph coloring, focusing on the vertex and edge coloring problems in $n$ -vertex graphs $G$ with a maximum degree Δ.
Accelerating End-Cloud Collaborative Inference via Near Bubble-free Pipeline Optimization	Luyao Gao, Jianchun Liu, Hongli Xu, Sun Xu, Qianpiao Ma, Liusheng Huang	2024-12-17	下载	End-cloud collaboration offers a promising strategy to enhance the Quality of Service (QoS) in DNN inference by offloading portions of the inference workload from end devices to cloud servers.
A System for Microserving of LLMs	Hongyi Jin, Ruihang Lai, Charlie F. Ruan, Yingcheng Wang, Todd C. Mowry, Xupeng Miao, Zhihao Jia, Tianqi Chen	2024-12-17	下载	The recent advances in LLMs bring a strong demand for efficient system support to improve overall serving efficiency. As LLM inference scales towards multiple GPUs and even multiple compute nodes, var...
Echo: Simulating Distributed Training At Scale	Yicheng Feng, Yuetao Chen, Kaiwen Chen, Jingzong Li, Tianyuan Wu, Peng Cheng, Chuan Wu, Wei Wang, Tsung-Yi Ho, Hong Xu	2024-12-17	下载	Simulation offers unique values for both enumeration and extrapolation purposes, and is becoming increasingly important for managing the massive machine learning (ML) clusters and large-scale distribu...
FinGraV: Methodology for Fine-Grain GPU Power Visibility and Insights	Varsha Singhania, Shaizeen Aga, Mohamed Assem Ibrahim	2024-12-17	下载	Ubiquity of AI makes optimizing GPU power a priority as large GPU-based clusters are often employed to train and serve AI models. An important first step in optimizing GPU power consumption is high-fi...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Accelerating the Operation of Complex Workflows through Standard Data Interfaces	Taylor Paul, William Regli	2024-12-17	下载	In this position paper we argue for standardizing how we share and process data in scientific workflows at the network-level to maximize step re-use and workflow portability across platforms and netwo...
Driving Innovation in 6G Wireless Technologies: The OpenAirInterface Approach	Florian Kaltenberger, Tommaso Melodia, Irfan Ghauri, Michele Polese, Raymond Knopp, Tien Thinh Nguyen, Sakthivel Velumani, Davide Villa, Leonardo Bonati, Robert Schmidt, Sagar Arora, Mikel Irazabal, Navid Nikaein	2024-12-17	下载	The development of 6G wireless technologies is rapidly advancing, with the 3rd Generation Partnership Project (3GPP) entering the pre-standardization phase and aiming to deliver the first specificatio...
TIMESAFE: Timing Interruption Monitoring and Security Assessment for Fronthaul Environments	Joshua Groen, Simone Di Valerio, Imtiaz Karim, Davide Villa, Yiewi Zhang, Leonardo Bonati, Michele Polese, Salvatore D'Oro, Tommaso Melodia, Elisa Bertino, Francesca Cuomo, Kaushik Chowdhury	2024-12-17	下载	5G and beyond cellular systems embrace the disaggregation of Radio Access Network (RAN) components, exemplified by the evolution of the fronthaul (FH) connection between cellular baseband and radio un...
System-Level Experimental Evaluation of Reconfigurable Intelligent Surfaces for NextG Communication Systems	Maria Tsampazi, Tommaso Melodia	2024-12-17	下载	Reconfigurable Intelligent Surfaces (RISs) are a promising technique for enhancing the performance of Next Generation (NextG) wireless communication systems in terms of both spectral and energy effici...
2D-AoI: Age-of-Information of Distributed Sensors for Spatio-Temporal Processes	Markus Fidler, Flavio Gallistl, Jaya Prakash Champati, Joerg Widmer	2024-12-17	下载	The freshness of sensor data is critical for all types of cyber-physical systems. An established measure for quantifying data freshness is the Age-of-Information (AoI), which has been the subject of e...
Experimental Study of Low-Latency Video Streaming in an ORAN Setup with Generative AI	Andreas Casparsen, Van-Phuc Bui, Shashi Raj Pandey, Jimmy Jessen Nielsen, Petar Popovski	2024-12-17	下载	Current Adaptive Bit Rate (ABR) methods react to network congestion after it occurs, causing application layer buffering and latency spikes in live video streaming.
Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models	Seungeun Oh, Jinhyuk Kim, Jihong Park, Seung-Woo Ko, Tony Q. S. Quek, Seong-Lyun Kim	2024-12-17	下载	This paper studies a hybrid language model (HLM) architecture that integrates a small language model (SLM) operating on a mobile device with a large language model (LLM) hosted at the base station (BS...
Distributed satellite information networks: Architecture, enabling technologies, and trends	Qinyu Zhang, Liang Xu, Jianhao Huang, Tao Yang, Jian Jiao, Ye Wang, Yao Shi, Chiya Zhang, Xingjian Zhang, Ke Zhang, Yupeng Gong, Na Deng, Nan Zhao, Zhen Gao, Shujun Han, Xiaodong Xu, Li You, Dongming Wang, Shan Jiang, Dixian Zhao, Nan Zhang, Liujun Hu, Xiongwen He, Yonghui Li, Xiqi Gao, Xiaohu You	2024-12-17	下载	Driven by the vision of ubiquitous connectivity and wireless intelligence, the evolution of ultra-dense constellation-based satellite-integrated Internet is underway, now taking preliminary shape.
Personalized Federated Deep Reinforcement Learning for Heterogeneous Edge Content Caching Networks	Zhen Li, Tan Li, Hai Liu, Tse-Tin Chan	2024-12-17	下载	Proactive caching is essential for minimizing latency and improving Quality of Experience (QoE) in multi-server edge networks. Federated Deep Reinforcement Learning (FDRL) is a promising approach for ...
Rydberg Atomic Receiver: Next Frontier of Wireless Communications	Mingyao Cui, Qunsong Zeng, Kaibin Huang	2024-12-17	下载	Rydberg Atomic REceiver (RARE) is driving a paradigm shift in electromagnetic (EM) wave measurement by harnessing the electron transition phenomenon of Rydberg atoms.

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Scaling Inter-procedural Dataflow Analysis on the Cloud	Zewen Sun, Yujin Zhang, Duanchen Xu, Yiyu Zhang, Yun Qi, Yueyang Wang, Yi Li, Zhaokang Wang, Yue Li, Xuandong Li, Zhiqiang Zuo, Qingda Lu, Wenwen Peng, Shengjian Guo	2024-12-17	下载	Apart from forming the backbone of compiler optimization, static dataflow analysis has been widely applied in a vast variety of applications, such as bug detection, privacy analysis, program comprehen...
Optimizing System Memory Bandwidth with Micron CXL Memory Expansion Modules on Intel Xeon 6 Processors	Rohit Sehgal, Vishal Tanna, Vinicius Petrucci, Anil Godbole	2024-12-17	下载	High-Performance Computing (HPC) and Artificial Intelligence (AI) workloads typically demand substantial memory bandwidth and, to a degree, memory capacity.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
2D-AoI: Age-of-Information of Distributed Sensors for Spatio-Temporal Processes	Markus Fidler, Flavio Gallistl, Jaya Prakash Champati, Joerg Widmer	2024-12-17	下载	The freshness of sensor data is critical for all types of cyber-physical systems. An established measure for quantifying data freshness is the Age-of-Information (AoI), which has been the subject of e...
TrainMover: An Interruption-Resilient and Reliable ML Training Runtime	ChonLam Lao, Minlan Yu, Aditya Akella, Jiamin Cao, Yu Guan, Pengcheng Zhang, Zhilong Zheng, Yichi Xu, Ennan Zhai, Dennis Cai, Jiaqi Gao	2024-12-17	下载	Large-scale ML training jobs are frequently interrupted by hardware and software anomalies, failures, and management events. Existing solutions like checkpointing or runtime reconfiguration suffer fro...
Optimizing System Memory Bandwidth with Micron CXL Memory Expansion Modules on Intel Xeon 6 Processors	Rohit Sehgal, Vishal Tanna, Vinicius Petrucci, Anil Godbole	2024-12-17	下载	High-Performance Computing (HPC) and Artificial Intelligence (AI) workloads typically demand substantial memory bandwidth and, to a degree, memory capacity.