
2024-12-17

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Algorithmic Strategies for Sustainable Reuse of Neural Network Accelerators with Permanent Faults | Youssef A. Ait Alama, Sampada Sakpal, Ke Wang, Razvan Bunescu, Avinash Karanth, Ahmed Louri | 2024-12-17 | Download | Hardware failures are a growing challenge for machine learning accelerators, many of which are based on systolic arrays. When a permanent hardware failure occurs in a systolic array, existing solution... |
| AnalogXpert: Automating Analog Topology Synthesis by Incorporating Circuit Design Expertise into Large Language Models | Haoyi Zhang, Shizhao Sun, Yibo Lin, Runsheng Wang, Jiang Bian | 2024-12-17 | Download | Analog circuits are crucial in modern electronic systems, and automating their design has attracted significant research interest. One of major challenges is topology synthesis, which determines circu... |
| Design of an AI-Enhanced Digital Stethoscope: Advancing Cardiovascular Diagnostics Through Smart Auscultation | Abraham G. Taye, Sador Yemane, Eshetu Negash, Yared Minwuyelet, Nebiha Tofik | 2024-12-17 | Download | In the ever-evolving landscape of medical diagnostics, this study details the systematic design process and concept selection methodology for developing an advanced digital stethoscope, demonstrating ... |
| if-ZKP: Intel FPGA-Based Acceleration of Zero Knowledge Proofs | Shahzad Ahmad Butt, Benjamin Reynolds, Veeraraghavan Ramamurthy, Xiao Xiao, Pohrong Chu, Setareh Sharifian, Sergey Gribok, Bogdan Pasca | 2024-12-17 | Download | Zero-Knowledge Proofs (ZKPs) have emerged as an important cryptographic technique allowing one party (prover) to prove the correctness of a statement to some other party (verifier) and nothing else. |
| Investigating the Effect of Electrical and Thermal Transport Properties on Oxide-Based Memristors Performance and Reliability | Armin Gooran-Shoorakchaly, Sarah Sharif, Yaser Banad | 2024-12-17 | Download | Achieving reliable resistive switching in oxide-based memristive devices requires precise control over conductive filament (CF) formation and behavior, yet the fundamental relationship between oxide m... |
| Design and Performance Analysis of an Ultra-Low Power Integrate-and-Fire Neuron Circuit Using Nanoscale Side-contacted Field Effect Diode Technology | Seyedmohamadjavad Motaman, Sarah Sharif, Yaser Banad | 2024-12-17 | Download | Enhancing power efficiency and performance in neuromorphic computing systems is critical for next-generation artificial intelligence applications. |
| FinGraV: Methodology for Fine-Grain GPU Power Visibility and Insights | Varsha Singhania, Shaizeen Aga, Mohamed Assem Ibrahim | 2024-12-17 | Download | Ubiquity of AI makes optimizing GPU power a priority as large GPU-based clusters are often employed to train and serve AI models. An important first step in optimizing GPU power consumption is high-fi... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Accelerating the Operation of Complex Workflows through Standard Data Interfaces | Taylor Paul, William Regli | 2024-12-17 | Download | In this position paper we argue for standardizing how we share and process data in scientific workflows at the network-level to maximize step re-use and workflow portability across platforms and netwo... |
| Distributed Speculative Execution for Resilient Cloud Applications | Tianyu Li, Badrish Chandramouli, Philip A. Bernstein, Samuel Madden | 2024-12-17 | Download | Fault-tolerance is critically important in highly-distributed modern cloud applications. Solutions such as Temporal, Azure Durable Functions, and Beldi hide fault-tolerance complexity from developers ... |
| C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System | Parker Addison, Minh-Tuan H. Nguyen, Tomislav Medan, Jinali Shah, Mohammad T. Manzari, Brendan McElrone, Laksh Lalwani, Aboli More, Smita Sharma, Holger R. Roth, Isaac Yang, Chester Chen, Daguang Xu, Yan Cheng, Andrew Feng, Ziyue Xu | 2024-12-17 | Download | Organizations seeking to utilize Large Language Models (LLMs) for knowledge querying and analysis often encounter challenges in maintaining an LLM fine-tuned on targeted, up-to-date information that k... |
| Cuckoo Heavy Keeper and the balancing act of maintaining heavy hitters in stream processing | Vinh Quang Ngo, Marina Papatriantafilou | 2024-12-17 | Download | Finding heavy hitters in databases and data streams is a fundamental problem with applications ranging from network monitoring to database query optimization, machine learning, and more. |
| Exposing the Vulnerability of Decentralized Learning to Membership Inference Attacks Through the Lens of Graph Mixing | Ousmane Touat, Jezekael Brunon, Yacine Belal, Julien Nicolas, César Sabater, Mohamed Maouche, Sonia Ben Mokhtar | 2024-12-17 | Download | The primary promise of decentralized learning is to allow users to engage in the training of machine learning models in a collaborative manner while keeping their data on their premises and without re... |
| AsyncSC: An Asynchronous Sidechain for Multi-Domain Data Exchange in Internet of Things | Lingxiao Yang, Xuewen Dong, Zhiguo Wan, Sheng Gao, Wei Tong, Di Lu, Yulong Shen, Xiaojiang Du | 2024-12-17 | Download | Sidechain techniques improve blockchain scalability and interoperability, providing decentralized exchange and cross-chain collaboration solutions for Internet of Things (IoT) data across various doma... |
| Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models | Seungeun Oh, Jinhyuk Kim, Jihong Park, Seung-Woo Ko, Tony Q. S. Quek, Seong-Lyun Kim | 2024-12-17 | Download | This paper studies a hybrid language model (HLM) architecture that integrates a small language model (SLM) operating on a mobile device with a large language model (LLM) hosted at the base station (BS... |
| Exploring AI-Enabled Cybersecurity Frameworks: Deep-Learning Techniques, GPU Support, and Future Enhancements | Tobias Becher, Simon Torka | 2024-12-17 | Download | Traditional rule-based cybersecurity systems have proven highly effective against known malware threats. However, they face challenges in detecting novel threats. |
| TrainMover: An Interruption-Resilient and Reliable ML Training Runtime | ChonLam Lao, Minlan Yu, Aditya Akella, Jiamin Cao, Yu Guan, Pengcheng Zhang, Zhilong Zheng, Yichi Xu, Ennan Zhai, Dennis Cai, Jiaqi Gao | 2024-12-17 | Download | Large-scale ML training jobs are frequently interrupted by hardware and software anomalies, failures, and management events. Existing solutions like checkpointing or runtime reconfiguration suffer fro... |
| Round and Communication Efficient Graph Coloring | Yi-Jun Chang, Gopinath Mishra, Hung Thuan Nguyen, Farrel D Salim | 2024-12-17 | Download | In the context of communication complexity, we explore protocols for graph coloring, focusing on the vertex and edge coloring problems in n-vertex graphs G with a maximum degree Δ. |
| Accelerating End-Cloud Collaborative Inference via Near Bubble-free Pipeline Optimization | Luyao Gao, Jianchun Liu, Hongli Xu, Sun Xu, Qianpiao Ma, Liusheng Huang | 2024-12-17 | Download | End-cloud collaboration offers a promising strategy to enhance the Quality of Service (QoS) in DNN inference by offloading portions of the inference workload from end devices to cloud servers. |
| A System for Microserving of LLMs | Hongyi Jin, Ruihang Lai, Charlie F. Ruan, Yingcheng Wang, Todd C. Mowry, Xupeng Miao, Zhihao Jia, Tianqi Chen | 2024-12-17 | Download | The recent advances in LLMs bring a strong demand for efficient system support to improve overall serving efficiency. As LLM inference scales towards multiple GPUs and even multiple compute nodes, var... |
| Echo: Simulating Distributed Training At Scale | Yicheng Feng, Yuetao Chen, Kaiwen Chen, Jingzong Li, Tianyuan Wu, Peng Cheng, Chuan Wu, Wei Wang, Tsung-Yi Ho, Hong Xu | 2024-12-17 | Download | Simulation offers unique values for both enumeration and extrapolation purposes, and is becoming increasingly important for managing the massive machine learning (ML) clusters and large-scale distribu... |
| FinGraV: Methodology for Fine-Grain GPU Power Visibility and Insights | Varsha Singhania, Shaizeen Aga, Mohamed Assem Ibrahim | 2024-12-17 | Download | Ubiquity of AI makes optimizing GPU power a priority as large GPU-based clusters are often employed to train and serve AI models. An important first step in optimizing GPU power consumption is high-fi... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Accelerating the Operation of Complex Workflows through Standard Data Interfaces | Taylor Paul, William Regli | 2024-12-17 | Download | In this position paper we argue for standardizing how we share and process data in scientific workflows at the network-level to maximize step re-use and workflow portability across platforms and netwo... |
| Driving Innovation in 6G Wireless Technologies: The OpenAirInterface Approach | Florian Kaltenberger, Tommaso Melodia, Irfan Ghauri, Michele Polese, Raymond Knopp, Tien Thinh Nguyen, Sakthivel Velumani, Davide Villa, Leonardo Bonati, Robert Schmidt, Sagar Arora, Mikel Irazabal, Navid Nikaein | 2024-12-17 | Download | The development of 6G wireless technologies is rapidly advancing, with the 3rd Generation Partnership Project (3GPP) entering the pre-standardization phase and aiming to deliver the first specificatio... |
| TIMESAFE: Timing Interruption Monitoring and Security Assessment for Fronthaul Environments | Joshua Groen, Simone Di Valerio, Imtiaz Karim, Davide Villa, Yiewi Zhang, Leonardo Bonati, Michele Polese, Salvatore D'Oro, Tommaso Melodia, Elisa Bertino, Francesca Cuomo, Kaushik Chowdhury | 2024-12-17 | Download | 5G and beyond cellular systems embrace the disaggregation of Radio Access Network (RAN) components, exemplified by the evolution of the fronthaul (FH) connection between cellular baseband and radio un... |
| System-Level Experimental Evaluation of Reconfigurable Intelligent Surfaces for NextG Communication Systems | Maria Tsampazi, Tommaso Melodia | 2024-12-17 | Download | Reconfigurable Intelligent Surfaces (RISs) are a promising technique for enhancing the performance of Next Generation (NextG) wireless communication systems in terms of both spectral and energy effici... |
| 2D-AoI: Age-of-Information of Distributed Sensors for Spatio-Temporal Processes | Markus Fidler, Flavio Gallistl, Jaya Prakash Champati, Joerg Widmer | 2024-12-17 | Download | The freshness of sensor data is critical for all types of cyber-physical systems. An established measure for quantifying data freshness is the Age-of-Information (AoI), which has been the subject of e... |
| Experimental Study of Low-Latency Video Streaming in an ORAN Setup with Generative AI | Andreas Casparsen, Van-Phuc Bui, Shashi Raj Pandey, Jimmy Jessen Nielsen, Petar Popovski | 2024-12-17 | Download | Current Adaptive Bit Rate (ABR) methods react to network congestion after it occurs, causing application layer buffering and latency spikes in live video streaming. |
| Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models | Seungeun Oh, Jinhyuk Kim, Jihong Park, Seung-Woo Ko, Tony Q. S. Quek, Seong-Lyun Kim | 2024-12-17 | Download | This paper studies a hybrid language model (HLM) architecture that integrates a small language model (SLM) operating on a mobile device with a large language model (LLM) hosted at the base station (BS... |
| Distributed satellite information networks: Architecture, enabling technologies, and trends | Qinyu Zhang, Liang Xu, Jianhao Huang, Tao Yang, Jian Jiao, Ye Wang, Yao Shi, Chiya Zhang, Xingjian Zhang, Ke Zhang, Yupeng Gong, Na Deng, Nan Zhao, Zhen Gao, Shujun Han, Xiaodong Xu, Li You, Dongming Wang, Shan Jiang, Dixian Zhao, Nan Zhang, Liujun Hu, Xiongwen He, Yonghui Li, Xiqi Gao, Xiaohu You | 2024-12-17 | Download | Driven by the vision of ubiquitous connectivity and wireless intelligence, the evolution of ultra-dense constellation-based satellite-integrated Internet is underway, now taking preliminary shape. |
| Personalized Federated Deep Reinforcement Learning for Heterogeneous Edge Content Caching Networks | Zhen Li, Tan Li, Hai Liu, Tse-Tin Chan | 2024-12-17 | Download | Proactive caching is essential for minimizing latency and improving Quality of Experience (QoE) in multi-server edge networks. Federated Deep Reinforcement Learning (FDRL) is a promising approach for ... |
| Rydberg Atomic Receiver: Next Frontier of Wireless Communications | Mingyao Cui, Qunsong Zeng, Kaibin Huang | 2024-12-17 | Download | Rydberg Atomic REceiver (RARE) is driving a paradigm shift in electromagnetic (EM) wave measurement by harnessing the electron transition phenomenon of Rydberg atoms. |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Scaling Inter-procedural Dataflow Analysis on the Cloud | Zewen Sun, Yujin Zhang, Duanchen Xu, Yiyu Zhang, Yun Qi, Yueyang Wang, Yi Li, Zhaokang Wang, Yue Li, Xuandong Li, Zhiqiang Zuo, Qingda Lu, Wenwen Peng, Shengjian Guo | 2024-12-17 | Download | Apart from forming the backbone of compiler optimization, static dataflow analysis has been widely applied in a vast variety of applications, such as bug detection, privacy analysis, program comprehen... |
| Optimizing System Memory Bandwidth with Micron CXL Memory Expansion Modules on Intel Xeon 6 Processors | Rohit Sehgal, Vishal Tanna, Vinicius Petrucci, Anil Godbole | 2024-12-17 | Download | High-Performance Computing (HPC) and Artificial Intelligence (AI) workloads typically demand substantial memory bandwidth and, to a degree, memory capacity. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| 2D-AoI: Age-of-Information of Distributed Sensors for Spatio-Temporal Processes | Markus Fidler, Flavio Gallistl, Jaya Prakash Champati, Joerg Widmer | 2024-12-17 | Download | The freshness of sensor data is critical for all types of cyber-physical systems. An established measure for quantifying data freshness is the Age-of-Information (AoI), which has been the subject of e... |
| TrainMover: An Interruption-Resilient and Reliable ML Training Runtime | ChonLam Lao, Minlan Yu, Aditya Akella, Jiamin Cao, Yu Guan, Pengcheng Zhang, Zhilong Zheng, Yichi Xu, Ennan Zhai, Dennis Cai, Jiaqi Gao | 2024-12-17 | Download | Large-scale ML training jobs are frequently interrupted by hardware and software anomalies, failures, and management events. Existing solutions like checkpointing or runtime reconfiguration suffer fro... |
| Optimizing System Memory Bandwidth with Micron CXL Memory Expansion Modules on Intel Xeon 6 Processors | Rohit Sehgal, Vishal Tanna, Vinicius Petrucci, Anil Godbole | 2024-12-17 | Download | High-Performance Computing (HPC) and Artificial Intelligence (AI) workloads typically demand substantial memory bandwidth and, to a degree, memory capacity. |
