2024-12-17
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Algorithmic Strategies for Sustainable Reuse of Neural Network Accelerators with Permanent Faults | Youssef A. Ait Alama, Sampada Sakpal, Ke Wang, Razvan Bunescu, Avinash Karanth, Ahmed Louri | 2024-12-17 | Download | Hardware failures are a growing challenge for machine learning accelerators, many of which are based on systolic arrays. When a permanent hardware failure occurs in a systolic array, existing solution... |
| AnalogXpert: Automating Analog Topology Synthesis by Incorporating Circuit Design Expertise into Large Language Models | Haoyi Zhang, Shizhao Sun, Yibo Lin, Runsheng Wang, Jiang Bian | 2024-12-17 | Download | Analog circuits are crucial in modern electronic systems, and automating their design has attracted significant research interest. One of major challenges is topology synthesis, which determines circu... |
| Design of an AI-Enhanced Digital Stethoscope: Advancing Cardiovascular Diagnostics Through Smart Auscultation | Abraham G. Taye, Sador Yemane, Eshetu Negash, Yared Minwuyelet, Nebiha Tofik | 2024-12-17 | Download | In the ever-evolving landscape of medical diagnostics, this study details the systematic design process and concept selection methodology for developing an advanced digital stethoscope, demonstrating ... |
| if-ZKP: Intel FPGA-Based Acceleration of Zero Knowledge Proofs | Shahzad Ahmad Butt, Benjamin Reynolds, Veeraraghavan Ramamurthy, Xiao Xiao, Pohrong Chu, Setareh Sharifian, Sergey Gribok, Bogdan Pasca | 2024-12-17 | Download | Zero-Knowledge Proofs (ZKPs) have emerged as an important cryptographic technique allowing one party (prover) to prove the correctness of a statement to some other party (verifier) and nothing else. |
| Investigating the Effect of Electrical and Thermal Transport Properties on Oxide-Based Memristors Performance and Reliability | Armin Gooran-Shoorakchaly, Sarah Sharif, Yaser Banad | 2024-12-17 | Download | Achieving reliable resistive switching in oxide-based memristive devices requires precise control over conductive filament (CF) formation and behavior, yet the fundamental relationship between oxide m... |
| Design and Performance Analysis of an Ultra-Low Power Integrate-and-Fire Neuron Circuit Using Nanoscale Side-contacted Field Effect Diode Technology | Seyedmohamadjavad Motaman, Sarah Sharif, Yaser Banad | 2024-12-17 | Download | Enhancing power efficiency and performance in neuromorphic computing systems is critical for next-generation artificial intelligence applications. |
| FinGraV: Methodology for Fine-Grain GPU Power Visibility and Insights | Varsha Singhania, Shaizeen Aga, Mohamed Assem Ibrahim | 2024-12-17 | Download | Ubiquity of AI makes optimizing GPU power a priority as large GPU-based clusters are often employed to train and serve AI models. An important first step in optimizing GPU power consumption is high-fi... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Accelerating the Operation of Complex Workflows through Standard Data Interfaces | Taylor Paul, William Regli | 2024-12-17 | Download | In this position paper we argue for standardizing how we share and process data in scientific workflows at the network-level to maximize step re-use and workflow portability across platforms and netwo... |
| Distributed Speculative Execution for Resilient Cloud Applications | Tianyu Li, Badrish Chandramouli, Philip A. Bernstein, Samuel Madden | 2024-12-17 | Download | Fault-tolerance is critically important in highly-distributed modern cloud applications. Solutions such as Temporal, Azure Durable Functions, and Beldi hide fault-tolerance complexity from developers ... |
| C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System | Parker Addison, Minh-Tuan H. Nguyen, Tomislav Medan, Jinali Shah, Mohammad T. Manzari, Brendan McElrone, Laksh Lalwani, Aboli More, Smita Sharma, Holger R. Roth, Isaac Yang, Chester Chen, Daguang Xu, Yan Cheng, Andrew Feng, Ziyue Xu | 2024-12-17 | Download | Organizations seeking to utilize Large Language Models (LLMs) for knowledge querying and analysis often encounter challenges in maintaining an LLM fine-tuned on targeted, up-to-date information that k... |
| Cuckoo Heavy Keeper and the balancing act of maintaining heavy hitters in stream processing | Vinh Quang Ngo, Marina Papatriantafilou | 2024-12-17 | Download | Finding heavy hitters in databases and data streams is a fundamental problem with applications ranging from network monitoring to database query optimization, machine learning, and more. |
| Exposing the Vulnerability of Decentralized Learning to Membership Inference Attacks Through the Lens of Graph Mixing | Ousmane Touat, Jezekael Brunon, Yacine Belal, Julien Nicolas, César Sabater, Mohamed Maouche, Sonia Ben Mokhtar | 2024-12-17 | Download | The primary promise of decentralized learning is to allow users to engage in the training of machine learning models in a collaborative manner while keeping their data on their premises and without re... |
| AsyncSC: An Asynchronous Sidechain for Multi-Domain Data Exchange in Internet of Things | Lingxiao Yang, Xuewen Dong, Zhiguo Wan, Sheng Gao, Wei Tong, Di Lu, Yulong Shen, Xiaojiang Du | 2024-12-17 | Download | Sidechain techniques improve blockchain scalability and interoperability, providing decentralized exchange and cross-chain collaboration solutions for Internet of Things (IoT) data across various doma... |
| Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models | Seungeun Oh, Jinhyuk Kim, Jihong Park, Seung-Woo Ko, Tony Q. S. Quek, Seong-Lyun Kim | 2024-12-17 | Download | This paper studies a hybrid language model (HLM) architecture that integrates a small language model (SLM) operating on a mobile device with a large language model (LLM) hosted at the base station (BS... |
| Exploring AI-Enabled Cybersecurity Frameworks: Deep-Learning Techniques, GPU Support, and Future Enhancements | Tobias Becher, Simon Torka | 2024-12-17 | Download | Traditional rule-based cybersecurity systems have proven highly effective against known malware threats. However, they face challenges in detecting novel threats. |
| TrainMover: An Interruption-Resilient and Reliable ML Training Runtime | ChonLam Lao, Minlan Yu, Aditya Akella, Jiamin Cao, Yu Guan, Pengcheng Zhang, Zhilong Zheng, Yichi Xu, Ennan Zhai, Dennis Cai, Jiaqi Gao | 2024-12-17 | Download | Large-scale ML training jobs are frequently interrupted by hardware and software anomalies, failures, and management events. Existing solutions like checkpointing or runtime reconfiguration suffer fro... |
| Round and Communication Efficient Graph Coloring | Yi-Jun Chang, Gopinath Mishra, Hung Thuan Nguyen, Farrel D Salim | 2024-12-17 | Download | In the context of communication complexity, we explore protocols for graph coloring, focusing on the vertex and edge coloring problems in n-vertex graphs with a maximum degree Δ. |
| Accelerating End-Cloud Collaborative Inference via Near Bubble-free Pipeline Optimization | Luyao Gao, Jianchun Liu, Hongli Xu, Sun Xu, Qianpiao Ma, Liusheng Huang | 2024-12-17 | Download | End-cloud collaboration offers a promising strategy to enhance the Quality of Service (QoS) in DNN inference by offloading portions of the inference workload from end devices to cloud servers. |
| A System for Microserving of LLMs | Hongyi Jin, Ruihang Lai, Charlie F. Ruan, Yingcheng Wang, Todd C. Mowry, Xupeng Miao, Zhihao Jia, Tianqi Chen | 2024-12-17 | Download | The recent advances in LLMs bring a strong demand for efficient system support to improve overall serving efficiency. As LLM inference scales towards multiple GPUs and even multiple compute nodes, var... |
| Echo: Simulating Distributed Training At Scale | Yicheng Feng, Yuetao Chen, Kaiwen Chen, Jingzong Li, Tianyuan Wu, Peng Cheng, Chuan Wu, Wei Wang, Tsung-Yi Ho, Hong Xu | 2024-12-17 | Download | Simulation offers unique values for both enumeration and extrapolation purposes, and is becoming increasingly important for managing the massive machine learning (ML) clusters and large-scale distribu... |
| FinGraV: Methodology for Fine-Grain GPU Power Visibility and Insights | Varsha Singhania, Shaizeen Aga, Mohamed Assem Ibrahim | 2024-12-17 | Download | Ubiquity of AI makes optimizing GPU power a priority as large GPU-based clusters are often employed to train and serve AI models. An important first step in optimizing GPU power consumption is high-fi... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Accelerating the Operation of Complex Workflows through Standard Data Interfaces | Taylor Paul, William Regli | 2024-12-17 | Download | In this position paper we argue for standardizing how we share and process data in scientific workflows at the network-level to maximize step re-use and workflow portability across platforms and netwo... |
| Driving Innovation in 6G Wireless Technologies: The OpenAirInterface Approach | Florian Kaltenberger, Tommaso Melodia, Irfan Ghauri, Michele Polese, Raymond Knopp, Tien Thinh Nguyen, Sakthivel Velumani, Davide Villa, Leonardo Bonati, Robert Schmidt, Sagar Arora, Mikel Irazabal, Navid Nikaein | 2024-12-17 | Download | The development of 6G wireless technologies is rapidly advancing, with the 3rd Generation Partnership Project (3GPP) entering the pre-standardization phase and aiming to deliver the first specificatio... |
| TIMESAFE: Timing Interruption Monitoring and Security Assessment for Fronthaul Environments | Joshua Groen, Simone Di Valerio, Imtiaz Karim, Davide Villa, Yiewi Zhang, Leonardo Bonati, Michele Polese, Salvatore D'Oro, Tommaso Melodia, Elisa Bertino, Francesca Cuomo, Kaushik Chowdhury | 2024-12-17 | Download | 5G and beyond cellular systems embrace the disaggregation of Radio Access Network (RAN) components, exemplified by the evolution of the fronthaul (FH) connection between cellular baseband and radio un... |
| System-Level Experimental Evaluation of Reconfigurable Intelligent Surfaces for NextG Communication Systems | Maria Tsampazi, Tommaso Melodia | 2024-12-17 | Download | Reconfigurable Intelligent Surfaces (RISs) are a promising technique for enhancing the performance of Next Generation (NextG) wireless communication systems in terms of both spectral and energy effici... |
| 2D-AoI: Age-of-Information of Distributed Sensors for Spatio-Temporal Processes | Markus Fidler, Flavio Gallistl, Jaya Prakash Champati, Joerg Widmer | 2024-12-17 | Download | The freshness of sensor data is critical for all types of cyber-physical systems. An established measure for quantifying data freshness is the Age-of-Information (AoI), which has been the subject of e... |
| Experimental Study of Low-Latency Video Streaming in an ORAN Setup with Generative AI | Andreas Casparsen, Van-Phuc Bui, Shashi Raj Pandey, Jimmy Jessen Nielsen, Petar Popovski | 2024-12-17 | Download | Current Adaptive Bit Rate (ABR) methods react to network congestion after it occurs, causing application layer buffering and latency spikes in live video streaming. |
| Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models | Seungeun Oh, Jinhyuk Kim, Jihong Park, Seung-Woo Ko, Tony Q. S. Quek, Seong-Lyun Kim | 2024-12-17 | Download | This paper studies a hybrid language model (HLM) architecture that integrates a small language model (SLM) operating on a mobile device with a large language model (LLM) hosted at the base station (BS... |
| Distributed satellite information networks: Architecture, enabling technologies, and trends | Qinyu Zhang, Liang Xu, Jianhao Huang, Tao Yang, Jian Jiao, Ye Wang, Yao Shi, Chiya Zhang, Xingjian Zhang, Ke Zhang, Yupeng Gong, Na Deng, Nan Zhao, Zhen Gao, Shujun Han, Xiaodong Xu, Li You, Dongming Wang, Shan Jiang, Dixian Zhao, Nan Zhang, Liujun Hu, Xiongwen He, Yonghui Li, Xiqi Gao, Xiaohu You | 2024-12-17 | Download | Driven by the vision of ubiquitous connectivity and wireless intelligence, the evolution of ultra-dense constellation-based satellite-integrated Internet is underway, now taking preliminary shape. |
| Personalized Federated Deep Reinforcement Learning for Heterogeneous Edge Content Caching Networks | Zhen Li, Tan Li, Hai Liu, Tse-Tin Chan | 2024-12-17 | Download | Proactive caching is essential for minimizing latency and improving Quality of Experience (QoE) in multi-server edge networks. Federated Deep Reinforcement Learning (FDRL) is a promising approach for ... |
| Rydberg Atomic Receiver: Next Frontier of Wireless Communications | Mingyao Cui, Qunsong Zeng, Kaibin Huang | 2024-12-17 | Download | Rydberg Atomic REceiver (RARE) is driving a paradigm shift in electromagnetic (EM) wave measurement by harnessing the electron transition phenomenon of Rydberg atoms. |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Scaling Inter-procedural Dataflow Analysis on the Cloud | Zewen Sun, Yujin Zhang, Duanchen Xu, Yiyu Zhang, Yun Qi, Yueyang Wang, Yi Li, Zhaokang Wang, Yue Li, Xuandong Li, Zhiqiang Zuo, Qingda Lu, Wenwen Peng, Shengjian Guo | 2024-12-17 | Download | Apart from forming the backbone of compiler optimization, static dataflow analysis has been widely applied in a vast variety of applications, such as bug detection, privacy analysis, program comprehen... |
| Optimizing System Memory Bandwidth with Micron CXL Memory Expansion Modules on Intel Xeon 6 Processors | Rohit Sehgal, Vishal Tanna, Vinicius Petrucci, Anil Godbole | 2024-12-17 | Download | High-Performance Computing (HPC) and Artificial Intelligence (AI) workloads typically demand substantial memory bandwidth and, to a degree, memory capacity. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| 2D-AoI: Age-of-Information of Distributed Sensors for Spatio-Temporal Processes | Markus Fidler, Flavio Gallistl, Jaya Prakash Champati, Joerg Widmer | 2024-12-17 | Download | The freshness of sensor data is critical for all types of cyber-physical systems. An established measure for quantifying data freshness is the Age-of-Information (AoI), which has been the subject of e... |
| TrainMover: An Interruption-Resilient and Reliable ML Training Runtime | ChonLam Lao, Minlan Yu, Aditya Akella, Jiamin Cao, Yu Guan, Pengcheng Zhang, Zhilong Zheng, Yichi Xu, Ennan Zhai, Dennis Cai, Jiaqi Gao | 2024-12-17 | Download | Large-scale ML training jobs are frequently interrupted by hardware and software anomalies, failures, and management events. Existing solutions like checkpointing or runtime reconfiguration suffer fro... |
| Optimizing System Memory Bandwidth with Micron CXL Memory Expansion Modules on Intel Xeon 6 Processors | Rohit Sehgal, Vishal Tanna, Vinicius Petrucci, Anil Godbole | 2024-12-17 | Download | High-Performance Computing (HPC) and Artificial Intelligence (AI) workloads typically demand substantial memory bandwidth and, to a degree, memory capacity. |