Skip to content

2025-03-29

cs.AR - Architecture

标题作者发布日期PDF摘要
Concorde: Fast and Accurate CPU Performance Modeling with Compositional Analytical-ML FusionArash Nasr-Esfahany, Mohammad Alizadeh, Victor Lee, Hanna Alam, Brett W. Coon, David Culler, Vidushi Dadu, Martin Dixon, Henry M. Levy, Santosh Pandey, Parthasarathy Ranganathan, Amir Yazdanbakhsh2025-03-29下载Cycle-level simulators such as gem5 are widely used in microarchitecture design, but they are prohibitively slow for large-scale design space explorations.
Late Breaking Results: Breaking Symmetry- Unconventional Placement of Analog Circuits using Multi-Level Multi-Agent Reinforcement LearningSupriyo Maji, Linran Zhao, Souradip Poddar, David Z. Pan2025-03-29下载Layout-dependent effects (LDEs) significantly impact analog circuit performance. Traditionally, designers have relied on symmetric placement of circuit components to mitigate variations caused by LDEs...
SSM-RDU: A Reconfigurable Dataflow Unit for Long-Sequence State-Space ModelsSho Ko, Kunle Olukotun2025-03-29下载Long-sequence state-space models (SSMs) such as Hyena and Mamba replace the quadratic complexity of self-attention with more efficient FFT and scan operations.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
CAWAL: A novel unified analytics framework for enterprise web applications and multi-server environmentsÖzkan Canay, Ümit Kocabıçak2025-03-29下载In web analytics, cloud-based solutions have limitations in data ownership and privacy, whereas client-side user tracking tools face challenges such as data accuracy and a lack of server-side metrics.
Optimizing Distributed Training Approaches for Scaling Neural NetworksVishnu Vardhan Baligodugula, Fathi Amsaad2025-03-29下载This paper presents a comparative analysis of distributed training strategies for large-scale neural networks, focusing on data parallelism, model parallelism, and hybrid approaches.
Plug & Offload: Transparently Offloading TCP Stack onto Off-path SmartNIC with PnO-TCPHailong Nan, Zhe Zhou, Min Yang2025-03-29下载Host CPU resources are heavily consumed by TCP stack processing, limiting scalability in data centers. Existing offload methods typically address only partial functionality or lack flexibility.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
LAURA: LLM-Assisted UAV Routing for AoI MinimizationBisheng Wei, Ruichen Zhang, Ruihong Jiang, Mugen Peng, Dusit Niyato2025-03-29下载With the rapid growth of the low-altitude economy, there is increasing demand for real-time data collection using UAV-assisted wireless sensor networks.
Novel Closed Loop Control Mechanism for Zero Touch Networks using BiLSTM and Q-LearningTamizhelakkiya K, Dibakar Das, Jyotsna Bapat, Debabrata Das, Komal Sharma2025-03-29下载As networks advance toward the Sixth Generation (6G), management of high-speed and ubiquitous connectivity poses major challenges in meeting diverse Service Level Agreements (SLAs).
PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge InferenceGuanqiao Qu, Qian Chen, Xianhao Chen, Kaibin Huang, Yuguang Fang2025-03-29下载By provisioning inference offloading services, edge inference drives the rapid growth of AI applications at network edge. However, how to reduce the inference latency remains a significant challenge.
Globus Service Enhancements for Exascale Applications and FacilitiesWeijian Zheng, Jack Kordas, Tyler J. Skluzacek, Raj Kettimuthu, Ian Foster2025-03-29下载Many extreme-scale applications require the movement of large quantities of data to, from, and among leadership computing facilities, as well as other scientific facilities and the home institutions o...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Linux for Everyone: Can Standardization Drive Mainstream Adoption?Rohit J Nandha, Ronak D Patel2025-03-29下载Despite its technical superiority and flexibility, Linux remains a niche OS in the consumer markets. Because fragmentation stems from diverse distributions, it lacks the standardized experience, which...

cs.PF - Performance

标题作者发布日期PDF摘要
Concorde: Fast and Accurate CPU Performance Modeling with Compositional Analytical-ML FusionArash Nasr-Esfahany, Mohammad Alizadeh, Victor Lee, Hanna Alam, Brett W. Coon, David Culler, Vidushi Dadu, Martin Dixon, Henry M. Levy, Santosh Pandey, Parthasarathy Ranganathan, Amir Yazdanbakhsh2025-03-29下载Cycle-level simulators such as gem5 are widely used in microarchitecture design, but they are prohibitively slow for large-scale design space explorations.

基于 VitePress 构建