Skip to content

2025-09-16

cs.AR - Architecture

标题作者发布日期PDF摘要
MACO: A Multi-Agent LLM-Based Hardware/Software Co-Design Framework for CGRAsZesong Jiang, Yuqi Sun, Qing Zhong, Mahathi Krishna, Deepak Patil, Cheng Tan, Jeff Zhang2025-09-16下载Coarse-Grained Reconfigurable Arrays (CGRAs) offer high performance and energy efficiency across domains, yet design remains difficult due to a vast, interdependent space and costly manual iteration.
FVDebug: An LLM-Driven Debugging Assistant for Automated Root Cause Analysis of Formal Verification FailuresYunsheng Bai, Ghaith Bany Hamad, Chia-Tung Ho, Syed Suhaib, Haoxing Ren2025-09-16下载Debugging formal verification (FV) failures represents one of the most time-consuming bottlenecks in modern hardware design workflows. When properties fail, engineers must manually trace through compl...
Cognition Engines: A Row-Scale HVDC Architecture for Computational Continuity of AIPaul Churnock2025-09-16下载AI training creates synchronized, step-dominant surges with millisecond edges that destabilize constant-power loads (Choukse et al., 2025; arXiv:2508.14318).
Orthrus: Dual-Loop Automated Framework for System-Technology Co-OptimizationYi Ren, Baokang Peng, Chenhao Xue, Kairong Guo, Yukun Wang, Guoyao Cheng, Yibo Lin, Lining Zhang, Guangyu Sun2025-09-16下载With the diminishing return from Moore's Law, system-technology co-optimization (STCO) has emerged as a promising approach to sustain the scaling trends in the VLSI industry.
HPIM: Heterogeneous Processing-In-Memory-based Accelerator for Large Language Models InferenceCenlin Duan, Jianlei Yang, Rubing Yang, Yikun Wang, Yiou Wang, Lingkun Long, Yingjie Qi, Xiaolin He, Ao Zhou, Xueyan Wang, Weisheng Zhao2025-09-16下载The deployment of large language models (LLMs) presents significant challenges due to their enormous memory footprints, low arithmetic intensity, and stringent latency requirements, particularly durin...
A Scalable Architecture for Efficient Multi-bit Fully Homomorphic EncryptionJiaao Ma, Ceyu Xu, Lisa Wu Wills2025-09-16下载In the era of cloud computing, privacy-preserving computation offloading is crucial for safeguarding sensitive data. Fully Homomorphic Encryption (FHE) enables secure processing of encrypted data, but...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Modeling the Carbon Footprint of HPC: The Top 500 and EasyCVarsha Rao, Andrew A. Chien2025-09-16下载Climate change is a critical concern for HPC systems, but GHG protocol carbon-emission accounting methodologies are difficult for a single system, and effectively infeasible for a collection of system...
Testing and benchmarking emerging supercomputers via the MFC flow solverBenjamin Wilfong, Anand Radhakrishnan, Henry A. Le Berre, Tanush Prathi, Stephen Abbott, Spencer H. Bryngelson2025-09-16下载Deploying new supercomputers requires testing and evaluation via application codes. Portable, user-friendly tools enable evaluation, and the Multicomponent Flow Code (MFC), a computational fluid dynam...
AERIS: Argonne Earth Systems Model for Reliable and Skillful PredictionsVäinö Hatanpää, Eugene Ku, Jason Stock, Murali Emani, Sam Foreman, Chunyong Jung, Sandeep Madireddy, Tung Nguyen, Varuni Sastry, Ray A. O. Sinurat, Sam Wheeler, Huihuo Zheng, Troy Arcomano, Venkatram Vishwanath, Rao Kotamarthi2025-09-16下载Generative machine learning offers new opportunities to better understand complex Earth system dynamics. Recent diffusion-based methods address spectral biases and improve ensemble calibration in weat...
Scaling Up Throughput-oriented LLM Inference Applications on Heterogeneous Opportunistic GPU Clusters with Pervasive Context ManagementThanh Son Phung, Douglas Thain2025-09-16下载The widespread growth in LLM developments increasingly demands more computational power from clusters than what they can supply. Traditional LLM applications inherently require huge static resource al...
Space-Time Trade-off in Bounded Iterated MemoryGuillermo Toyos-Marfurt, Petr Kuznetsov2025-09-16下载The celebrated asynchronous computability theorem (ACT) characterizes tasks solvable in the read-write shared-memory model using the unbounded full-information protocol, where in every round of comput...
Analysis of the carbon footprint of HPCAbdessalam Benhari, Yves Denneulin, Frédéric Desprez, Fanny Dufossé, Denis Trystram2025-09-16下载The demand in computing power has never stopped growing over the years. Today, the performance of the most powerful systems exceeds the exascale.
Asymmetric Grid Quorum Systems for Heterogeneous ProcessesMichael Senn, Christian Cachin2025-09-16下载Quorum systems are a common way to formalize failure assumptions in distributed systems. Traditionally, these assumptions are shared by all involved processes.
Analysis and Optimization of Wireless Multimodal Federated Learning on Modal HeterogeneityXuefeng Han, Wen Chen, Jun Li, Ming Ding, Qingqing Wu, Kang Wei, Xiumei Deng, Yumeng Shao, Qiong Wu2025-09-16下载Multimodal federated learning (MFL) is a distributed framework for training multimodal models without uploading local multimodal data of clients, thereby effectively protecting client privacy.
Emergent complexity and rhythms in evoked and spontaneous dynamics of human whole-brain models after tuning through analysis toolsGianluca Gaglioti, Alessandra Cardinale, Cosimo Lupo, Thierry Nieus, Federico Marmoreo, Robin Gutzen, Michael Denker, Andrea Pigorini, Marcello Massimini, Simone Sarasso, Pier Stanislao Paolucci, Giulia De Bonis2025-09-16下载The simulation of whole-brain dynamics should reproduce realistic spontaneous and evoked neural activity across different scales, including emergent rhythms, spatio-temporal activation patterns, and m...
AI Factories: It's time to rethink the Cloud-HPC dividePedro Garcia Lopez, Daniel Barcelona Pons, Marcin Copik, Torsten Hoefler, Eduardo Quiñones, Maciej Malawski, Peter Pietzutch, Alberto Marti, Thomas Ohlson Timoudas, Aleksander Slominski2025-09-16下载The strategic importance of artificial intelligence is driving a global push toward Sovereign AI initiatives. Nationwide governments are increasingly developing dedicated infrastructures, called AI Fa...
Energy-Efficient Quantized Federated Learning for Resource-constrained IoT devicesWilfrid Sougrinoma Compaoré, Yaya Etiabi, El Mehdi Amhoud, Mohamad Assaad2025-09-16下载Federated Learning (FL) has emerged as a promising paradigm for enabling collaborative machine learning while preserving data privacy, making it particularly suitable for Internet of Things (IoT) envi...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Odin: Effective End-to-End SLA Decomposition for 5G/6G Network Slicing via Online LearningDuo Cheng, Ramanujan K Sheshadri, Ahan Kak, Nakjung Choi, Xingyu Zhou, Bo Ji2025-09-16下载Network slicing plays a crucial role in realizing 5G/6G advances, enabling diverse Service Level Agreement (SLA) requirements related to latency, throughput, and reliability.
GRU-Based Learning for the Identification of Congestion Protocols in TCP TrafficPaul Bergeron, Sandhya Aneja2025-09-16下载This paper presents the identification of congestion control protocols TCP Reno, TCP Cubic, TCP Vegas, and BBR on the Marist University campus, with an accuracy of 97.
It Takes a Village: Bridging the Gaps between Current and Formal Specifications for ProtocolsDavid Basin, Nate Foster, Kenneth L. McMillan, Kedar S. Namjoshi, Cristina Nita-Rotaru, Jonathan M. Smith, Pamela Zave, Lenore D. Zuck2025-09-16下载Formal specifications have numerous benefits for both designers and users of network protocols. They provide clear, unambiguous representations, which are useful as documentation and for testing.
State Aware Traffic Generation for Real-Time Network Digital TwinsEnes Koktas, Peter Rost2025-09-16下载Digital twins (DTs) enable smarter, self-optimizing mobile networks, but they rely on a steady supply of real world data. Collecting and transferring complete traces in real time is a significant chal...
Fine-Grained AI Model Caching and Downloading With Coordinated Multipoint Broadcasting in Multi-Cell Edge NetworksYang Fu, Peng Qin, Yueyue Zhang, Pao Cheng, Jun Lu, Yifei Wang2025-09-16下载6G networks are envisioned to support on-demand AI model downloading to accommodate diverse inference requirements of end users. By proactively caching models at edge nodes, users can retrieve the req...
Joint Channel Estimation and Computation Offloading in Fluid Antenna-assisted MEC NetworksYing Ju, Mingdong Li, Haoyu Wang, Lei Liu, Youyang Qu, Mianxiong Dong, Victor C. M. Leung, Chau Yuen2025-09-16下载With the emergence of fluid antenna (FA) in wireless communications, the capability to dynamically adjust port positions offers substantial benefits in spatial diversity and spectrum efficiency, which...
Joint AoI and Handover Optimization in Space-Air-Ground Integrated NetworkZifan Lang, Guixia Liu, Geng Sun, Jiahui Li, Jiacheng Wang, Weijie Yuan, Dusit Niyato, Dong In Kim2025-09-16下载Despite the widespread deployment of terrestrial networks, providing reliable communication services to remote areas and maintaining connectivity during emergencies remains challenging.
A Unified Learning-based Optimization Framework for 0-1 Mixed Problems in Wireless NetworksKairong Ma, Yao Sun, Shuheng Hua, Muhammad Ali Imran, Walid Saad2025-09-16下载Several wireless networking problems are often posed as 0-1 mixed optimization problems, which involve binary variables (e.g., selection of access points, channels, and tasks) and continuous variables...
Sustainable LSTM-Based Precoding for RIS-Aided mmWave MIMO Systems with Implicit CSIPo-Heng Chou, Jiun-Jia Wu, Wan-Jen Huang, Ronald Y. Chang2025-09-16下载In this paper, we propose a sustainable long short-term memory (LSTM)-based precoding framework for reconfigurable intelligent surface (RIS)-assisted millimeter-wave (mmWave) MIMO systems.
CattleSense - A Multisensory Approach to Optimize Cattle Well-BeingSrijesh Pillai, M. I. Jawid Nazir2025-09-16下载CattleSense is an innovative application of Internet of Things (IoT) technology for the comprehensive monitoring and management of cattle well-being.
Secure and Efficient Out-of-band Call Metadata TransmissionDavid Adei, Varun Madathil, Nithin Shyam S., Bradley Reaves2025-09-16下载The STIR/SHAKEN (S/S) attestation Framework mandated by the United States, Canada, and France to combat pervasive telephone abuse has not achieved its goals, partly because legacy non-VoIP infrastruct...

cs.PF - Performance

标题作者发布日期PDF摘要
Outperforming Dijkstra on Sparse Graphs: The Lightning Network Use CaseDanila Valko, Rohan Paranjpe, Jorge Marx Gómez2025-09-16下载Efficient routing is critical for payment channel networks (PCNs) such as the Lightning Network (LN), where most clients currently rely on Dijkstra-based algorithms for payment pathfinding.

基于 VitePress 构建