2025-08-27

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
Testing and Fault Tolerance Techniques for CNT-Based FPGAs	Siyuan Lu, Kangwei Xu, Peng Xie, Rui Wang, Yuanqing Cheng	2025-08-27	Download	As the semiconductor manufacturing process technology node shrinks into the nanometer-scale, the CMOS-based Field Programmable Gate Arrays (FPGAs) face big challenges in scalability of performance and...
SpeedMalloc: Improving Multi-threaded Applications via a Lightweight Core for Memory Allocation	Ruihao Li, Qinzhe Wu, Krishna Kavi, Gayatri Mehta, Jonathan C. Beard, Neeraja J. Yadwadkar, Lizy K. John	2025-08-27	Download	Memory allocation, though constituting only a small portion of the executed code, can have a "butterfly effect" on overall program performance, leading to significant and far-reaching impacts.
Large Language Models (LLMs) for Electronic Design Automation (EDA)	Kangwei Xu, Denis Schwachhofer, Jason Blocklove, Ilia Polian, Peter Domanski, Dirk Pflüger, Siddharth Garg, Ramesh Karri, Ozgur Sinanoglu, Johann Knechtel, Zhuorui Zhao, Ulf Schlichtmann, Bing Li	2025-08-27	Download	With the growing complexity of modern integrated circuits, hardware engineers are required to devote more effort to the full design-to-manufacturing workflow.
New Tools, Programming Models, and System Support for Processing-in-Memory Architectures	Geraldo F. Oliveira	2025-08-27	Download	Our goal in this dissertation is to provide tools, programming models, and system support for PIM architectures (with a focus on DRAM-based solutions), to ease the adoption of PIM in current and futur...
Exploration of Low-Power Flexible Stress Monitoring Classifiers for Conformal Wearables	Florentia Afentaki, Sri Sai Rakesh Nakkilla, Konstantinos Balaskas, Paula Carolina Lozano Duarte, Shiyi Jiang, Georgios Zervakis, Farshad Firouzi, Krishnendu Chakrabarty, Mehdi B. Tahoori	2025-08-27	Download	Conventional stress monitoring relies on episodic, symptom-focused interventions, missing the need for continuous, accessible, and cost-efficient solutions.
Demonstrator Testbed for Effective Precoding in MEO Multibeam Satellites	Jorge L. González-Rios, Liz Martínez Marrero, Juan Duncan, Luis M. Garcés-Socarrás, Raudel Cuiman Marquez, Juan A. Vásquez Peralvo, Jevgenij Krivochiza, Symeon Chatzinotas, Björn Ottersten	2025-08-27	Download	The use of communication satellites in medium Earth orbit (MEO) is foreseen to provide quasi-global broadband Internet connectivity in the coming networking ecosystems.
Support Vector Machines Classification on Bendable RISC-V	Polykarpos Vergos, Theofanis Vergos, Florentia Afentaki, Konstantinos Balaskas, Georgios Zervakis	2025-08-27	Download	Flexible Electronics (FE) technology offers unique characteristics in electronic manufacturing, providing ultra-low-cost, lightweight, and environmentally-friendly alternatives to traditional rigid elec...
When Routers, Switches and Interconnects Compute: A processing-in-interconnect Paradigm for Scalable Neuromorphic AI	Madhuvanthi Srivatsav, Chiranjib Bhattacharyya, Shantanu Chakrabartty, Chetan Singh Thakur	2025-08-27	Download	Routing, switching, and the interconnect fabric are essential components in implementing large-scale neuromorphic computing architectures. While this fabric plays only a supporting role in the process...
RARO: Reliability-aware Conversion with Enhanced Read Performance for QLC SSDs	Yanyun Wang, Dingcui Yu, Yina Lv, Yunpeng Song, Yumiao Zhao, Liang Shi	2025-08-27	Download	Quad-level cell (QLC) flash offers significant benefits in cost and capacity, but its limited reliability leads to frequent read retries, which severely degrade read performance.

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Predictable LLM Serving on GPU Clusters	Erfan Darzi, Shreeanant Bharadwaj, Sree Bhargavi Balija	2025-08-27	Download	Latency-sensitive inference on shared A100 clusters often suffers noisy-neighbor interference on the PCIe fabric, inflating tail latency and SLO violations.
SwizzlePerf: Hardware-Aware LLMs for GPU Kernel Performance Optimization	Arya Tschand, Muhammad Awad, Ryan Swann, Kesavan Ramakrishnan, Jeffrey Ma, Keith Lowery, Ganesh Dasika, Vijay Janapa Reddi	2025-08-27	Download	Large language models (LLMs) have shown progress in GPU kernel performance engineering using inefficient search-based methods that optimize around runtime.
SpeedMalloc: Improving Multi-threaded Applications via a Lightweight Core for Memory Allocation	Ruihao Li, Qinzhe Wu, Krishna Kavi, Gayatri Mehta, Jonathan C. Beard, Neeraja J. Yadwadkar, Lizy K. John	2025-08-27	Download	Memory allocation, though constituting only a small portion of the executed code, can have a "butterfly effect" on overall program performance, leading to significant and far-reaching impacts.
HPC Digital Twins for Evaluating Scheduling Policies, Incentive Structures and their Impact on Power and Cooling	Matthias Maiterth, Wesley H. Brewer, Jaya S. Kuruvella, Arunavo Dey, Tanzima Z. Islam, Kevin Menear, Dmitry Duplyakin, Rashadul Kabir, Tapasya Patki, Terry Jones, Feiyi Wang	2025-08-27	Download	Schedulers are critical for optimal resource utilization in high-performance computing. Traditional methods to evaluate schedulers are limited to post-deployment analysis, or simulators, which do not ...
New Tools, Programming Models, and System Support for Processing-in-Memory Architectures	Geraldo F. Oliveira	2025-08-27	Download	Our goal in this dissertation is to provide tools, programming models, and system support for PIM architectures (with a focus on DRAM-based solutions), to ease the adoption of PIM in current and futur...
Beyond Pairwise Comparisons: Unveiling Structural Landscape of Mobile Robot Models	Shota Naito, Tsukasa Ninomiya, Koichi Wada	2025-08-27	Download	Understanding the computational power of mobile robot systems is a fundamental challenge in distributed computing. While prior work has focused on pairwise separations between models, we explore how r...
Beyond the Bermuda Triangle of Contention: IOMMU Interference in Mixed Criticality Systems	Diogo Costa, Jose Martins, Sandro Pinto	2025-08-27	Download	As Mixed Criticality Systems (MCSs) evolve, they increasingly integrate heterogeneous computing platforms, combining general-purpose processors with specialized accelerators such as AI engines, GPUs, ...
A Model-agnostic Strategy to Mitigate Embedding Degradation in Personalized Federated Recommendation	Jiakui Shen, Yunqi Mi, Guoshuai Zhao, Jialie Shen, Xueming Qian	2025-08-27	Download	Centralized recommender systems encounter privacy leakage due to the need to collect user behavior and other private data. Hence, federated recommender systems (FedRec) have become a promising approac...
IsoSched: Preemptive Tile Cascaded Scheduling of Multi-DNN via Subgraph Isomorphism	Boran Zhao, Zihang Yuan, Yanbin Hu, Haiming Zhai, Haoruo Zhang, Wenzhe Zhao, Tian Xia, Pengju Ren	2025-08-27	Download	Deploying deep neural network (DNN) accelerators with Layer Temporal Scheduling (LTS) often incurs significant overheads (e.g., energy and latency), as intermediate activations must be cached in DRAM.
Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference	Rongzhi Li, Ruogu Du, Zefang Chu, Sida Zhao, Chunlei Han, Zuocheng Shi, Yiwen Shao, Huanle Han, Long Huang, Zherui Liu, Shufan Liu	2025-08-27	Download	Serving Large Language Models (LLMs) is a GPU-intensive task where traditional autoscalers fall short, particularly for modern Prefill-Decode (P/D) disaggregated architectures.
Aegis: Taxonomy and Optimizations for Overcoming Agent-Environment Failures in LLM Agents	Kevin Song, Anand Jayarajan, Yaoyao Ding, Qidong Su, Zhanda Zhu, Sihang Liu, Gennady Pekhimenko	2025-08-27	Download	Large Language Model (LLM) agents augmented with domain tools promise to autonomously execute complex tasks requiring human-level intelligence, such as customer service and digital assistance.
Towards 6G Intelligence: The Role of Generative AI in Future Wireless Networks	Muhammad Ahmed Mohsin, Junaid Ahmad, Muhammad Hamza Nawaz, Muhammad Ali Jamshed	2025-08-27	Download	Ambient intelligence (AmI) is a computing paradigm in which physical environments are embedded with sensing, computation, and communication so they can perceive people and context, decide appropriate ...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
DRR-MDPF: A Queue Management Strategy Based on Dynamic Resource Allocation and Markov Decision Process in Named Data Networking (NDN)	Fatemeh Roshanzadeh, Hamid Barati, Ali Barati	2025-08-27	Download	Named Data Networking (NDN) represents a transformative shift in network architecture, prioritizing content names over host addresses to enhance data dissemination.
The LLM as a Network Operator: A Vision for Generative AI in the 6G Radio Access Network	Oluwaseyi Giwa, Michael Adewole, Tobi Awodumila, Pelumi Aderinto	2025-08-27	Download	The management of future AI-native Next-Generation (NextG) Radio Access Networks (RANs), including 6G and beyond, presents a challenge of immense complexity that exceeds the capabilities of traditiona...
A Comprehensive Survey of 5G URLLC and Challenges in the 6G Era	Md. Emadul Haque, Faisal Tariq, Muhammad R A Khandaker, Md. Sakir Hossain, Muhammad Ali Imran, Kai-Kit Wong	2025-08-27	Download	As the wireless communication paradigm is being transformed from human centered communication services towards machine centered communication services, the requirements of rate, latency and reliabilit...
ML-MaxProp: Bridging Machine Learning and Delay-Tolerant Routing for Resilient Post-Disaster Communication	Tao Xiuyuan, Milena Radenkovic	2025-08-27	Download	In disaster-stricken and large-scale urban emergency scenarios, ensuring reliable communication remains a formidable challenge, as collapsed infrastructure, unpredictable mobility, and severely constr...
A First Look at Inter-Cell Interference in the Wild	Daqian Ding, Yibo Pi, Cailian Chen	2025-08-27	Download	In cellular networks, inter-cell interference management has been studied for decades, yet its real-world effectiveness remains under-explored.
2SYN: Congestion-Aware Multihoming	Kfir Toledo, Isaac Keslassy	2025-08-27	Download	When sending flows to arbitrary destinations, current multihoming routers adopt simple congestion-oblivious mechanisms. Therefore, they cannot avoid congested paths.
Secure Multi-LLM Agentic AI and Agentification for Edge General Intelligence by Zero-Trust: A Survey	Yinqiu Liu, Ruichen Zhang, Haoxiang Luo, Yijing Lin, Geng Sun, Dusit Niyato, Hongyang Du, Zehui Xiong, Yonggang Wen, Abbas Jamalipour, Dong In Kim, Ping Zhang	2025-08-27	Download	Agentification serves as a critical enabler of Edge General Intelligence (EGI), transforming massive edge devices into cognitive agents through integrating Large Language Models (LLMs) and perception,...
Experimental Insights from OpenAirInterface 5G positioning Testbeds: Challenges and solutions	Mohsen Ahadi, Adeel Malik, Omid Esrafilian, Florian Kaltenberger, Cedric Thienot	2025-08-27	Download	5G New Radio (NR) is a key enabler of accurate positioning in smart cities and smart factories. This paper presents the experimental results from three 5G positioning testbeds running open-source Open...
A Dynamic Service Offloading Algorithm Based on Lyapunov Optimization in Edge Computing	Peiyan Yuan, Ming Li, Chenyang Wang, Ledong An, Xiaoyan Zhao, Junna Zhang, Xiangyang Li, Huadong Ma	2025-08-27	Download	This study investigates the trade-off between system stability and offloading cost in collaborative edge computing. While collaborative offloading among multiple edge servers enhances resource utiliza...
When Routers, Switches and Interconnects Compute: A processing-in-interconnect Paradigm for Scalable Neuromorphic AI	Madhuvanthi Srivatsav, Chiranjib Bhattacharyya, Shantanu Chakrabartty, Chetan Singh Thakur	2025-08-27	Download	Routing, switching, and the interconnect fabric are essential components in implementing large-scale neuromorphic computing architectures. While this fabric plays only a supporting role in the process...