Skip to content

2025-08-27

cs.AR - Architecture

标题作者发布日期PDF摘要
Testing and Fault Tolerance Techniques for CNT-Based FPGAsSiyuan Lu, Kangwei Xu, Peng Xie, Rui Wang, Yuanqing Cheng2025-08-27下载As the semiconductor manufacturing process technology node shrinks into the nanometer-scale, the CMOS-based Field Programmable Gate Arrays (FPGAs) face big challenges in scalability of performance and...
SpeedMalloc: Improving Multi-threaded Applications via a Lightweight Core for Memory AllocationRuihao Li, Qinzhe Wu, Krishna Kavi, Gayatri Mehta, Jonathan C. Beard, Neeraja J. Yadwadkar, Lizy K. John2025-08-27下载Memory allocation, though constituting only a small portion of the executed code, can have a "butterfly effect" on overall program performance, leading to significant and far-reaching impacts.
Large Language Models (LLMs) for Electronic Design Automation (EDA)Kangwei Xu, Denis Schwachhofer, Jason Blocklove, Ilia Polian, Peter Domanski, Dirk Pflüger, Siddharth Garg, Ramesh Karri, Ozgur Sinanoglu, Johann Knechtel, Zhuorui Zhao, Ulf Schlichtmann, Bing Li2025-08-27下载With the growing complexity of modern integrated circuits, hardware engineers are required to devote more effort to the full design-to-manufacturing workflow.
New Tools, Programming Models, and System Support for Processing-in-Memory ArchitecturesGeraldo F. Oliveira2025-08-27下载Our goal in this dissertation is to provide tools, programming models, and system support for PIM architectures (with a focus on DRAM-based solutions), to ease the adoption of PIM in current and futur...
Exploration of Low-Power Flexible Stress Monitoring Classifiers for Conformal WearablesFlorentia Afentaki, Sri Sai Rakesh Nakkilla, Konstantinos Balaskas, Paula Carolina Lozano Duarte, Shiyi Jiang, Georgios Zervakis, Farshad Firouzi, Krishnendu Chakrabarty, Mehdi B. Tahoori2025-08-27下载Conventional stress monitoring relies on episodic, symptom-focused interventions, missing the need for continuous, accessible, and cost-efficient solutions.
Demonstrator Testbed for Effective Precoding in MEO Multibeam SatellitesJorge L. González-Rios, Liz Martínez Marrero, Juan Duncan, Luis M. Garcés-Socarrás, Raudel Cuiman Marquez, Juan A. Vásquez Peralvo, Jevgenij Krivochiza, Symeon Chatzinotas, Björn Ottersten2025-08-27下载The use of communication satellites in medium Earth orbit (MEO) is foreseen to provide quasi-global broadband Internet connectivity in the coming networking ecosystems.
Support Vector Machines Classification on Bendable RISC-VPolykarpos Vergos, Theofanis Vergos, Florentia Afentaki, Konstantinos Balaskas, Georgios Zervakis2025-08-27下载Flexible Electronics (FE) technology offers uniquecharacteristics in electronic manufacturing, providing ultra-low-cost, lightweight, and environmentally-friendly alternatives totraditional rigid elec...
When Routers, Switches and Interconnects Compute: A processing-in-interconnect Paradigm for Scalable Neuromorphic AIMadhuvanthi Srivatsav, Chiranjib Bhattacharyya, Shantanu Chakrabartty, Chetan Singh Thakur2025-08-27下载Routing, switching, and the interconnect fabric are essential components in implementing large-scale neuromorphic computing architectures. While this fabric plays only a supporting role in the process...
RARO: Reliability-aware Conversion with Enhanced Read Performance for QLC SSDsYanyun Wang, Dingcui Yu, Yina Lv, Yunpeng Song, Yumiao Zhao, Liang Shi2025-08-27下载Quad-level cell (QLC) flash offers significant benefits in cost and capacity, but its limited reliability leads to frequent read retries, which severely degrade read performance.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Predictable LLM Serving on GPU ClustersErfan Darzi, Shreeanant Bharadwaj, Sree Bhargavi Balija2025-08-27下载Latency-sensitive inference on shared A100 clusters often suffers noisy-neighbor interference on the PCIe fabric, inflating tail latency and SLO violations.
SwizzlePerf: Hardware-Aware LLMs for GPU Kernel Performance OptimizationArya Tschand, Muhammad Awad, Ryan Swann, Kesavan Ramakrishnan, Jeffrey Ma, Keith Lowery, Ganesh Dasika, Vijay Janapa Reddi2025-08-27下载Large language models (LLMs) have shown progress in GPU kernel performance engineering using inefficient search-based methods that optimize around runtime.
SpeedMalloc: Improving Multi-threaded Applications via a Lightweight Core for Memory AllocationRuihao Li, Qinzhe Wu, Krishna Kavi, Gayatri Mehta, Jonathan C. Beard, Neeraja J. Yadwadkar, Lizy K. John2025-08-27下载Memory allocation, though constituting only a small portion of the executed code, can have a "butterfly effect" on overall program performance, leading to significant and far-reaching impacts.
HPC Digital Twins for Evaluating Scheduling Policies, Incentive Structures and their Impact on Power and CoolingMatthias Maiterth, Wesley H. Brewer, Jaya S. Kuruvella, Arunavo Dey, Tanzima Z. Islam, Kevin Menear, Dmitry Duplyakin, Rashadul Kabir, Tapasya Patki, Terry Jones, Feiyi Wang2025-08-27下载Schedulers are critical for optimal resource utilization in high-performance computing. Traditional methods to evaluate schedulers are limited to post-deployment analysis, or simulators, which do not ...
New Tools, Programming Models, and System Support for Processing-in-Memory ArchitecturesGeraldo F. Oliveira2025-08-27下载Our goal in this dissertation is to provide tools, programming models, and system support for PIM architectures (with a focus on DRAM-based solutions), to ease the adoption of PIM in current and futur...
Beyond Pairwise Comparisons: Unveiling Structural Landscape of Mobile Robot ModelsShota Naito, Tsukasa Ninomiya, Koichi Wada2025-08-27下载Understanding the computational power of mobile robot systems is a fundamental challenge in distributed computing. While prior work has focused on pairwise separations between models, we explore how r...
Beyond the Bermuda Triangle of Contention: IOMMU Interference in Mixed Criticality SystemsDiogo Costa, Jose Martins, Sandro Pinto2025-08-27下载As Mixed Criticality Systems (MCSs) evolve, they increasingly integrate heterogeneous computing platforms, combining general-purpose processors with specialized accelerators such as AI engines, GPUs, ...
A Model-agnostic Strategy to Mitigate Embedding Degradation in Personalized Federated RecommendationJiakui Shen, Yunqi Mi, Guoshuai Zhao, Jialie Shen, Xueming Qian2025-08-27下载Centralized recommender systems encounter privacy leakage due to the need to collect user behavior and other private data. Hence, federated recommender systems (FedRec) have become a promising approac...
IsoSched: Preemptive Tile Cascaded Scheduling of Multi-DNN via Subgraph IsomorphismBoran Zhao, Zihang Yuan, Yanbin Hu, Haiming Zhai, Haoruo Zhang, Wenzhe Zhao, Tian Xia, Pengju Ren2025-08-27下载Deploying deep neural network (DNN) accelerators with Layer Temporal Scheduling (LTS) often incurs significant overheads (e.g., energy and latency), as intermediate activations must be cached in DRAM.
Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM InferenceRongzhi Li, Ruogu Du, Zefang Chu, Sida Zhao, Chunlei Han, Zuocheng Shi, Yiwen Shao, Huanle Han, Long Huang, Zherui Liu, Shufan Liu2025-08-27下载Serving Large Language Models (LLMs) is a GPU-intensive task where traditional autoscalers fall short, particularly for modern Prefill-Decode (P/D) disaggregated architectures.
Aegis: Taxonomy and Optimizations for Overcoming Agent-Environment Failures in LLM AgentsKevin Song, Anand Jayarajan, Yaoyao Ding, Qidong Su, Zhanda Zhu, Sihang Liu, Gennady Pekhimenko2025-08-27下载Large Language Models (LLMs) agents augmented with domain tools promise to autonomously execute complex tasks requiring human-level intelligence, such as customer service and digital assistance.
Towards 6G Intelligence: The Role of Generative AI in Future Wireless NetworksMuhammad Ahmed Mohsin, Junaid Ahmad, Muhammad Hamza Nawaz, Muhammad Ali Jamshed2025-08-27下载Ambient intelligence (AmI) is a computing paradigm in which physical environments are embedded with sensing, computation, and communication so they can perceive people and context, decide appropriate ...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
DRR-MDPF: A Queue Management Strategy Based on Dynamic Resource Allocation and Markov Decision Process in Named Data Networking (NDN)Fatemeh Roshanzadeh, Hamid Barati, Ali Barati2025-08-27下载Named Data Networking (NDN) represents a transformative shift in network architecture, prioritizing content names over host addresses to enhance data dissemination.
The LLM as a Network Operator: A Vision for Generative AI in the 6G Radio Access NetworkOluwaseyi Giwa, Michael Adewole, Tobi Awodumila, Pelumi Aderinto2025-08-27下载The management of future AI-native Next-Generation (NextG) Radio Access Networks (RANs), including 6G and beyond, presents a challenge of immense complexity that exceeds the capabilities of traditiona...
A Comprehensive Survey of 5G URLLC and Challenges in the 6G EraMd. Emadul Haque, Faisal Tariq, Muhammad R A Khandaker, Md. Sakir Hossain, Muhammad Ali Imran, Kai-Kit Wong2025-08-27下载As the wireless communication paradigm is being transformed from human centered communication services towards machine centered communication services, the requirements of rate, latency and reliabilit...
ML-MaxProp: Bridging Machine Learning and Delay-Tolerant Routing for Resilient Post-Disaster CommunicationTao Xiuyuan, Milena Radenkovic2025-08-27下载In disaster-stricken and large-scale urban emergency scenarios, ensuring reliable communication remains a formidable challenge, as collapsed infrastructure, unpredictable mobility, and severely constr...
A First Look at Inter-Cell Interference in the WildDaqian Ding, Yibo Pi, Cailian Chen2025-08-27下载In cellular networks, inter-cell interference management has been studied for decades, yet its real-world effectiveness remains under-explored.
2SYN: Congestion-Aware MultihomingKfir Toledo, Isaac Keslassy2025-08-27下载When sending flows to arbitrary destinations, current multihoming routers adopt simple congestion-oblivious mechanisms. Therefore, they cannot avoid congested paths.
Secure Multi-LLM Agentic AI and Agentification for Edge General Intelligence by Zero-Trust: A SurveyYinqiu Liu, Ruichen Zhang, Haoxiang Luo, Yijing Lin, Geng Sun, Dusit Niyato, Hongyang Du, Zehui Xiong, Yonggang Wen, Abbas Jamalipour, Dong In Kim, Ping Zhang2025-08-27下载Agentification serves as a critical enabler of Edge General Intelligence (EGI), transforming massive edge devices into cognitive agents through integrating Large Language Models (LLMs) and perception,...
Experimental Insights from OpenAirInterface 5G positioning Testbeds: Challenges and solutionsMohsen Ahadi, Adeel Malik, Omid Esrafilian, Florian Kaltenberger, Cedric Thienot2025-08-27下载5G New Radio (NR) is a key enabler of accurate positioning in smart cities and smart factories. This paper presents the experimental results from three 5G positioning testbeds running open-source Open...
A Dynamic Service Offloading Algorithm Based on Lyapunov Optimization in Edge ComputingPeiyan Yuan, Ming Li, Chenyang Wang, Ledong An, Xiaoyan Zhao, Junna Zhang, Xiangyang Li, Huadong Ma2025-08-27下载This study investigates the trade-off between system stability and offloading cost in collaborative edge computing. While collaborative offloading among multiple edge servers enhances resource utiliza...
When Routers, Switches and Interconnects Compute: A processing-in-interconnect Paradigm for Scalable Neuromorphic AIMadhuvanthi Srivatsav, Chiranjib Bhattacharyya, Shantanu Chakrabartty, Chetan Singh Thakur2025-08-27下载Routing, switching, and the interconnect fabric are essential components in implementing large-scale neuromorphic computing architectures. While this fabric plays only a supporting role in the process...

基于 VitePress 构建