2025-08-27
cs.AR - Hardware Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Testing and Fault Tolerance Techniques for CNT-Based FPGAs | Siyuan Lu, Kangwei Xu, Peng Xie, Rui Wang, Yuanqing Cheng | 2025-08-27 | Download | As the semiconductor manufacturing process technology node shrinks into the nanometer-scale, the CMOS-based Field Programmable Gate Arrays (FPGAs) face big challenges in scalability of performance and... |
| SpeedMalloc: Improving Multi-threaded Applications via a Lightweight Core for Memory Allocation | Ruihao Li, Qinzhe Wu, Krishna Kavi, Gayatri Mehta, Jonathan C. Beard, Neeraja J. Yadwadkar, Lizy K. John | 2025-08-27 | Download | Memory allocation, though constituting only a small portion of the executed code, can have a "butterfly effect" on overall program performance, leading to significant and far-reaching impacts. |
| Large Language Models (LLMs) for Electronic Design Automation (EDA) | Kangwei Xu, Denis Schwachhofer, Jason Blocklove, Ilia Polian, Peter Domanski, Dirk Pflüger, Siddharth Garg, Ramesh Karri, Ozgur Sinanoglu, Johann Knechtel, Zhuorui Zhao, Ulf Schlichtmann, Bing Li | 2025-08-27 | Download | With the growing complexity of modern integrated circuits, hardware engineers are required to devote more effort to the full design-to-manufacturing workflow. |
| New Tools, Programming Models, and System Support for Processing-in-Memory Architectures | Geraldo F. Oliveira | 2025-08-27 | Download | Our goal in this dissertation is to provide tools, programming models, and system support for PIM architectures (with a focus on DRAM-based solutions), to ease the adoption of PIM in current and futur... |
| Exploration of Low-Power Flexible Stress Monitoring Classifiers for Conformal Wearables | Florentia Afentaki, Sri Sai Rakesh Nakkilla, Konstantinos Balaskas, Paula Carolina Lozano Duarte, Shiyi Jiang, Georgios Zervakis, Farshad Firouzi, Krishnendu Chakrabarty, Mehdi B. Tahoori | 2025-08-27 | Download | Conventional stress monitoring relies on episodic, symptom-focused interventions, missing the need for continuous, accessible, and cost-efficient solutions. |
| Demonstrator Testbed for Effective Precoding in MEO Multibeam Satellites | Jorge L. González-Rios, Liz Martínez Marrero, Juan Duncan, Luis M. Garcés-Socarrás, Raudel Cuiman Marquez, Juan A. Vásquez Peralvo, Jevgenij Krivochiza, Symeon Chatzinotas, Björn Ottersten | 2025-08-27 | Download | The use of communication satellites in medium Earth orbit (MEO) is foreseen to provide quasi-global broadband Internet connectivity in the coming networking ecosystems. |
| Support Vector Machines Classification on Bendable RISC-V | Polykarpos Vergos, Theofanis Vergos, Florentia Afentaki, Konstantinos Balaskas, Georgios Zervakis | 2025-08-27 | Download | Flexible Electronics (FE) technology offers unique characteristics in electronic manufacturing, providing ultra-low-cost, lightweight, and environmentally-friendly alternatives to traditional rigid elec... |
| When Routers, Switches and Interconnects Compute: A processing-in-interconnect Paradigm for Scalable Neuromorphic AI | Madhuvanthi Srivatsav, Chiranjib Bhattacharyya, Shantanu Chakrabartty, Chetan Singh Thakur | 2025-08-27 | Download | Routing, switching, and the interconnect fabric are essential components in implementing large-scale neuromorphic computing architectures. While this fabric plays only a supporting role in the process... |
| RARO: Reliability-aware Conversion with Enhanced Read Performance for QLC SSDs | Yanyun Wang, Dingcui Yu, Yina Lv, Yunpeng Song, Yumiao Zhao, Liang Shi | 2025-08-27 | Download | Quad-level cell (QLC) flash offers significant benefits in cost and capacity, but its limited reliability leads to frequent read retries, which severely degrade read performance. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Predictable LLM Serving on GPU Clusters | Erfan Darzi, Shreeanant Bharadwaj, Sree Bhargavi Balija | 2025-08-27 | Download | Latency-sensitive inference on shared A100 clusters often suffers noisy-neighbor interference on the PCIe fabric, inflating tail latency and SLO violations. |
| SwizzlePerf: Hardware-Aware LLMs for GPU Kernel Performance Optimization | Arya Tschand, Muhammad Awad, Ryan Swann, Kesavan Ramakrishnan, Jeffrey Ma, Keith Lowery, Ganesh Dasika, Vijay Janapa Reddi | 2025-08-27 | Download | Large language models (LLMs) have shown progress in GPU kernel performance engineering using inefficient search-based methods that optimize around runtime. |
| SpeedMalloc: Improving Multi-threaded Applications via a Lightweight Core for Memory Allocation | Ruihao Li, Qinzhe Wu, Krishna Kavi, Gayatri Mehta, Jonathan C. Beard, Neeraja J. Yadwadkar, Lizy K. John | 2025-08-27 | Download | Memory allocation, though constituting only a small portion of the executed code, can have a "butterfly effect" on overall program performance, leading to significant and far-reaching impacts. |
| HPC Digital Twins for Evaluating Scheduling Policies, Incentive Structures and their Impact on Power and Cooling | Matthias Maiterth, Wesley H. Brewer, Jaya S. Kuruvella, Arunavo Dey, Tanzima Z. Islam, Kevin Menear, Dmitry Duplyakin, Rashadul Kabir, Tapasya Patki, Terry Jones, Feiyi Wang | 2025-08-27 | Download | Schedulers are critical for optimal resource utilization in high-performance computing. Traditional methods to evaluate schedulers are limited to post-deployment analysis, or simulators, which do not ... |
| New Tools, Programming Models, and System Support for Processing-in-Memory Architectures | Geraldo F. Oliveira | 2025-08-27 | Download | Our goal in this dissertation is to provide tools, programming models, and system support for PIM architectures (with a focus on DRAM-based solutions), to ease the adoption of PIM in current and futur... |
| Beyond Pairwise Comparisons: Unveiling Structural Landscape of Mobile Robot Models | Shota Naito, Tsukasa Ninomiya, Koichi Wada | 2025-08-27 | Download | Understanding the computational power of mobile robot systems is a fundamental challenge in distributed computing. While prior work has focused on pairwise separations between models, we explore how r... |
| Beyond the Bermuda Triangle of Contention: IOMMU Interference in Mixed Criticality Systems | Diogo Costa, Jose Martins, Sandro Pinto | 2025-08-27 | Download | As Mixed Criticality Systems (MCSs) evolve, they increasingly integrate heterogeneous computing platforms, combining general-purpose processors with specialized accelerators such as AI engines, GPUs, ... |
| A Model-agnostic Strategy to Mitigate Embedding Degradation in Personalized Federated Recommendation | Jiakui Shen, Yunqi Mi, Guoshuai Zhao, Jialie Shen, Xueming Qian | 2025-08-27 | Download | Centralized recommender systems encounter privacy leakage due to the need to collect user behavior and other private data. Hence, federated recommender systems (FedRec) have become a promising approac... |
| IsoSched: Preemptive Tile Cascaded Scheduling of Multi-DNN via Subgraph Isomorphism | Boran Zhao, Zihang Yuan, Yanbin Hu, Haiming Zhai, Haoruo Zhang, Wenzhe Zhao, Tian Xia, Pengju Ren | 2025-08-27 | Download | Deploying deep neural network (DNN) accelerators with Layer Temporal Scheduling (LTS) often incurs significant overheads (e.g., energy and latency), as intermediate activations must be cached in DRAM. |
| Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference | Rongzhi Li, Ruogu Du, Zefang Chu, Sida Zhao, Chunlei Han, Zuocheng Shi, Yiwen Shao, Huanle Han, Long Huang, Zherui Liu, Shufan Liu | 2025-08-27 | Download | Serving Large Language Models (LLMs) is a GPU-intensive task where traditional autoscalers fall short, particularly for modern Prefill-Decode (P/D) disaggregated architectures. |
| Aegis: Taxonomy and Optimizations for Overcoming Agent-Environment Failures in LLM Agents | Kevin Song, Anand Jayarajan, Yaoyao Ding, Qidong Su, Zhanda Zhu, Sihang Liu, Gennady Pekhimenko | 2025-08-27 | Download | Large Language Models (LLMs) agents augmented with domain tools promise to autonomously execute complex tasks requiring human-level intelligence, such as customer service and digital assistance. |
| Towards 6G Intelligence: The Role of Generative AI in Future Wireless Networks | Muhammad Ahmed Mohsin, Junaid Ahmad, Muhammad Hamza Nawaz, Muhammad Ali Jamshed | 2025-08-27 | Download | Ambient intelligence (AmI) is a computing paradigm in which physical environments are embedded with sensing, computation, and communication so they can perceive people and context, decide appropriate ... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| DRR-MDPF: A Queue Management Strategy Based on Dynamic Resource Allocation and Markov Decision Process in Named Data Networking (NDN) | Fatemeh Roshanzadeh, Hamid Barati, Ali Barati | 2025-08-27 | Download | Named Data Networking (NDN) represents a transformative shift in network architecture, prioritizing content names over host addresses to enhance data dissemination. |
| The LLM as a Network Operator: A Vision for Generative AI in the 6G Radio Access Network | Oluwaseyi Giwa, Michael Adewole, Tobi Awodumila, Pelumi Aderinto | 2025-08-27 | Download | The management of future AI-native Next-Generation (NextG) Radio Access Networks (RANs), including 6G and beyond, presents a challenge of immense complexity that exceeds the capabilities of traditiona... |
| A Comprehensive Survey of 5G URLLC and Challenges in the 6G Era | Md. Emadul Haque, Faisal Tariq, Muhammad R A Khandaker, Md. Sakir Hossain, Muhammad Ali Imran, Kai-Kit Wong | 2025-08-27 | Download | As the wireless communication paradigm is being transformed from human centered communication services towards machine centered communication services, the requirements of rate, latency and reliabilit... |
| ML-MaxProp: Bridging Machine Learning and Delay-Tolerant Routing for Resilient Post-Disaster Communication | Tao Xiuyuan, Milena Radenkovic | 2025-08-27 | Download | In disaster-stricken and large-scale urban emergency scenarios, ensuring reliable communication remains a formidable challenge, as collapsed infrastructure, unpredictable mobility, and severely constr... |
| A First Look at Inter-Cell Interference in the Wild | Daqian Ding, Yibo Pi, Cailian Chen | 2025-08-27 | Download | In cellular networks, inter-cell interference management has been studied for decades, yet its real-world effectiveness remains under-explored. |
| 2SYN: Congestion-Aware Multihoming | Kfir Toledo, Isaac Keslassy | 2025-08-27 | Download | When sending flows to arbitrary destinations, current multihoming routers adopt simple congestion-oblivious mechanisms. Therefore, they cannot avoid congested paths. |
| Secure Multi-LLM Agentic AI and Agentification for Edge General Intelligence by Zero-Trust: A Survey | Yinqiu Liu, Ruichen Zhang, Haoxiang Luo, Yijing Lin, Geng Sun, Dusit Niyato, Hongyang Du, Zehui Xiong, Yonggang Wen, Abbas Jamalipour, Dong In Kim, Ping Zhang | 2025-08-27 | Download | Agentification serves as a critical enabler of Edge General Intelligence (EGI), transforming massive edge devices into cognitive agents through integrating Large Language Models (LLMs) and perception,... |
| Experimental Insights from OpenAirInterface 5G positioning Testbeds: Challenges and solutions | Mohsen Ahadi, Adeel Malik, Omid Esrafilian, Florian Kaltenberger, Cedric Thienot | 2025-08-27 | Download | 5G New Radio (NR) is a key enabler of accurate positioning in smart cities and smart factories. This paper presents the experimental results from three 5G positioning testbeds running open-source Open... |
| A Dynamic Service Offloading Algorithm Based on Lyapunov Optimization in Edge Computing | Peiyan Yuan, Ming Li, Chenyang Wang, Ledong An, Xiaoyan Zhao, Junna Zhang, Xiangyang Li, Huadong Ma | 2025-08-27 | Download | This study investigates the trade-off between system stability and offloading cost in collaborative edge computing. While collaborative offloading among multiple edge servers enhances resource utiliza... |
| When Routers, Switches and Interconnects Compute: A processing-in-interconnect Paradigm for Scalable Neuromorphic AI | Madhuvanthi Srivatsav, Chiranjib Bhattacharyya, Shantanu Chakrabartty, Chetan Singh Thakur | 2025-08-27 | Download | Routing, switching, and the interconnect fabric are essential components in implementing large-scale neuromorphic computing architectures. While this fabric plays only a supporting role in the process... |