2025-05-10

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Regular mixed-radix DFT matrix factorization for in-place FFT accelerators	Sergey Salishev	2025-05-10	下载	The generic vector memory based accelerator is considered which supports DIT and DIF FFT with fixed datapath. The regular mixed-radix factorization of the DFT matrix coherent with the accelerator arch...
Modeling PFAS in Semiconductor Manufacturing to Quantify Trade-offs in Energy Efficiency and Environmental Impact of Computing Systems	Mariam Elgamal, Abdulrahman Mahmoud, Gu-Yeon Wei, David Brooks, Gage Hills	2025-05-10	下载	The electronics and semiconductor industry is a prominent consumer of per- and poly-fluoroalkyl substances (PFAS), also known as forever chemicals.
Extend IVerilog to Support Batch RTL Fault Simulation	Jiaping Tang, Jianan Mu, Zizhen Liu, Zhiteng Chao, Jing Ye, Huawei Li	2025-05-10	下载	The advancement of functional safety has made RTL-level fault simulation increasingly important to achieve iterative efficiency in the early stages of design and to ensure compliance with functional s...
CaMDN: Enhancing Cache Efficiency for Multi-tenant DNNs on Integrated NPUs	Tianhao Cai, Liang Wang, Limin Xiao, Meng Han, Zeyu Wang, Lin Sun, Xiaojian Liao	2025-05-10	下载	With the rapid development of DNN applications, multi-tenant execution, where multiple DNNs are co-located on a single SoC, is becoming a prevailing trend.
FlexNeRFer: A Multi-Dataflow, Adaptive Sparsity-Aware Accelerator for On-Device NeRF Rendering	Seock-Hwan Noh, Banseok Shin, Jeik Choi, Seungpyo Lee, Jaeha Kung, Yeseong Kim	2025-05-10	下载	Neural Radiance Fields (NeRF), an AI-driven approach for 3D view reconstruction, has demonstrated impressive performance, sparking active research across fields.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Privacy-aware Berrut Approximated Coded Computing applied to general distributed learning	Xavier Martínez-Luaña, Manuel Fernández-Veiga, Rebeca P. Díaz-Redondo, Ana Fernández-Vilas	2025-05-10	下载	Coded computing is one of the techniques that can be used for privacy protection in Federated Learning. However, most of the constructions used for coded computing work only under the assumption that ...
Online Job Scheduler for Fault-tolerant Quantum Multiprogramming	Shin Nishio, Ryo Wakizaka, Daisuke Sakuma, Yosuke Ueno, Yasunari Suzuki	2025-05-10	下载	Fault-tolerant quantum computers are expected to be offered as cloud services due to their significant resource and infrastructure requirements.
Regular mixed-radix DFT matrix factorization for in-place FFT accelerators	Sergey Salishev	2025-05-10	下载	The generic vector memory based accelerator is considered which supports DIT and DIF FFT with fixed datapath. The regular mixed-radix factorization of the DFT matrix coherent with the accelerator arch...
SneakPeek: Data-Aware Model Selection and Scheduling for Inference Serving on the Edge	Joel Wolfrath, Daniel Frink, Abhishek Chandra	2025-05-10	下载	Modern applications increasingly rely on inference serving systems to provide low-latency insights with a diverse set of machine learning models.
Deterministic Self-Stabilizing BFS Construction in Constant Space	Lélia Blin, Franck Petit, Sébastien Tixeuil	2025-05-10	下载	In this paper, we resolve a long-standing question in self-stabilization by demonstrating that it is indeed possible to construct a spanning tree in a semi-uniform network using constant memory per no...
Data Version Management and Machine-Actionable Reproducibility for HPC	Andreas Knüpfer, Timothy J. Callow	2025-05-10	下载	We present a solution for research data version control and machine-actionable reproducibility of data processing for High Performance Computing (HPC) environments and the SLURM batch scheduler.
TierBase: A Workload-Driven Cost-Optimized Key-Value Store	Zhitao Shen, Shiyu Yang, Weibo Chen, Kunming Wang, Yue Li, Jiabao Jin, Wei Jia, Junwei Chen, Yuan Su, Xiaoxia Duan, Wei Chen, Lei Wang, Jie Song, Ruoyi Ruan, Xuemin Lin	2025-05-10	下载	In the current era of data-intensive applications, the demand for high-performance, cost-effective storage solutions is paramount. This paper introduces a Space-Performance Cost Model for key-value st...
QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration	HamidReza Imani, Jiaxin Peng, Peiman Mohseni, Abdolah Amirany, Tarek El-Ghazawi	2025-05-10	下载	The deployment of mixture-of-experts (MoE) large language models (LLMs) presents significant challenges due to their high memory demands. These challenges become even more pronounced in multi-tenant e...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Improving 5G/B5G Network Performance with RFID-Enabled Resource Management Systems	Stella N. Arinze, Halima I. Kure, Augustine O. Nwajana	2025-05-10	下载	In the rapidly evolving landscape of 5G and B5G (beyond 5G) networks, efficient resource optimization is critical to addressing the escalating demands for high-speed, low-latency, and energy efficient...
Distributionally Robust Contract Theory for Edge AIGC Services in Teleoperation	Zijun Zhan, Yaxian Dong, Daniel Mawunyo Doe, Yuqing Hu, Shuai Li, Shaohua Cao, Lei Fan, Zhu Han	2025-05-10	下载	Advanced AI-Generated Content (AIGC) technologies have injected new impetus into teleoperation, further enhancing its security and efficiency.
Efficient Telecom Specific LLM: TSLAM-Mini with QLoRA and Digital Twin Data	Vignesh Ethiraj, Divya Vijay, Sidhanth Menon, Heblin Berscilla	2025-05-10	下载	General-purpose large language models (LLMs), despite their broad capabilities accrued from open-world data, frequently exhibit suboptimal performance when confronted with the nuanced and specialized ...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Work-in-Progress: Multi-Deadline DAG Scheduling Model for Autonomous Driving Systems	Atsushi Yano, Takuya Azumi	2025-05-10	下载	Autoware is an autonomous driving system implemented on Robot Operation System (ROS) 2, where an end-to-end timing guarantee is crucial to ensure safety.
Online Job Scheduler for Fault-tolerant Quantum Multiprogramming	Shin Nishio, Ryo Wakizaka, Daisuke Sakuma, Yosuke Ueno, Yasunari Suzuki	2025-05-10	下载	Fault-tolerant quantum computers are expected to be offered as cloud services due to their significant resource and infrastructure requirements.
RTOS Architectures that Solve the Diminishing Bandwidth Problem	Mazen Arakji	2025-05-10	下载	The Diminishing Bandwidth Problem is a long standing, previously unidentified, extensibility problem of current real-time operating systems characterized by a superficial dependency between the number...
CaMDN: Enhancing Cache Efficiency for Multi-tenant DNNs on Integrated NPUs	Tianhao Cai, Liang Wang, Limin Xiao, Meng Han, Zeyu Wang, Lin Sun, Xiaojian Liao	2025-05-10	下载	With the rapid development of DNN applications, multi-tenant execution, where multiple DNNs are co-located on a single SoC, is becoming a prevailing trend.
Work in Progress: Middleware-Transparent Callback Enforcement in Commoditized Component-Oriented Real-time Systems	Takahiro Ishikawa-Aso, Atsushi Yano, Takuya Azumi, Shinpei Kato	2025-05-10	下载	Real-time scheduling in commoditized component-oriented real-time systems, such as ROS 2 systems on Linux, has been studied under nested scheduling: OS thread scheduling and middleware layer schedulin...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
8 Years of Optimizing Apache Otava: How disconnected open source developers took an algorithm from n3 to constant time	Henrik Ingo	2025-05-10	下载	As the project now known as Apache Otava (incubating) makes it first release, we look back over the past 8 years that the codebase was developed by a rather uncoordinated, loosely connected group of p...