Skip to content

2025-05-10

cs.AR - Architecture

标题作者发布日期PDF摘要
Regular mixed-radix DFT matrix factorization for in-place FFT acceleratorsSergey Salishev2025-05-10下载The generic vector memory based accelerator is considered which supports DIT and DIF FFT with fixed datapath. The regular mixed-radix factorization of the DFT matrix coherent with the accelerator arch...
Modeling PFAS in Semiconductor Manufacturing to Quantify Trade-offs in Energy Efficiency and Environmental Impact of Computing SystemsMariam Elgamal, Abdulrahman Mahmoud, Gu-Yeon Wei, David Brooks, Gage Hills2025-05-10下载The electronics and semiconductor industry is a prominent consumer of per- and poly-fluoroalkyl substances (PFAS), also known as forever chemicals.
Extend IVerilog to Support Batch RTL Fault SimulationJiaping Tang, Jianan Mu, Zizhen Liu, Zhiteng Chao, Jing Ye, Huawei Li2025-05-10下载The advancement of functional safety has made RTL-level fault simulation increasingly important to achieve iterative efficiency in the early stages of design and to ensure compliance with functional s...
CaMDN: Enhancing Cache Efficiency for Multi-tenant DNNs on Integrated NPUsTianhao Cai, Liang Wang, Limin Xiao, Meng Han, Zeyu Wang, Lin Sun, Xiaojian Liao2025-05-10下载With the rapid development of DNN applications, multi-tenant execution, where multiple DNNs are co-located on a single SoC, is becoming a prevailing trend.
FlexNeRFer: A Multi-Dataflow, Adaptive Sparsity-Aware Accelerator for On-Device NeRF RenderingSeock-Hwan Noh, Banseok Shin, Jeik Choi, Seungpyo Lee, Jaeha Kung, Yeseong Kim2025-05-10下载Neural Radiance Fields (NeRF), an AI-driven approach for 3D view reconstruction, has demonstrated impressive performance, sparking active research across fields.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Privacy-aware Berrut Approximated Coded Computing applied to general distributed learningXavier Martínez-Luaña, Manuel Fernández-Veiga, Rebeca P. Díaz-Redondo, Ana Fernández-Vilas2025-05-10下载Coded computing is one of the techniques that can be used for privacy protection in Federated Learning. However, most of the constructions used for coded computing work only under the assumption that ...
Online Job Scheduler for Fault-tolerant Quantum MultiprogrammingShin Nishio, Ryo Wakizaka, Daisuke Sakuma, Yosuke Ueno, Yasunari Suzuki2025-05-10下载Fault-tolerant quantum computers are expected to be offered as cloud services due to their significant resource and infrastructure requirements.
Regular mixed-radix DFT matrix factorization for in-place FFT acceleratorsSergey Salishev2025-05-10下载The generic vector memory based accelerator is considered which supports DIT and DIF FFT with fixed datapath. The regular mixed-radix factorization of the DFT matrix coherent with the accelerator arch...
SneakPeek: Data-Aware Model Selection and Scheduling for Inference Serving on the EdgeJoel Wolfrath, Daniel Frink, Abhishek Chandra2025-05-10下载Modern applications increasingly rely on inference serving systems to provide low-latency insights with a diverse set of machine learning models.
Deterministic Self-Stabilizing BFS Construction in Constant SpaceLélia Blin, Franck Petit, Sébastien Tixeuil2025-05-10下载In this paper, we resolve a long-standing question in self-stabilization by demonstrating that it is indeed possible to construct a spanning tree in a semi-uniform network using constant memory per no...
Data Version Management and Machine-Actionable Reproducibility for HPCAndreas Knüpfer, Timothy J. Callow2025-05-10下载We present a solution for research data version control and machine-actionable reproducibility of data processing for High Performance Computing (HPC) environments and the SLURM batch scheduler.
TierBase: A Workload-Driven Cost-Optimized Key-Value StoreZhitao Shen, Shiyu Yang, Weibo Chen, Kunming Wang, Yue Li, Jiabao Jin, Wei Jia, Junwei Chen, Yuan Su, Xiaoxia Duan, Wei Chen, Lei Wang, Jie Song, Ruoyi Ruan, Xuemin Lin2025-05-10下载In the current era of data-intensive applications, the demand for high-performance, cost-effective storage solutions is paramount. This paper introduces a Space-Performance Cost Model for key-value st...
QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime ReconfigurationHamidReza Imani, Jiaxin Peng, Peiman Mohseni, Abdolah Amirany, Tarek El-Ghazawi2025-05-10下载The deployment of mixture-of-experts (MoE) large language models (LLMs) presents significant challenges due to their high memory demands. These challenges become even more pronounced in multi-tenant e...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Improving 5G/B5G Network Performance with RFID-Enabled Resource Management SystemsStella N. Arinze, Halima I. Kure, Augustine O. Nwajana2025-05-10下载In the rapidly evolving landscape of 5G and B5G (beyond 5G) networks, efficient resource optimization is critical to addressing the escalating demands for high-speed, low-latency, and energy efficient...
Distributionally Robust Contract Theory for Edge AIGC Services in TeleoperationZijun Zhan, Yaxian Dong, Daniel Mawunyo Doe, Yuqing Hu, Shuai Li, Shaohua Cao, Lei Fan, Zhu Han2025-05-10下载Advanced AI-Generated Content (AIGC) technologies have injected new impetus into teleoperation, further enhancing its security and efficiency.
Efficient Telecom Specific LLM: TSLAM-Mini with QLoRA and Digital Twin DataVignesh Ethiraj, Divya Vijay, Sidhanth Menon, Heblin Berscilla2025-05-10下载General-purpose large language models (LLMs), despite their broad capabilities accrued from open-world data, frequently exhibit suboptimal performance when confronted with the nuanced and specialized ...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Work-in-Progress: Multi-Deadline DAG Scheduling Model for Autonomous Driving SystemsAtsushi Yano, Takuya Azumi2025-05-10下载Autoware is an autonomous driving system implemented on Robot Operation System (ROS) 2, where an end-to-end timing guarantee is crucial to ensure safety.
Online Job Scheduler for Fault-tolerant Quantum MultiprogrammingShin Nishio, Ryo Wakizaka, Daisuke Sakuma, Yosuke Ueno, Yasunari Suzuki2025-05-10下载Fault-tolerant quantum computers are expected to be offered as cloud services due to their significant resource and infrastructure requirements.
RTOS Architectures that Solve the Diminishing Bandwidth ProblemMazen Arakji2025-05-10下载The Diminishing Bandwidth Problem is a long standing, previously unidentified, extensibility problem of current real-time operating systems characterized by a superficial dependency between the number...
CaMDN: Enhancing Cache Efficiency for Multi-tenant DNNs on Integrated NPUsTianhao Cai, Liang Wang, Limin Xiao, Meng Han, Zeyu Wang, Lin Sun, Xiaojian Liao2025-05-10下载With the rapid development of DNN applications, multi-tenant execution, where multiple DNNs are co-located on a single SoC, is becoming a prevailing trend.
Work in Progress: Middleware-Transparent Callback Enforcement in Commoditized Component-Oriented Real-time SystemsTakahiro Ishikawa-Aso, Atsushi Yano, Takuya Azumi, Shinpei Kato2025-05-10下载Real-time scheduling in commoditized component-oriented real-time systems, such as ROS 2 systems on Linux, has been studied under nested scheduling: OS thread scheduling and middleware layer schedulin...

cs.PF - Performance

标题作者发布日期PDF摘要
8 Years of Optimizing Apache Otava: How disconnected open source developers took an algorithm from n3 to constant timeHenrik Ingo2025-05-10下载As the project now known as Apache Otava (incubating) makes it first release, we look back over the past 8 years that the codebase was developed by a rather uncoordinated, loosely connected group of p...

基于 VitePress 构建