Skip to content

2025-08-14

cs.AR - Architecture

标题作者发布日期PDF摘要
THERMOS: Thermally-Aware Multi-Objective Scheduling of AI Workloads on Heterogeneous Multi-Chiplet PIM ArchitecturesAlish Kanani, Lukas Pfromm, Harsh Sharma, Janardhan Rao Doppa, Partha Pratim Pande, Umit Y. Ogras2025-08-14下载Chiplet-based integration enables large-scale systems that combine diverse technologies, enabling higher yield, lower costs, and scalability, making them well-suited to AI workloads.
AnalogSeeker: An Open-source Foundation Language Model for Analog Circuit DesignZihao Chen, Ji Zhuang, Jinyi Shen, Xiaoyue Ke, Xinyi Yang, Mingjie Zhou, Zhuoyao Du, Xu Yan, Zhouyang Wu, Zhenyu Xu, Jiangli Huang, Li Shang, Xuan Zeng, Fan Yang2025-08-14下载In this paper, we propose AnalogSeeker, an effort toward an open-source foundation language model for analog circuit design, with the aim of integrating domain knowledge and giving design assistance.
DiffAxE: Diffusion-driven Hardware Accelerator Generation and Design Space ExplorationArkapravo Ghosh, Abhishek Moitra, Abhiroop Bhattacharjee, Ruokai Yin, Priyadarshini Panda2025-08-14下载Design space exploration (DSE) is critical for developing optimized hardware architectures, especially for AI workloads such as deep neural networks (DNNs) and large language models (LLMs), which requ...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
EMLIO: Minimizing I/O Latency and Energy Consumption for Large-Scale AI TrainingHasibul Jamil, MD S Q Zulkar Nine, Tevfik Kosar2025-08-14下载Large-scale deep learning workloads increasingly suffer from I/O bottlenecks as datasets grow beyond local storage capacities and GPU compute outpaces network and disk latencies.
Minimmit: Fast Finality with Even Faster BlocksBrendan Kobayashi Chou, Andrew Lewis-Pye, Patrick O'Grady2025-08-14下载Achieving low-latency consensus in geographically distributed systems remains a key challenge for blockchain and distributed database applications.
Introducing CQ: A C-like API for Quantum Accelerated HPCOliver Thomson Brown, Mateusz Meller, James Richings2025-08-14下载In this paper we present CQ, a specification for a C-like API for quantum accelerated HPC, as well as CQ-SimBE, a reference implementation of CQ written in C99, and built on top of the statevector sim...
Dalek: An Unconventional and Energy-Aware Heterogeneous ClusterAdrien Cassagne, Noé Amiot, Manuel Bouyer2025-08-14下载Dalek is an experimental compute cluster designed to evaluate the performance of heterogeneous, consumer-grade hardware for software design, prototyping, and algorithm development.
Flexible Personalized Split Federated Learning for On-Device Fine-Tuning of Foundation ModelsTianjun Yuan, Jiaxiang Geng, Pengchao Han, Xianhao Chen, Bing Luo2025-08-14下载Fine-tuning foundation models is critical for superior performance on personalized downstream tasks, compared to using pre-trained models. Collaborative learning can leverage local clients' datasets f...
GPZ: GPU-Accelerated Lossy Compressor for Particle DataRuoyu Li, Yafan Huang, Longtao Zhang, Zhuoxun Yang, Sheng Di, Jiajun Huang, Jinyang Liu, Jiannan Tian, Xin Liang, Guanpeng Li, Hanqi Guo, Franck Cappello, Kai Zhao2025-08-14下载Particle-based simulations and point-cloud applications generate massive, irregular datasets that challenge storage, I/O, and real-time analytics.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Routing and Wavelength Assignment with Minimal Attack Radius for QKD NetworksMengyao Li, Qiaolun Zhang, Zongshuai Yang, Stefano Bregni, Alberto Gatto, Raouf Boutaba, Massimo Tornatore2025-08-14下载Quantum Key Distribution (QKD) can distribute keys with guaranteed security but remains susceptible to key exchange interruption due to physical-layer threats, such as high-power jamming attacks.
Balancing the Energy Consumption and Latency of Over-the-Air Firmware Updates in LoRaWANSiddhartha S. Borkotoky2025-08-14下载Over-the-air firmware updates are crucial for mitigating security threats and maintaining up-to-date device functionality in Long Range Wide Area Networks (LoRaWANs).
Federated Learning Over LoRa Networks: Simulator Design and Performance EvaluationAnshika Singh, Siddhartha S. Borkotoky2025-08-14下载Federated learning (FL) over long-range (LoRa) low-power wide area networks faces unique challenges due to limited bandwidth, interference, and strict duty-cycle constraints.
Probabilistic Latency Analysis of the Data Distribution Service in ROS 2Sanghoon Lee, Hyung-Seok Park, Jiyeong Chae, Kyung-Joon Park2025-08-14下载Robot Operating System 2 (ROS 2) is now the de facto standard for robotic communication, pairing UDP transport with the Data Distribution Service (DDS) publish-subscribe middleware.
Semantic Communication with Distribution Learning through Sequential ObservationsSamer Lahoud, Kinda Khawam2025-08-14下载Semantic communication aims to convey meaning rather than bit-perfect reproduction, representing a paradigm shift from traditional communication.
A Hierarchical IDS for Zero-Day Attack Detection in Internet of Medical Things NetworksMd Ashraf Uddin, Nam H. Chu, Reza Rafeh2025-08-14下载The Internet of Medical Things (IoMT) is driving a healthcare revolution but remains vulnerable to cyberattacks such as denial of service, ransomware, data hijacking, and spoofing.
Near-realtime Earth Observation Via Starlink LEO Satellite ConstellationBo Wu, David Tipper, Pengfei Zhou2025-08-14下载Earth observation (EO) satellites in Low Earth Orbit (LEO) are collecting vast amounts of data, which are invaluable for applications such as monitoring forest fires.
Design of a Timer Queue Supporting Dynamic Update OperationsZekun Wang, Binghao Yue, Weitao Pan, Jiangyi Shi, Yue Hao2025-08-14下载Large-scale timers are ubiquitous in network processing, including flow table entry expiration control in software defined network (SDN) switches, MAC address aging in Ethernet bridges, and retransmis...
Rethinking Reliability Using Network Coding: a Practical 5G EvaluationLaura Landon, Vipindev Adat Vasudevan, Junmo Sung, Muriel Médard2025-08-14下载This work presents the design and implementation of a real-time network coding system integrated into the IP layer of a 5G testbed, offering an alternative to conventional retransmission-based reliabi...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Leveraging OS-Level Primitives for Robotic Action ManagementWenxin Zheng, Boyang Li, Bin Xu, Erhu Feng, Jinyu Gu, Haibo Chen2025-08-14下载End-to-end imitation learning frameworks (e.g., VLA) are increasingly prominent in robotics, as they enable rapid task transfer by learning directly from perception to control, eliminating the need fo...

cs.PF - Performance

标题作者发布日期PDF摘要
Meta-Metrics and Best Practices for System-Level Inference Performance BenchmarkingShweta Salaria, Zhuoran Liu, Nelson Mimura Gonzalez2025-08-14下载Benchmarking inference performance (speed) of Foundation Models such as Large Language Models (LLM) involves navigating a vast experimental landscape to understand the complex interactions between har...

基于 VitePress 构建