2025-04-29

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Iceberg Beyond the Tip: Co-Compilation of a Quantum Error Detection Code and a Quantum Algorithm	Yuwei Jin, Zichang He, Tianyi Hao, David Amaro, Swamit Tannu, Ruslan Shaydulin, Marco Pistoia	2025-04-29	下载	The rapid progress in quantum hardware is expected to make them viable tools for the study of quantum algorithms in the near term. The timeline to useful algorithmic experimentation can be accelerated...
MCMComm: Hardware-Software Co-Optimization for End-to-End Communication in Multi-Chip-Modules	Ritik Raj, Shengjie Lin, William Won, Tushar Krishna	2025-04-29	下载	Increasing AI computing demands and slowing transistor scaling have led to the advent of Multi-Chip-Module (MCMs) based accelerators. MCMs enable cost-effective scalability, higher yield, and modular ...
STAMP-2.5D: Structural and Thermal Aware Methodology for Placement in 2.5D Integration	Varun Darshana Parekh, Zachary Wyatt Hazenstab, Srivatsa Rangachar Srinivasa, Krishnendu Chakrabarty, Kai Ni, Vijaykrishnan Narayanan	2025-04-29	下载	Chiplet-based architectures and advanced packaging has emerged as transformative approaches in semiconductor design. While conventional physical design for 2.
OneDSE: A Unified Microprocessor Metric Prediction and Design Space Exploration Framework	Ritik Raj, Akshat Ramachandran, Jeff Nye, Shashank Nemawarkar, Tushar Krishna	2025-04-29	下载	With the slowing of Moores Law and increasing impact of power constraints, processor designs rely on architectural innovation to achieve differentiating performance.
DejaVuzz: Disclosing Transient Execution Bugs with Dynamic Swappable Memory and Differential Information Flow Tracking assisted Processor Fuzzing	Jinyan Xu, Yangye Zhou, Xingzhi Zhang, Yinshuai Li, Qinhan Tan, Yinqian Zhang, Yajin Zhou, Rui Chang, Wenbo Shen	2025-04-29	下载	Transient execution vulnerabilities have emerged as a critical threat to modern processors. Hardware fuzzing testing techniques have recently shown promising results in discovering transient execution...
Overcoming Quadratic Hardware Scaling for a Fully Connected Digital Oscillatory Neural Network	Bram Haverkort, Aida Todri-Sanial	2025-04-29	下载	Computing with coupled oscillators or oscillatory neural networks (ONNs) has recently attracted a lot of interest due to their potential for massive parallelism and energy-efficient computing.
Nonlinear Computation with Linear Optics via Source-Position Encoding	N. Richardson, C. Bosch, R. P. Adams	2025-04-29	下载	Optical computing systems provide an alternate hardware model which appears to be aligned with the demands of neural network workloads. However, the challenge of implementing energy efficient nonlinea...
DEER: Deep Runahead for Instruction Prefetching on Modern Mobile Workloads	Parmida Vahdatniya, Julian Humecki, Henry Kao, Tony Li, Ali Sedaghati, Fang Su, Ruoyu Zhou, Alex Bi, Reza Azimi, Maziar Goudarzi	2025-04-29	下载	Mobile workloads incur heavy frontend stalls due to increasingly large code footprints as well as long repeat cycles. Existing instruction-prefetching techniques suffer from low coverage, poor timelin...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
FedHERO: A Federated Learning Approach for Node Classification Task on Heterophilic Graphs	Zihan Chen, Xingbo Fu, Yushun Dong, Jundong Li, Cong Shen	2025-04-29	下载	Federated Graph Learning (FGL) empowers clients to collaboratively train Graph neural networks (GNNs) in a distributed manner while preserving data privacy.
Federated One-Shot Learning with Data Privacy and Objective-Hiding	Maximilian Egger, Rüdiger Urbanke, Rawad Bitar	2025-04-29	下载	Privacy in federated learning is crucial, encompassing two key aspects: safeguarding the privacy of clients' data and maintaining the privacy of the federator's objective from the clients.
Hubs and Spokes Learning: Efficient and Scalable Collaborative Machine Learning	Atul Sharma, Kavindu Herath, Saurabh Bagchi, Chaoyue Liu, Somali Chaterji	2025-04-29	下载	We introduce the Hubs and Spokes Learning (HSL) framework, a novel paradigm for collaborative machine learning that combines the strengths of Federated Learning (FL) and Decentralized Learning (P2PL).
Predicting the Performance of Scientific Workflow Tasks for Cluster Resource Management: An Overview of the State of the Art	Jonathan Bader, Kathleen West, Soeren Becker, Svetlana Kulagina, Fabian Lehmann, Lauritz Thamsen, Henning Meyerhenke, Odej Kao	2025-04-29	下载	Scientific workflow management systems support large-scale data analysis on cluster infrastructures. For this, they interact with resource managers which schedule workflow tasks onto cluster nodes.
Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine Learning	Jinsun Yoo, ChonLam Lao, Lianjie Cao, Bob Lantz, Minlan Yu, Tushar Krishna, Puneet Sharma	2025-04-29	下载	This paper lays the foundation for Genie, a testing framework that captures the impact of real hardware network behavior on ML workload performance, without requiring expensive GPUs.
Cost-Effective Edge Data Distribution with End-To-End Delay Guarantees in Edge Computing	Ravi Shankar, Aryabartta Sahu	2025-04-29	下载	Cloud Computing is the delivery of computing resources which includes servers, storage, databases, networking, software, analytics, and intelligence over the internet to offer faster innovation, flexi...
AI-Based Crypto Tokens: The Illusion of Decentralized AI?	Rischan Mafrur	2025-04-29	下载	The convergence of blockchain and artificial intelligence (AI) has led to the emergence of AI-based tokens, which are cryptographic assets designed to power decentralized AI platforms and services.
Formal and Empirical Study of Metadata-Based Profiling for Resource Management in the Computing Continuum	Andrea Morichetta, Stefan Nastic, Victor Casamayor Pujol, Schahram Dustdar	2025-04-29	下载	We present and formalize a general approach for profiling workload by leveraging only a priori available static metadata to supply appropriate resource needs.
EDD-NSTE: Edge Data Distribution as a Network Steiner Tree Estimation in Edge Computing	Ravi Shankar, Aryabartta Sahu	2025-04-29	下载	Edge computing is a distributed computing paradigm that brings computation and data storage closer to the user's geographical location to improve response times and save bandwidth.
Intelligent Task Offloading in VANETs: A Hybrid AI-Driven Approach for Low-Latency and Energy Efficiency	Tariq Qayyum, Asadullah Tariq, Muhammad Ali, Mohamed Adel Serhani, Zouheir Trabelsi, Maite López-Sánchez	2025-04-29	下载	Vehicular Ad-hoc Networks (VANETs) are integral to intelligent transportation systems, enabling vehicles to offload computational tasks to nearby roadside units (RSUs) and mobile edge computing (MEC) ...
Efficient patient-centric EMR sharing block tree	Xiaohan Hu, Jyoti Sahni, Colin R. Simpson, Normalia Samian, Winston K. G. Seah	2025-04-29	下载	Flexible sharing of electronic medical records (EMRs) is an urgent need in healthcare, as fragmented storage creates EMR management complexity for both practitioners and patients.
Hetu v2: A General and Scalable Deep Learning System with Hierarchical and Heterogeneous Single Program Multiple Data Annotations	Haoyang Li, Fangcheng Fu, Hao Ge, Sheng Lin, Xuanyu Wang, Jiawen Niu, Xupeng Miao, Bin Cui	2025-04-29	下载	The Single Program Multiple Data (SPMD) paradigm provides a unified abstraction to annotate various parallel dimensions in distributed deep learning (DL) training.
Efficient Graph-Based Approximate Nearest Neighbor Search Achieving: Low Latency Without Throughput Loss	Jingjia Luo, Mingxing Zhang, Kang Chen, Xia Liao, Yingdi Shan, Jinlei Jiang, Yongwei Wu	2025-04-29	下载	The increase in the dimensionality of neural embedding models has enhanced the accuracy of semantic search capabilities but also amplified the computational demands for Approximate Nearest Neighbor Se...
CloudQC: A Network-aware Framework for Multi-tenant Distributed Quantum Computing	Ruilin Zhou, Yuhang Gan, Yi Liu, Chen Qian	2025-04-29	下载	Distributed quantum computing (DQC) that allows a large quantum circuit to be executed simultaneously on multiple quantum processing units (QPUs) becomes a promising approach to increase the scalabili...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Green Satellite Networks Using Segment Routing and Software-Defined Networking	Jintao Liang, Pablo G. Madoery, Chung-Horng Lung, Halim Yanikomeroglu, Gunes Karabulut Kurt	2025-04-29	下载	This paper presents a comprehensive evaluation of network performance in software defined networking (SDN)-based low Earth orbit (LEO) satellite networks, focusing on the Telesat Lightspeed constellat...
Flexible Semantic-Aware Resource Allocation: Serving More Users Through Similarity Range Constraints	Nasrin Gholami, Neda Moghim, Behrouz Shahgholi Ghahfarokhi, Pouyan Salavati, Christo Kurisummoottil Thomas, Sachin Shetty, Tahereh Rahmati	2025-04-29	下载	Semantic communication (SemCom) aims to enhance the resource efficiency of next-generation networks by transmitting the underlying meaning of messages, focusing on information relevant to the end user...
Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine Learning	Jinsun Yoo, ChonLam Lao, Lianjie Cao, Bob Lantz, Minlan Yu, Tushar Krishna, Puneet Sharma	2025-04-29	下载	This paper lays the foundation for Genie, a testing framework that captures the impact of real hardware network behavior on ML workload performance, without requiring expensive GPUs.
did:self A registry-less DID method	Nikos Fotiou, George C. Polyzos, Vasilios A. Siris	2025-04-29	下载	We introduce did:self, a Decentralized Identifier (DID) method that does not depend on any trusted registry for storing the corresponding DID documents.
MACH: Multi-Agent Coordination for RSU-centric Handovers	Nikolaus Spring, Andrea Morichetta, Boris Sedlak, Schahram Dustdar	2025-04-29	下载	This paper introduces MACH, a novel approach for optimizing task handover in vehicular computing scenarios. To ensure fast and latency-aware placement of tasks, the decision-making -- where and when s...
Handling Large-Scale Network Flow Records: A Comparative Study on Lossy Compression	Gabriele Merlach, Damiano Ravalico, Martino Trevisan, Fabio Palmese, Giovanni Baccichet, Alessandro E. C. Redondi	2025-04-29	下载	Flow records, that summarize the characteristics of traffic flows, represent a practical and powerful way to monitor a network. While they already offer significant compression compared to full packet...
WakeLoc: An Ultra-Low Power, Accurate and Scalable On-Demand RTLS using Wake-Up Radios	Silvano Cortesi, Christian Vogt, Michele Magno	2025-04-29	下载	For future large scale robotic moon missions, the availability of infrastructure-less, cheap and low power real-time locating systems (RTLSs) is critical.
Fiber to the Room: Key Technologies, Challenges, and Prospects	Jinhan Cai, Xiaolong Zhang, Xiang Wang, Tianhai Chang, Gangxiang Shen	2025-04-29	下载	Fiber to the Room (FTTR) is a next-generation access network designed to deliver high bandwidth, low latency, and room-level optical coverage.
VA-CDH: A Variance-Aware Method to Optimize Latency for Caching with Delayed Hits	Bowen Jiang, Chaofan Ma, Duo Wang	2025-04-29	下载	Caches are fundamental to latency-sensitive systems like Content Delivery Networks (CDNs) and Mobile Edge Computing (MEC). However, the delayed hit phenomenon where multiple requests for an object occ...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification	Shangyu Li, Juyong Jiang, Tiancheng Zhao, Jiasi Shen	2025-04-29	下载	We introduce OSVBench, a new benchmark for evaluating Large Language Models (LLMs) on the task of generating complete formal specifications for verifying the functional correctness of operating system...
CrashFixer: A crash resolution agent for the Linux kernel	Alex Mathai, Chenxi Huang, Suwei Ma, Jihwan Kim, Hailie Mitchell, Aleksandr Nogikh, Petros Maniatis, Franjo Ivančić, Junfeng Yang, Baishakhi Ray	2025-04-29	下载	Code large language models (LLMs) have shown impressive capabilities on a multitude of software engineering tasks. In particular, they have demonstrated remarkable utility in the task of code repair.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Information Retrieval in the Age of Generative AI: The RGB Model	Michele Garetto, Alessandro Cornacchia, Franco Galante, Emilio Leonardi, Alessandro Nordio, Alberto Tarable	2025-04-29	下载	The advent of Large Language Models (LLMs) and generative AI is fundamentally transforming information retrieval and processing on the Internet, bringing both great potential and significant concerns ...
DEER: Deep Runahead for Instruction Prefetching on Modern Mobile Workloads	Parmida Vahdatniya, Julian Humecki, Henry Kao, Tony Li, Ali Sedaghati, Fang Su, Ruoyu Zhou, Alex Bi, Reza Azimi, Maziar Goudarzi	2025-04-29	下载	Mobile workloads incur heavy frontend stalls due to increasingly large code footprints as well as long repeat cycles. Existing instruction-prefetching techniques suffer from low coverage, poor timelin...
CarbonCall: Sustainability-Aware Function Calling for Large Language Models on Edge Devices	Varatheepan Paramanayakam, Andreas Karatzas, Iraklis Anagnostopoulos, Dimitrios Stamoulis	2025-04-29	下载	Large Language Models (LLMs) enable real-time function calling in edge AI systems but introduce significant computational overhead, leading to high power consumption and carbon emissions.