Skip to content

2025-04-29

cs.AR - Architecture

标题作者发布日期PDF摘要
Iceberg Beyond the Tip: Co-Compilation of a Quantum Error Detection Code and a Quantum AlgorithmYuwei Jin, Zichang He, Tianyi Hao, David Amaro, Swamit Tannu, Ruslan Shaydulin, Marco Pistoia2025-04-29下载The rapid progress in quantum hardware is expected to make them viable tools for the study of quantum algorithms in the near term. The timeline to useful algorithmic experimentation can be accelerated...
MCMComm: Hardware-Software Co-Optimization for End-to-End Communication in Multi-Chip-ModulesRitik Raj, Shengjie Lin, William Won, Tushar Krishna2025-04-29下载Increasing AI computing demands and slowing transistor scaling have led to the advent of Multi-Chip-Module (MCMs) based accelerators. MCMs enable cost-effective scalability, higher yield, and modular ...
STAMP-2.5D: Structural and Thermal Aware Methodology for Placement in 2.5D IntegrationVarun Darshana Parekh, Zachary Wyatt Hazenstab, Srivatsa Rangachar Srinivasa, Krishnendu Chakrabarty, Kai Ni, Vijaykrishnan Narayanan2025-04-29下载Chiplet-based architectures and advanced packaging has emerged as transformative approaches in semiconductor design. While conventional physical design for 2.
OneDSE: A Unified Microprocessor Metric Prediction and Design Space Exploration FrameworkRitik Raj, Akshat Ramachandran, Jeff Nye, Shashank Nemawarkar, Tushar Krishna2025-04-29下载With the slowing of Moores Law and increasing impact of power constraints, processor designs rely on architectural innovation to achieve differentiating performance.
DejaVuzz: Disclosing Transient Execution Bugs with Dynamic Swappable Memory and Differential Information Flow Tracking assisted Processor FuzzingJinyan Xu, Yangye Zhou, Xingzhi Zhang, Yinshuai Li, Qinhan Tan, Yinqian Zhang, Yajin Zhou, Rui Chang, Wenbo Shen2025-04-29下载Transient execution vulnerabilities have emerged as a critical threat to modern processors. Hardware fuzzing testing techniques have recently shown promising results in discovering transient execution...
Overcoming Quadratic Hardware Scaling for a Fully Connected Digital Oscillatory Neural NetworkBram Haverkort, Aida Todri-Sanial2025-04-29下载Computing with coupled oscillators or oscillatory neural networks (ONNs) has recently attracted a lot of interest due to their potential for massive parallelism and energy-efficient computing.
Nonlinear Computation with Linear Optics via Source-Position EncodingN. Richardson, C. Bosch, R. P. Adams2025-04-29下载Optical computing systems provide an alternate hardware model which appears to be aligned with the demands of neural network workloads. However, the challenge of implementing energy efficient nonlinea...
DEER: Deep Runahead for Instruction Prefetching on Modern Mobile WorkloadsParmida Vahdatniya, Julian Humecki, Henry Kao, Tony Li, Ali Sedaghati, Fang Su, Ruoyu Zhou, Alex Bi, Reza Azimi, Maziar Goudarzi2025-04-29下载Mobile workloads incur heavy frontend stalls due to increasingly large code footprints as well as long repeat cycles. Existing instruction-prefetching techniques suffer from low coverage, poor timelin...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
FedHERO: A Federated Learning Approach for Node Classification Task on Heterophilic GraphsZihan Chen, Xingbo Fu, Yushun Dong, Jundong Li, Cong Shen2025-04-29下载Federated Graph Learning (FGL) empowers clients to collaboratively train Graph neural networks (GNNs) in a distributed manner while preserving data privacy.
Federated One-Shot Learning with Data Privacy and Objective-HidingMaximilian Egger, Rüdiger Urbanke, Rawad Bitar2025-04-29下载Privacy in federated learning is crucial, encompassing two key aspects: safeguarding the privacy of clients' data and maintaining the privacy of the federator's objective from the clients.
Hubs and Spokes Learning: Efficient and Scalable Collaborative Machine LearningAtul Sharma, Kavindu Herath, Saurabh Bagchi, Chaoyue Liu, Somali Chaterji2025-04-29下载We introduce the Hubs and Spokes Learning (HSL) framework, a novel paradigm for collaborative machine learning that combines the strengths of Federated Learning (FL) and Decentralized Learning (P2PL).
Predicting the Performance of Scientific Workflow Tasks for Cluster Resource Management: An Overview of the State of the ArtJonathan Bader, Kathleen West, Soeren Becker, Svetlana Kulagina, Fabian Lehmann, Lauritz Thamsen, Henning Meyerhenke, Odej Kao2025-04-29下载Scientific workflow management systems support large-scale data analysis on cluster infrastructures. For this, they interact with resource managers which schedule workflow tasks onto cluster nodes.
Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine LearningJinsun Yoo, ChonLam Lao, Lianjie Cao, Bob Lantz, Minlan Yu, Tushar Krishna, Puneet Sharma2025-04-29下载This paper lays the foundation for Genie, a testing framework that captures the impact of real hardware network behavior on ML workload performance, without requiring expensive GPUs.
Cost-Effective Edge Data Distribution with End-To-End Delay Guarantees in Edge ComputingRavi Shankar, Aryabartta Sahu2025-04-29下载Cloud Computing is the delivery of computing resources which includes servers, storage, databases, networking, software, analytics, and intelligence over the internet to offer faster innovation, flexi...
AI-Based Crypto Tokens: The Illusion of Decentralized AI?Rischan Mafrur2025-04-29下载The convergence of blockchain and artificial intelligence (AI) has led to the emergence of AI-based tokens, which are cryptographic assets designed to power decentralized AI platforms and services.
Formal and Empirical Study of Metadata-Based Profiling for Resource Management in the Computing ContinuumAndrea Morichetta, Stefan Nastic, Victor Casamayor Pujol, Schahram Dustdar2025-04-29下载We present and formalize a general approach for profiling workload by leveraging only a priori available static metadata to supply appropriate resource needs.
EDD-NSTE: Edge Data Distribution as a Network Steiner Tree Estimation in Edge ComputingRavi Shankar, Aryabartta Sahu2025-04-29下载Edge computing is a distributed computing paradigm that brings computation and data storage closer to the user's geographical location to improve response times and save bandwidth.
Intelligent Task Offloading in VANETs: A Hybrid AI-Driven Approach for Low-Latency and Energy EfficiencyTariq Qayyum, Asadullah Tariq, Muhammad Ali, Mohamed Adel Serhani, Zouheir Trabelsi, Maite López-Sánchez2025-04-29下载Vehicular Ad-hoc Networks (VANETs) are integral to intelligent transportation systems, enabling vehicles to offload computational tasks to nearby roadside units (RSUs) and mobile edge computing (MEC) ...
Efficient patient-centric EMR sharing block treeXiaohan Hu, Jyoti Sahni, Colin R. Simpson, Normalia Samian, Winston K. G. Seah2025-04-29下载Flexible sharing of electronic medical records (EMRs) is an urgent need in healthcare, as fragmented storage creates EMR management complexity for both practitioners and patients.
Hetu v2: A General and Scalable Deep Learning System with Hierarchical and Heterogeneous Single Program Multiple Data AnnotationsHaoyang Li, Fangcheng Fu, Hao Ge, Sheng Lin, Xuanyu Wang, Jiawen Niu, Xupeng Miao, Bin Cui2025-04-29下载The Single Program Multiple Data (SPMD) paradigm provides a unified abstraction to annotate various parallel dimensions in distributed deep learning (DL) training.
Efficient Graph-Based Approximate Nearest Neighbor Search Achieving: Low Latency Without Throughput LossJingjia Luo, Mingxing Zhang, Kang Chen, Xia Liao, Yingdi Shan, Jinlei Jiang, Yongwei Wu2025-04-29下载The increase in the dimensionality of neural embedding models has enhanced the accuracy of semantic search capabilities but also amplified the computational demands for Approximate Nearest Neighbor Se...
CloudQC: A Network-aware Framework for Multi-tenant Distributed Quantum ComputingRuilin Zhou, Yuhang Gan, Yi Liu, Chen Qian2025-04-29下载Distributed quantum computing (DQC) that allows a large quantum circuit to be executed simultaneously on multiple quantum processing units (QPUs) becomes a promising approach to increase the scalabili...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Green Satellite Networks Using Segment Routing and Software-Defined NetworkingJintao Liang, Pablo G. Madoery, Chung-Horng Lung, Halim Yanikomeroglu, Gunes Karabulut Kurt2025-04-29下载This paper presents a comprehensive evaluation of network performance in software defined networking (SDN)-based low Earth orbit (LEO) satellite networks, focusing on the Telesat Lightspeed constellat...
Flexible Semantic-Aware Resource Allocation: Serving More Users Through Similarity Range ConstraintsNasrin Gholami, Neda Moghim, Behrouz Shahgholi Ghahfarokhi, Pouyan Salavati, Christo Kurisummoottil Thomas, Sachin Shetty, Tahereh Rahmati2025-04-29下载Semantic communication (SemCom) aims to enhance the resource efficiency of next-generation networks by transmitting the underlying meaning of messages, focusing on information relevant to the end user...
Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine LearningJinsun Yoo, ChonLam Lao, Lianjie Cao, Bob Lantz, Minlan Yu, Tushar Krishna, Puneet Sharma2025-04-29下载This paper lays the foundation for Genie, a testing framework that captures the impact of real hardware network behavior on ML workload performance, without requiring expensive GPUs.
did:self A registry-less DID methodNikos Fotiou, George C. Polyzos, Vasilios A. Siris2025-04-29下载We introduce did:self, a Decentralized Identifier (DID) method that does not depend on any trusted registry for storing the corresponding DID documents.
MACH: Multi-Agent Coordination for RSU-centric HandoversNikolaus Spring, Andrea Morichetta, Boris Sedlak, Schahram Dustdar2025-04-29下载This paper introduces MACH, a novel approach for optimizing task handover in vehicular computing scenarios. To ensure fast and latency-aware placement of tasks, the decision-making -- where and when s...
Handling Large-Scale Network Flow Records: A Comparative Study on Lossy CompressionGabriele Merlach, Damiano Ravalico, Martino Trevisan, Fabio Palmese, Giovanni Baccichet, Alessandro E. C. Redondi2025-04-29下载Flow records, that summarize the characteristics of traffic flows, represent a practical and powerful way to monitor a network. While they already offer significant compression compared to full packet...
WakeLoc: An Ultra-Low Power, Accurate and Scalable On-Demand RTLS using Wake-Up RadiosSilvano Cortesi, Christian Vogt, Michele Magno2025-04-29下载For future large scale robotic moon missions, the availability of infrastructure-less, cheap and low power real-time locating systems (RTLSs) is critical.
Fiber to the Room: Key Technologies, Challenges, and ProspectsJinhan Cai, Xiaolong Zhang, Xiang Wang, Tianhai Chang, Gangxiang Shen2025-04-29下载Fiber to the Room (FTTR) is a next-generation access network designed to deliver high bandwidth, low latency, and room-level optical coverage.
VA-CDH: A Variance-Aware Method to Optimize Latency for Caching with Delayed HitsBowen Jiang, Chaofan Ma, Duo Wang2025-04-29下载Caches are fundamental to latency-sensitive systems like Content Delivery Networks (CDNs) and Mobile Edge Computing (MEC). However, the delayed hit phenomenon where multiple requests for an object occ...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System VerificationShangyu Li, Juyong Jiang, Tiancheng Zhao, Jiasi Shen2025-04-29下载We introduce OSVBench, a new benchmark for evaluating Large Language Models (LLMs) on the task of generating complete formal specifications for verifying the functional correctness of operating system...
CrashFixer: A crash resolution agent for the Linux kernelAlex Mathai, Chenxi Huang, Suwei Ma, Jihwan Kim, Hailie Mitchell, Aleksandr Nogikh, Petros Maniatis, Franjo Ivančić, Junfeng Yang, Baishakhi Ray2025-04-29下载Code large language models (LLMs) have shown impressive capabilities on a multitude of software engineering tasks. In particular, they have demonstrated remarkable utility in the task of code repair.

cs.PF - Performance

标题作者发布日期PDF摘要
Information Retrieval in the Age of Generative AI: The RGB ModelMichele Garetto, Alessandro Cornacchia, Franco Galante, Emilio Leonardi, Alessandro Nordio, Alberto Tarable2025-04-29下载The advent of Large Language Models (LLMs) and generative AI is fundamentally transforming information retrieval and processing on the Internet, bringing both great potential and significant concerns ...
DEER: Deep Runahead for Instruction Prefetching on Modern Mobile WorkloadsParmida Vahdatniya, Julian Humecki, Henry Kao, Tony Li, Ali Sedaghati, Fang Su, Ruoyu Zhou, Alex Bi, Reza Azimi, Maziar Goudarzi2025-04-29下载Mobile workloads incur heavy frontend stalls due to increasingly large code footprints as well as long repeat cycles. Existing instruction-prefetching techniques suffer from low coverage, poor timelin...
CarbonCall: Sustainability-Aware Function Calling for Large Language Models on Edge DevicesVaratheepan Paramanayakam, Andreas Karatzas, Iraklis Anagnostopoulos, Dimitrios Stamoulis2025-04-29下载Large Language Models (LLMs) enable real-time function calling in edge AI systems but introduce significant computational overhead, leading to high power consumption and carbon emissions.

基于 VitePress 构建