2026-03-23

cs.AR - Architecture

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| CPU Simulation Using Two-Phase Stratified Sampling | Magnus Ekman | 2026-03-23 | Simulation remains a cornerstone of computer architecture research, yet full end-to-end application execution is prohibitively time-consuming. |
| CPU Simulation with Ranked Set Sampling and Repeated Subsampling | Magnus Ekman | 2026-03-23 | Computer system simulation studies routinely rely on executing a limited number of short application regions, since full end-to-end simulation is prohibitively time-consuming. |
| SCALE-Sim TPU: Validating and Extending SCALE-Sim for TPUs | Jingtian Dang, Ritik Raj, Changhai Man, Jianming Tong, Tushar Krishna | 2026-03-23 | Cycle-accurate simulators are widely used to study systolic accelerators, yet their accuracy and usability are often limited by weak validation against real hardware and poor integration with modern M... |
| Linux and High-Performance Computing | David A. Bader | 2026-03-23 | In the 1980s, high-performance computing (HPC) became another tool for research in the open (non-defense) science and engineering research communities. |
| Low Latency GNN Accelerator for Quantum Error Correction | Alessio Cicero, Luigi Altamura, Moritz Lange, Mats Granath, Pedro Trancoso | 2026-03-23 | Quantum computers have the potential to solve certain complex problems in a much more efficient way than classical computers. Nevertheless, current quantum computer implementations are limited by high... |
| Convolutions Predictable Offloading to an Accelerator: Formalization and Optimization | Benjamin Husson, Mohammed Belcaïd, Thomas Carle, Claire Pagetti | 2026-03-23 | Convolutional neural networks (CNNs) require a large number of multiply-accumulate (MAC) operations. To meet real-time constraints, they often need to be executed on specialized accelerators composed ... |
| Quantifying Uncertainty in FMEDA Safety Metrics: An Error Propagation Approach for Enhanced ASIC Verification | Antonino Armato, Christian Kehl, Sebastian Fischer | 2026-03-23 | Accurate and reliable safety metrics are paramount for functional safety verification of ASICs in automotive systems. Traditional FMEDA (Failure Modes, Effects, and Diagnostic Analysis) metrics, such ... |
| IMMSched: Interruptible Multi-DNN Scheduling via Parallel Multi-Particle Optimizing Subgraph Isomorphism | Boran Zhao, Hetian Liu, Zihang Yuan, Yanbin Hu, Wenzhe Zhao, Tian Xia, Pengju Ren | 2026-03-23 | The growing demand for multi-DNN workloads with unpredictable task arrival times has highlighted the need for interruptible scheduling on edge accelerators. |
| PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Selection | Hyoseok Park, Yeonsang Park | 2026-03-23 | Long-context LLM inference is bottlenecked not by compute but by the O(n) memory bandwidth cost of scanning the KV cache at every decode step -- a wall that no amount of arithmetic scaling can break. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Interactive and Urgent HPC: State of the Research | Albert Reuther, William Arndt, Johannes Blaschke, Christian Boehme, Nick Brown, Antony Chazapis, Bjoern Enders, Jens Henrik Goebbert, Robert Henschel, Julian Kunkel, Maxime Martinasso, Michael Ringenburg, Rollin Thomas | 2026-03-23 | When we think of how we use smartphones, e-commerce, collaboration platforms, LLMs, etc., most of our interactions with computers are interactive and often urgent. |
| Communication-Efficient Approximate Gradient Coding | Sifat Munim, Aditya Ramamoorthy | 2026-03-23 | Large-scale distributed learning aims at minimizing a loss function L that depends on a training dataset with respect to a d-length parameter vector. |
| Linux and High-Performance Computing | David A. Bader | 2026-03-23 | In the 1980s, high-performance computing (HPC) became another tool for research in the open (non-defense) science and engineering research communities. |
| A Theoretical Framework for Energy-Aware Gradient Pruning in Federated Learning | Emmanouil M. Athanasakos | 2026-03-23 | Federated Learning (FL) is constrained by the communication and energy limitations of decentralized edge devices. While gradient sparsification via Top-K magnitude pruning effectively reduces the comm... |
| exaCB: Reproducible Continuous Benchmark Collections at Scale Leveraging an Incremental Approach | Jayesh Badwaik, Mathis Bode, Michal Rajski, Andreas Herten | 2026-03-23 | The increasing heterogeneity of high-performance computing (HPC) systems and the transition to exascale architectures require systematic and reproducible performance evaluation across diverse workload... |
| A Density-Delay Law for Stable Event-Driven State Progression in Open Distributed Systems | Bin Chen, Dechuang Huang | 2026-03-23 | Distributed systems in which concurrent proposals are mutually exclusive face a fundamental stability constraint under network delay. In open systems where global state progression is event-driven rat... |
| Reasoning Provenance for Autonomous AI Agents: Structured Behavioral Analytics Beyond State Checkpoints and Execution Traces | Neelmani Vispute | 2026-03-23 | As AI agents transition from human-supervised copilots to autonomous platform infrastructure, the ability to analyze their reasoning behavior across populations of investigations becomes a pressing in... |
| Benchmarking Message Brokers for IoT Edge Computing: A Comprehensive Performance Study | Tapajit Chandra Paul, Pawissanutt Lertpongrujikorn, Hai Duc Nguyen, Mohsen Amini Salehi | 2026-03-23 | Asynchronous messaging is a cornerstone of modern distributed systems, enabling decoupled communication for scalable and resilient applications. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Satellite-Terrestrial Spectrum Sharing in FR3 through QoS-Aware Power Control and Spatial Nulling | Maria Tsampazi, Paolo Testolina, Michele Polese, Tommaso Melodia | 2026-03-23 | Frequency Range 3 (FR3), encompassing frequencies between 7.125 and 24.25 GHz, is an emerging frequency band for 6th generation (6G) applications. |
| Architectural Enhancements for Efficient Sensing Data Utilization in 6G ISAC | Muhammad Awais Jadoon, Sebastian Robitzsch | 2026-03-23 | Current architecture proposals within standards development organizations such as ETSI and 3GPP enable sensing capabilities in mobile networks; however, they do not include a repository for storing se... |
| A Theoretical Framework for Energy-Aware Gradient Pruning in Federated Learning | Emmanouil M. Athanasakos | 2026-03-23 | Federated Learning (FL) is constrained by the communication and energy limitations of decentralized edge devices. While gradient sparsification via Top-K magnitude pruning effectively reduces the comm... |
| A Density-Delay Law for Stable Event-Driven State Progression in Open Distributed Systems | Bin Chen, Dechuang Huang | 2026-03-23 | Distributed systems in which concurrent proposals are mutually exclusive face a fundamental stability constraint under network delay. In open systems where global state progression is event-driven rat... |

cs.OS - Operating Systems

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Tock: From Research to Securing 10 Million Computers | Leon Schuermann, Brad Campbell, Branden Ghena, Philip Levis, Amit Levy, Pat Pannuto | 2026-03-23 | Tock began 10 years ago as a research operating system developed by academics to help other academics build urban sensing applications. By leveraging a new language (Rust) and new hardware protection ... |
| GateANN: I/O-Efficient Filtered Vector Search on SSDs | Nakyung Lee, Soobin Cho, Jiwoong Park, Gyuyeong Kim | 2026-03-23 | We present GateANN, an I/O-efficient SSD-based graph ANNS system that supports filtered vector search on an unmodified graph index. Existing SSD-based systems either waste I/O by post-filtering, or re... |

cs.PF - Performance

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| SCALE-Sim TPU: Validating and Extending SCALE-Sim for TPUs | Jingtian Dang, Ritik Raj, Changhai Man, Jianming Tong, Tushar Krishna | 2026-03-23 | Cycle-accurate simulators are widely used to study systolic accelerators, yet their accuracy and usability are often limited by weak validation against real hardware and poor integration with modern M... |
