2026-03-02

cs.AR - Hardware Architecture

Title	Authors	Published	PDF	Abstract
CUCo: An Agentic Framework for Compute and Communication Co-design	Bodun Hu, Yoga Sri Varshan, Saurabh Agarwal, Aditya Akella	2026-03-02	Download	Custom CUDA kernel development is essential for maximizing GPU utilization in large-scale distributed LLM training and inference, yet manually writing kernels that jointly leverage both computation an...
Security Risks in Machining Process Monitoring: Sequence-to-Sequence Learning for Reconstruction of CNC Axis Positions	Lukas Krupp, Rickmar Stahlschmidt, Norbert Wehn	2026-03-02	Download	Accelerometer-based process monitoring is widely deployed in modern machining systems. When mounted on moving machine components, such sensors implicitly capture kinematic information related to machi...
TeraPool: A Physical Design Aware, 1024 RISC-V Cores Shared-L1-Memory Scaled-up Cluster Design with High Bandwidth Main Memory Link	Yichao Zhang, Marco Bertuletti, Chi Zhang, Samuel Riedel, Diyou Shen, Bowen Wang, Alessandro Vanelli-Coralli, Luca Benini	2026-03-02	Download	Shared L1-memory clusters of streamlined instruction processors (processing elements - PEs) are commonly used as building blocks in modern, massively parallel computing architectures (e.g. GP-GPUs).
Closing the Gap Between Float and Posit Hardware Efficiency	Aditya Anirudh Jonnalagadda, Rishi Thotli, John L. Gustafson	2026-03-02	Download	The b-posit, or bounded posit, is a variation of the posit format designed for high performance computing (HPC) and AI applications. Unlike traditional floating-point formats (floats), posits use vari...
Hermes: A Unified High-Performance NTT Architecture with Hybrid Dataflow	Hang Gu, Teng Wang, Qianyu Cheng, Jinao Li, Zhendong Zheng, Lei Gong, Wenqi Lou, Xi Li, Xuehai Zhou	2026-03-02	Download	Fully Homomorphic Encryption (FHE) relies heavily on the Number Theoretic Transform (NTT), making NTT a major performance bottleneck due to its intensive polynomial computations.
RoboGPU: Accelerating GPU Collision Detection for Robotics	Lufei Liu, Liwei Xue, Youssef Mohammed, Jocelyn Zhao, Yuan Hsi Chou, Tor M. Aamodt	2026-03-02	Download	Autonomous robots are increasingly prevalent in our society, emerging in medical care, transportation vehicles, and home assistance. These robots rely on motion planning and collision detection to ide...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
CUCo: An Agentic Framework for Compute and Communication Co-design	Bodun Hu, Yoga Sri Varshan, Saurabh Agarwal, Aditya Akella	2026-03-02	Download	Custom CUDA kernel development is essential for maximizing GPU utilization in large-scale distributed LLM training and inference, yet manually writing kernels that jointly leverage both computation an...
Trident: Adaptive Scheduling for Heterogeneous Multimodal Data Pipelines	Ding Pan, Zhuangzhuang Zhou, Long Qian, Binhang Yuan	2026-03-02	Download	The rapid adoption of large language models and multimodal foundation models has made multimodal data preparation pipelines critical AI infrastructure.
Subcubic Coin Tossing in Asynchrony without Setup	Mose Mizrahi, Roger Wattenhofer	2026-03-02	Download	We consider an asynchronous network of $n$ parties connected to each other via secure channels, up to $t$ of which are byzantine. We study common coin tossing, a task where the parties try to agree on...
Beyond Microservices: Testing Web-Scale RCA Methods on GPU-Driven LLM Workloads	Dominik Scheinert, Alexander Acker, Thorsten Wittkopp, Soeren Becker, Hamza Yous, Karnakar Reddy, Ibrahim Farhat, Hakim Hacid, Odej Kao	2026-03-02	Download	Large language model (LLM) services have become an integral part of search, assistance, and decision-making applications. However, unlike traditional web or microservices, the hardware and software st...
GoldbachGPU: An Open Source GPU-Accelerated Framework for Verification of Goldbach's Conjecture	Isaac Llorente-Saguer	2026-03-02	Download	We present GoldbachGPU, an open-source framework for large-scale computational verification of Goldbach's conjecture using commodity GPU hardware.
Extension of ACETONE C code generator for multi-core architectures	Yanis Aït-Aïssa, Thomas Carle, Sergei Chichin, Benjamin Lesage, Claire Pagetti	2026-03-02	Download	As the industry's interest in machine learning has grown in recent years, some solutions have emerged to safely embed them in safety-critical systems, such as the C code generator ACETONE.
CA-AFP: Cluster-Aware Adaptive Federated Pruning	Om Govind Jha, Harsh Shukla, Haroon R. Lone	2026-03-02	Download	Federated Learning (FL) faces major challenges in real-world deployments due to statistical heterogeneity across clients and system heterogeneity arising from resource-constrained devices.
HeRo: Adaptive Orchestration of Agentic RAG on Heterogeneous Mobile SoC	Maoliang Li, Jiayu Chen, Zihao Zheng, Ziqian Li, Xinhao Sun, Guojie Luo, Chenchen Liu, Xiang Chen	2026-03-02	Download	With the increasing computational capability of mobile devices, deploying agentic retrieval-augmented generation (RAG) locally on heterogeneous System-on-Chips (SoCs) has become a promising way to enh...
TeraPool: A Physical Design Aware, 1024 RISC-V Cores Shared-L1-Memory Scaled-up Cluster Design with High Bandwidth Main Memory Link	Yichao Zhang, Marco Bertuletti, Chi Zhang, Samuel Riedel, Diyou Shen, Bowen Wang, Alessandro Vanelli-Coralli, Luca Benini	2026-03-02	Download	Shared L1-memory clusters of streamlined instruction processors (processing elements - PEs) are commonly used as building blocks in modern, massively parallel computing architectures (e.g. GP-GPUs).
A Cascaded Graph Neural Network for Joint Root Cause Localization and Analysis in Edge Computing Environments	Duneesha Fernando, Maria A. Rodriguez, Rajkumar Buyya	2026-03-02	Download	Edge computing environments host increasingly complex microservice-based IoT applications that are prone to performance anomalies propagating across dependent services.
The Semantic Arrow of Time, Part I: From Eddington to Ethernet	Paul Borrill	2026-03-02	Download	This is the first of five papers comprising The Semantic Arrow of Time. The argument begins with a claim: computing's arrow of time is semantic, not thermodynamic.
Message Passing Without Temporal Direction: Constraint Semantics and the FITO Category Mistake	Paul Borrill	2026-03-02	Download	Message passing is widely assumed to be a fundamental primitive of distributed systems. This paper argues that conventional message systems embed a category mistake: they misinterpret logical dependen...
Quasar: Quantized Self-Speculative Acceleration for Rapid Inference via Memory-Efficient Verification	Guang Huang, Zeyi Wen	2026-03-02	Download	Speculative Decoding (SD) has emerged as a premier technique for accelerating Large Language Model (LLM) inference by decoupling token generation into rapid drafting and parallel verification.
Unix Tools and the FITO Category Mistake: Crash Consistency and the Protocol Nature of Persistence	Paul Borrill	2026-03-02	Download	Unix tools such as ls, cp, mv, and rename expose a filesystem abstraction that appears to present a single, authoritative state evolving through atomic transitions. This abstraction is false.
Fed-GAME: Personalized Federated Learning with Graph Attention Mixture-of-Experts For Time-Series Forecasting	Yi Li, Han Liu, Mingfeng Fan, Guo Chen, Chaojie Li, Biplab Sikdar	2026-03-02	Download	Federated learning (FL) on graphs shows promise for distributed time-series forecasting. Yet, existing methods rely on static topologies and struggle with client heterogeneity.

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Characterizing Information Accuracy in Timeliness-Based Gossip Networks	Emirhan Tekez, Melih Bastopcu, Sinan Gezici	2026-03-02	Download	We investigate information accuracy in timeliness-based gossip networks where the source evolves according to a continuous-time Markov chain (CTMC) with $M$ states and disseminates status updates to a...
Adaptive Intent-Aware PoW Mechanism in SDN for Multi-Domain SYN Flood Mitigation	Wenyang Jia	2026-03-02	Download	The stability of Internet services is persistently challenged by the escalating scale of volumetric TCP SYN floods, as conventional defenses like SYN Cookies fail by exacerbating bandwidth depletion u...
How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks	Mohamed Amine Ferrag, Abderrahmane Lakas, Merouane Debbah	2026-03-02	Download	Emerging 6G visions, reflected in ongoing standardization efforts within 3GPP, IETF, ETSI, ITU-T, and the O-RAN Alliance, increasingly characterize networks as AI-native systems in which high-level se...
Demonstration of a 1.2 Gbps Always-on Fully-Connected Mesh Network with RFSoC SDRs	Hatef Nouri, George Sklivanitis, Dimitris A. Pados, Elizabeth Serena Bentley	2026-03-02	Download	We design and implement on Radio Frequency System-on-Chip (RFSoC) software-defined radios (SDRs) a complete-graph network of four unmanned aerial vehicles and demonstrate real-time 4K video streaming ...
Resilient Chaotic Cross-Layer Routing for Smart Grid IoT Networks	Dhrumil Bhatt, Anakha Kurup, R. C. Mala	2026-03-02	Download	This paper presents the Distributed Adaptive Multi-Radio Cross-Layer Routing (DAMCR) protocol, designed to enhance reliability, adaptability, and energy efficiency in smart grid and industrial Interne...
Federated Agentic AI for Wireless Networks: Fundamentals, Approaches, and Applications	Lingyi Cai, Yu Zhang, Ruichen Zhang, Yinqiu Liu, Tao Jiang, Dusit Niyato, Wei Ni, Abbas Jamalipour	2026-03-02	Download	Agentic artificial intelligence (AI) presents a promising pathway toward realizing autonomous and self-improving wireless network services. However, resource-constrained, widely distributed, and data-...
Predictive Importance Sampling Based Coverage Verification for Multi-UAV Trajectory Planning	Snehashish Ghosh, Sasthi C. Ghosh	2026-03-02	Download	Unmanned aerial vehicle (UAV) networks are emerging as a promising solution for ultra-reliable low-latency communication (URLLC) in next-generation wireless systems.
Contract-based Agentic Intent Framework for Network Slicing in O-RAN	Fransiscus Asisi Bimo, Chun-Kai Lai, Zhi-Yuan Yang, Ray-Guang Cheng	2026-03-02	Download	Intent-based networking aims to simplify network operation by translating operator intents into a collection of policies, configurations, and control actions.
Large Language Models as Bidding Agents in Repeated HetNet Auction	Ismail Lotfi, Ali Ghrayeb, Samson Lasaulce, Merouane Debbah	2026-03-02	Download	This paper investigates the integration of large language models (LLMs) as reasoning agents in repeated spectrum auctions within heterogeneous networks (HetNets).
Regularized Diffusion-based Contract Model for Covert Semantic Entropy Control in LAENets	Yansheng Liu, Jinbo Wen, Kun Zhu, Yang Zhang, Jiawen Kang	2026-03-02	Download	Low-Altitude Economy Networks (LAENets) have emerged as a critical communication paradigm for operation-critical and regulation-aware applications, where Unmanned Aerial Vehicles (UAVs) transmit task-...
Energy Efficient Traffic Scheduling For Optical LEO Satellite Downlinks	Ethan Fettes, Pablo G. Madoery, Halim Yanikomeroglu, Gunes Karabulut Kurt, Abhishek Naik, Stéphane Martel	2026-03-02	Download	In recent years, the number of satellites in orbit has increased rapidly, with megaconstellations like Starlink providing near-global, delay-sensitive communication services.

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
Machine Learning (ML) library in Linux kernel	Viacheslav Dubeyko	2026-03-02	Download	Linux kernel is a huge code base with enormous number of subsystems and possible configuration options that results in unmanageable complexity of elaborating an efficient configuration.

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
GoldbachGPU: An Open Source GPU-Accelerated Framework for Verification of Goldbach's Conjecture	Isaac Llorente-Saguer	2026-03-02	Download	We present GoldbachGPU, an open-source framework for large-scale computational verification of Goldbach's conjecture using commodity GPU hardware.
Fast Entropy Decoding for Sparse MVM on GPUs	Emil Schätzle, Tommaso Pegolotti, Markus Püschel	2026-03-02	Download	We present a novel, practical approach to speed up sparse matrix-vector multiplication (SpMVM) on GPUs. The novel key idea is to apply lossless entropy coding to further compress the sparse matrix whe...