2026-03-02
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| CUCo: An Agentic Framework for Compute and Communication Co-design | Bodun Hu, Yoga Sri Varshan, Saurabh Agarwal, Aditya Akella | 2026-03-02 | Download | Custom CUDA kernel development is essential for maximizing GPU utilization in large-scale distributed LLM training and inference, yet manually writing kernels that jointly leverage both computation an... |
| Security Risks in Machining Process Monitoring: Sequence-to-Sequence Learning for Reconstruction of CNC Axis Positions | Lukas Krupp, Rickmar Stahlschmidt, Norbert Wehn | 2026-03-02 | Download | Accelerometer-based process monitoring is widely deployed in modern machining systems. When mounted on moving machine components, such sensors implicitly capture kinematic information related to machi... |
| TeraPool: A Physical Design Aware, 1024 RISC-V Cores Shared-L1-Memory Scaled-up Cluster Design with High Bandwidth Main Memory Link | Yichao Zhang, Marco Bertuletti, Chi Zhang, Samuel Riedel, Diyou Shen, Bowen Wang, Alessandro Vanelli-Coralli, Luca Benini | 2026-03-02 | Download | Shared L1-memory clusters of streamlined instruction processors (processing elements - PEs) are commonly used as building blocks in modern, massively parallel computing architectures (e.g. GP-GPUs). |
| Closing the Gap Between Float and Posit Hardware Efficiency | Aditya Anirudh Jonnalagadda, Rishi Thotli, John L. Gustafson | 2026-03-02 | Download | The b-posit, or bounded posit, is a variation of the posit format designed for high performance computing (HPC) and AI applications. Unlike traditional floating-point formats (floats), posits use vari... |
| Hermes: A Unified High-Performance NTT Architecture with Hybrid Dataflow | Hang Gu, Teng Wang, Qianyu Cheng, Jinao Li, Zhendong Zheng, Lei Gong, Wenqi Lou, Xi Li, Xuehai Zhou | 2026-03-02 | Download | Fully Homomorphic Encryption (FHE) relies heavily on the Number Theoretic Transform (NTT), making NTT a major performance bottleneck due to its intensive polynomial computations. |
| RoboGPU: Accelerating GPU Collision Detection for Robotics | Lufei Liu, Liwei Xue, Youssef Mohammed, Jocelyn Zhao, Yuan Hsi Chou, Tor M. Aamodt | 2026-03-02 | Download | Autonomous robots are increasingly prevalent in our society, emerging in medical care, transportation vehicles, and home assistance. These robots rely on motion planning and collision detection to ide... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| CUCo: An Agentic Framework for Compute and Communication Co-design | Bodun Hu, Yoga Sri Varshan, Saurabh Agarwal, Aditya Akella | 2026-03-02 | Download | Custom CUDA kernel development is essential for maximizing GPU utilization in large-scale distributed LLM training and inference, yet manually writing kernels that jointly leverage both computation an... |
| Trident: Adaptive Scheduling for Heterogeneous Multimodal Data Pipelines | Ding Pan, Zhuangzhuang Zhou, Long Qian, Binhang Yuan | 2026-03-02 | Download | The rapid adoption of large language models and multimodal foundation models has made multimodal data preparation pipelines critical AI infrastructure. |
| Subcubic Coin Tossing in Asynchrony without Setup | Mose Mizrahi, Roger Wattenhofer | 2026-03-02 | Download | We consider an asynchronous network of parties connected to each other via secure channels, up to of which are byzantine. We study common coin tossing, a task where the parties try to agree on... |
| Beyond Microservices: Testing Web-Scale RCA Methods on GPU-Driven LLM Workloads | Dominik Scheinert, Alexander Acker, Thorsten Wittkopp, Soeren Becker, Hamza Yous, Karnakar Reddy, Ibrahim Farhat, Hakim Hacid, Odej Kao | 2026-03-02 | Download | Large language model (LLM) services have become an integral part of search, assistance, and decision-making applications. However, unlike traditional web or microservices, the hardware and software st... |
| GoldbachGPU: An Open Source GPU-Accelerated Framework for Verification of Goldbach's Conjecture | Isaac Llorente-Saguer | 2026-03-02 | Download | We present GoldbachGPU, an open-source framework for large-scale computational verification of Goldbach's conjecture using commodity GPU hardware. |
| Extension of ACETONE C code generator for multi-core architectures | Yanis Aït-Aïssa, Thomas Carle, Sergei Chichin, Benjamin Lesage, Claire Pagetti | 2026-03-02 | Download | As the industry's interest in machine learning has grown in recent years, some solutions have emerged to safely embed them in safety-critical systems, such as the C code generator ACETONE. |
| CA-AFP: Cluster-Aware Adaptive Federated Pruning | Om Govind Jha, Harsh Shukla, Haroon R. Lone | 2026-03-02 | Download | Federated Learning (FL) faces major challenges in real-world deployments due to statistical heterogeneity across clients and system heterogeneity arising from resource-constrained devices. |
| HeRo: Adaptive Orchestration of Agentic RAG on Heterogeneous Mobile SoC | Maoliang Li, Jiayu Chen, Zihao Zheng, Ziqian Li, Xinhao Sun, Guojie Luo, Chenchen Liu, Xiang Chen | 2026-03-02 | Download | With the increasing computational capability of mobile devices, deploying agentic retrieval-augmented generation (RAG) locally on heterogeneous System-on-Chips (SoCs) has become a promising way to enh... |
| TeraPool: A Physical Design Aware, 1024 RISC-V Cores Shared-L1-Memory Scaled-up Cluster Design with High Bandwidth Main Memory Link | Yichao Zhang, Marco Bertuletti, Chi Zhang, Samuel Riedel, Diyou Shen, Bowen Wang, Alessandro Vanelli-Coralli, Luca Benini | 2026-03-02 | Download | Shared L1-memory clusters of streamlined instruction processors (processing elements - PEs) are commonly used as building blocks in modern, massively parallel computing architectures (e.g. GP-GPUs). |
| A Cascaded Graph Neural Network for Joint Root Cause Localization and Analysis in Edge Computing Environments | Duneesha Fernando, Maria A. Rodriguez, Rajkumar Buyya | 2026-03-02 | Download | Edge computing environments host increasingly complex microservice-based IoT applications that are prone to performance anomalies propagating across dependent services. |
| The Semantic Arrow of Time, Part I: From Eddington to Ethernet | Paul Borrill | 2026-03-02 | Download | This is the first of five papers comprising The Semantic Arrow of Time. The argument begins with a claim: computing's arrow of time is semantic, not thermodynamic. |
| Message Passing Without Temporal Direction: Constraint Semantics and the FITO Category Mistake | Paul Borrill | 2026-03-02 | Download | Message passing is widely assumed to be a fundamental primitive of distributed systems. This paper argues that conventional message systems embed a category mistake: they misinterpret logical dependen... |
| Quasar: Quantized Self-Speculative Acceleration for Rapid Inference via Memory-Efficient Verification | Guang Huang, Zeyi Wen | 2026-03-02 | Download | Speculative Decoding (SD) has emerged as a premier technique for accelerating Large Language Model (LLM) inference by decoupling token generation into rapid drafting and parallel verification. |
| Unix Tools and the FITO Category Mistake: Crash Consistency and the Protocol Nature of Persistence | Paul Borrill | 2026-03-02 | Download | Unix tools such as ls, cp, mv, and rename expose a filesystem abstraction that appears to present a single, authoritative state evolving through atomic transitions. This abstraction is false. |
| Fed-GAME: Personalized Federated Learning with Graph Attention Mixture-of-Experts For Time-Series Forecasting | Yi Li, Han Liu, Mingfeng Fan, Guo Chen, Chaojie Li, Biplab Sikdar | 2026-03-02 | Download | Federated learning (FL) on graphs shows promise for distributed time-series forecasting. Yet, existing methods rely on static topologies and struggle with client heterogeneity. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Characterizing Information Accuracy in Timeliness-Based Gossip Networks | Emirhan Tekez, Melih Bastopcu, Sinan Gezici | 2026-03-02 | Download | We investigate information accuracy in timeliness-based gossip networks where the source evolves according to a continuous-time Markov chain (CTMC) with states and disseminates status updates to a... |
| Adaptive Intent-Aware PoW Mechanism in SDN for Multi-Domain SYN Flood Mitigation | Wenyang Jia | 2026-03-02 | Download | The stability of Internet services is persistently challenged by the escalating scale of volumetric TCP SYN floods, as conventional defenses like SYN Cookies fail by exacerbating bandwidth depletion u... |
| How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks | Mohamed Amine Ferrag, Abderrahmane Lakas, Merouane Debbah | 2026-03-02 | Download | Emerging 6G visions, reflected in ongoing standardization efforts within 3GPP, IETF, ETSI, ITU-T, and the O-RAN Alliance, increasingly characterize networks as AI-native systems in which high-level se... |
| Demonstration of a 1.2 Gbps Always-on Fully-Connected Mesh Network with RFSoC SDRs | Hatef Nouri, George Sklivanitis, Dimitris A. Pados, Elizabeth Serena Bentley | 2026-03-02 | Download | We design and implement on Radio Frequency System-on-Chip (RFSoC) software-defined radios (SDRs) a complete-graph network of four unmanned aerial vehicles and demonstrate real-time 4K video streaming ... |
| Resilient Chaotic Cross-Layer Routing for Smart Grid IoT Networks | Dhrumil Bhatt, Anakha Kurup, R. C. Mala | 2026-03-02 | Download | This paper presents the Distributed Adaptive Multi-Radio Cross-Layer Routing (DAMCR) protocol, designed to enhance reliability, adaptability, and energy efficiency in smart grid and industrial Interne... |
| Federated Agentic AI for Wireless Networks: Fundamentals, Approaches, and Applications | Lingyi Cai, Yu Zhang, Ruichen Zhang, Yinqiu Liu, Tao Jiang, Dusit Niyato, Wei Ni, Abbas Jamalipour | 2026-03-02 | Download | Agentic artificial intelligence (AI) presents a promising pathway toward realizing autonomous and self-improving wireless network services. However, resource-constrained, widely distributed, and data-... |
| Predictive Importance Sampling Based Coverage Verification for Multi-UAV Trajectory Planning | Snehashish Ghosh, Sasthi C. Ghosh | 2026-03-02 | Download | Unmanned aerial vehicle (UAV) networks are emerging as a promising solution for ultra-reliable low-latency communication (URLLC) in next-generation wireless systems. |
| Contract-based Agentic Intent Framework for Network Slicing in O-RAN | Fransiscus Asisi Bimo, Chun-Kai Lai, Zhi-Yuan Yang, Ray-Guang Cheng | 2026-03-02 | Download | Intent-based networking aims to simplify network operation by translating operator intents into a collection of policies, configurations, and control actions. |
| Large Language Models as Bidding Agents in Repeated HetNet Auction | Ismail Lotfi, Ali Ghrayeb, Samson Lasaulce, Merouane Debbah | 2026-03-02 | Download | This paper investigates the integration of large language models (LLMs) as reasoning agents in repeated spectrum auctions within heterogeneous networks (HetNets). |
| Regularized Diffusion-based Contract Model for Covert Semantic Entropy Control in LAENets | Yansheng Liu, Jinbo Wen, Kun Zhu, Yang Zhang, Jiawen Kang | 2026-03-02 | Download | Low-Altitude Economy Networks (LAENets) have emerged as a critical communication paradigm for operation-critical and regulation-aware applications, where Unmanned Aerial Vehicles (UAVs) transmit task-... |
| Energy Efficient Traffic Scheduling For Optical LEO Satellite Downlinks | Ethan Fettes, Pablo G. Madoery, Halim Yanikomeroglu, Gunes Karabulut Kurt, Abhishek Naik, Stéphane Martel | 2026-03-02 | Download | In recent years, the number of satellites in orbit has increased rapidly, with megaconstellations like Starlink providing near-global, delay-sensitive communication services. |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Machine Learning (ML) library in Linux kernel | Viacheslav Dubeyko | 2026-03-02 | Download | Linux kernel is a huge code base with enormous number of subsystems and possible configuration options that results in unmanageable complexity of elaborating an efficient configuration. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| GoldbachGPU: An Open Source GPU-Accelerated Framework for Verification of Goldbach's Conjecture | Isaac Llorente-Saguer | 2026-03-02 | Download | We present GoldbachGPU, an open-source framework for large-scale computational verification of Goldbach's conjecture using commodity GPU hardware. |
| Fast Entropy Decoding for Sparse MVM on GPUs | Emil Schätzle, Tommaso Pegolotti, Markus Püschel | 2026-03-02 | Download | We present a novel, practical approach to speed up sparse matrix-vector multiplication (SpMVM) on GPUs. The novel key idea is to apply lossless entropy coding to further compress the sparse matrix whe... |