
2026-03-02

cs.AR - Architecture

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| CUCo: An Agentic Framework for Compute and Communication Co-design | Bodun Hu, Yoga Sri Varshan, Saurabh Agarwal, Aditya Akella | 2026-03-02 | Custom CUDA kernel development is essential for maximizing GPU utilization in large-scale distributed LLM training and inference, yet manually writing kernels that jointly leverage both computation an... |
| Security Risks in Machining Process Monitoring: Sequence-to-Sequence Learning for Reconstruction of CNC Axis Positions | Lukas Krupp, Rickmar Stahlschmidt, Norbert Wehn | 2026-03-02 | Accelerometer-based process monitoring is widely deployed in modern machining systems. When mounted on moving machine components, such sensors implicitly capture kinematic information related to machi... |
| TeraPool: A Physical Design Aware, 1024 RISC-V Cores Shared-L1-Memory Scaled-up Cluster Design with High Bandwidth Main Memory Link | Yichao Zhang, Marco Bertuletti, Chi Zhang, Samuel Riedel, Diyou Shen, Bowen Wang, Alessandro Vanelli-Coralli, Luca Benini | 2026-03-02 | Shared L1-memory clusters of streamlined instruction processors (processing elements - PEs) are commonly used as building blocks in modern, massively parallel computing architectures (e.g. GP-GPUs). |
| Closing the Gap Between Float and Posit Hardware Efficiency | Aditya Anirudh Jonnalagadda, Rishi Thotli, John L. Gustafson | 2026-03-02 | The b-posit, or bounded posit, is a variation of the posit format designed for high performance computing (HPC) and AI applications. Unlike traditional floating-point formats (floats), posits use vari... |
| Hermes: A Unified High-Performance NTT Architecture with Hybrid Dataflow | Hang Gu, Teng Wang, Qianyu Cheng, Jinao Li, Zhendong Zheng, Lei Gong, Wenqi Lou, Xi Li, Xuehai Zhou | 2026-03-02 | Fully Homomorphic Encryption (FHE) relies heavily on the Number Theoretic Transform (NTT), making NTT a major performance bottleneck due to its intensive polynomial computations. |
| RoboGPU: Accelerating GPU Collision Detection for Robotics | Lufei Liu, Liwei Xue, Youssef Mohammed, Jocelyn Zhao, Yuan Hsi Chou, Tor M. Aamodt | 2026-03-02 | Autonomous robots are increasingly prevalent in our society, emerging in medical care, transportation vehicles, and home assistance. These robots rely on motion planning and collision detection to ide... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| CUCo: An Agentic Framework for Compute and Communication Co-design | Bodun Hu, Yoga Sri Varshan, Saurabh Agarwal, Aditya Akella | 2026-03-02 | Custom CUDA kernel development is essential for maximizing GPU utilization in large-scale distributed LLM training and inference, yet manually writing kernels that jointly leverage both computation an... |
| Trident: Adaptive Scheduling for Heterogeneous Multimodal Data Pipelines | Ding Pan, Zhuangzhuang Zhou, Long Qian, Binhang Yuan | 2026-03-02 | The rapid adoption of large language models and multimodal foundation models has made multimodal data preparation pipelines critical AI infrastructure. |
| Subcubic Coin Tossing in Asynchrony without Setup | Mose Mizrahi, Roger Wattenhofer | 2026-03-02 | We consider an asynchronous network of n parties connected to each other via secure channels, up to t of which are byzantine. We study common coin tossing, a task where the parties try to agree on... |
| Beyond Microservices: Testing Web-Scale RCA Methods on GPU-Driven LLM Workloads | Dominik Scheinert, Alexander Acker, Thorsten Wittkopp, Soeren Becker, Hamza Yous, Karnakar Reddy, Ibrahim Farhat, Hakim Hacid, Odej Kao | 2026-03-02 | Large language model (LLM) services have become an integral part of search, assistance, and decision-making applications. However, unlike traditional web or microservices, the hardware and software st... |
| GoldbachGPU: An Open Source GPU-Accelerated Framework for Verification of Goldbach's Conjecture | Isaac Llorente-Saguer | 2026-03-02 | We present GoldbachGPU, an open-source framework for large-scale computational verification of Goldbach's conjecture using commodity GPU hardware. |
| Extension of ACETONE C code generator for multi-core architectures | Yanis Aït-Aïssa, Thomas Carle, Sergei Chichin, Benjamin Lesage, Claire Pagetti | 2026-03-02 | As the industry's interest in machine learning has grown in recent years, some solutions have emerged to safely embed them in safety-critical systems, such as the C code generator ACETONE. |
| CA-AFP: Cluster-Aware Adaptive Federated Pruning | Om Govind Jha, Harsh Shukla, Haroon R. Lone | 2026-03-02 | Federated Learning (FL) faces major challenges in real-world deployments due to statistical heterogeneity across clients and system heterogeneity arising from resource-constrained devices. |
| HeRo: Adaptive Orchestration of Agentic RAG on Heterogeneous Mobile SoC | Maoliang Li, Jiayu Chen, Zihao Zheng, Ziqian Li, Xinhao Sun, Guojie Luo, Chenchen Liu, Xiang Chen | 2026-03-02 | With the increasing computational capability of mobile devices, deploying agentic retrieval-augmented generation (RAG) locally on heterogeneous System-on-Chips (SoCs) has become a promising way to enh... |
| TeraPool: A Physical Design Aware, 1024 RISC-V Cores Shared-L1-Memory Scaled-up Cluster Design with High Bandwidth Main Memory Link | Yichao Zhang, Marco Bertuletti, Chi Zhang, Samuel Riedel, Diyou Shen, Bowen Wang, Alessandro Vanelli-Coralli, Luca Benini | 2026-03-02 | Shared L1-memory clusters of streamlined instruction processors (processing elements - PEs) are commonly used as building blocks in modern, massively parallel computing architectures (e.g. GP-GPUs). |
| A Cascaded Graph Neural Network for Joint Root Cause Localization and Analysis in Edge Computing Environments | Duneesha Fernando, Maria A. Rodriguez, Rajkumar Buyya | 2026-03-02 | Edge computing environments host increasingly complex microservice-based IoT applications that are prone to performance anomalies propagating across dependent services. |
| The Semantic Arrow of Time, Part I: From Eddington to Ethernet | Paul Borrill | 2026-03-02 | This is the first of five papers comprising The Semantic Arrow of Time. The argument begins with a claim: computing's arrow of time is semantic, not thermodynamic. |
| Message Passing Without Temporal Direction: Constraint Semantics and the FITO Category Mistake | Paul Borrill | 2026-03-02 | Message passing is widely assumed to be a fundamental primitive of distributed systems. This paper argues that conventional message systems embed a category mistake: they misinterpret logical dependen... |
| Quasar: Quantized Self-Speculative Acceleration for Rapid Inference via Memory-Efficient Verification | Guang Huang, Zeyi Wen | 2026-03-02 | Speculative Decoding (SD) has emerged as a premier technique for accelerating Large Language Model (LLM) inference by decoupling token generation into rapid drafting and parallel verification. |
| Unix Tools and the FITO Category Mistake: Crash Consistency and the Protocol Nature of Persistence | Paul Borrill | 2026-03-02 | Unix tools such as ls, cp, mv, and rename expose a filesystem abstraction that appears to present a single, authoritative state evolving through atomic transitions. This abstraction is false. |
| Fed-GAME: Personalized Federated Learning with Graph Attention Mixture-of-Experts For Time-Series Forecasting | Yi Li, Han Liu, Mingfeng Fan, Guo Chen, Chaojie Li, Biplab Sikdar | 2026-03-02 | Federated learning (FL) on graphs shows promise for distributed time-series forecasting. Yet, existing methods rely on static topologies and struggle with client heterogeneity. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Characterizing Information Accuracy in Timeliness-Based Gossip Networks | Emirhan Tekez, Melih Bastopcu, Sinan Gezici | 2026-03-02 | We investigate information accuracy in timeliness-based gossip networks where the source evolves according to a continuous-time Markov chain (CTMC) with M states and disseminates status updates to a... |
| Adaptive Intent-Aware PoW Mechanism in SDN for Multi-Domain SYN Flood Mitigation | Wenyang Jia | 2026-03-02 | The stability of Internet services is persistently challenged by the escalating scale of volumetric TCP SYN floods, as conventional defenses like SYN Cookies fail by exacerbating bandwidth depletion u... |
| How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks | Mohamed Amine Ferrag, Abderrahmane Lakas, Merouane Debbah | 2026-03-02 | Emerging 6G visions, reflected in ongoing standardization efforts within 3GPP, IETF, ETSI, ITU-T, and the O-RAN Alliance, increasingly characterize networks as AI-native systems in which high-level se... |
| Demonstration of a 1.2 Gbps Always-on Fully-Connected Mesh Network with RFSoC SDRs | Hatef Nouri, George Sklivanitis, Dimitris A. Pados, Elizabeth Serena Bentley | 2026-03-02 | We design and implement on Radio Frequency System-on-Chip (RFSoC) software-defined radios (SDRs) a complete-graph network of four unmanned aerial vehicles and demonstrate real-time 4K video streaming ... |
| Resilient Chaotic Cross-Layer Routing for Smart Grid IoT Networks | Dhrumil Bhatt, Anakha Kurup, R. C. Mala | 2026-03-02 | This paper presents the Distributed Adaptive Multi-Radio Cross-Layer Routing (DAMCR) protocol, designed to enhance reliability, adaptability, and energy efficiency in smart grid and industrial Interne... |
| Federated Agentic AI for Wireless Networks: Fundamentals, Approaches, and Applications | Lingyi Cai, Yu Zhang, Ruichen Zhang, Yinqiu Liu, Tao Jiang, Dusit Niyato, Wei Ni, Abbas Jamalipour | 2026-03-02 | Agentic artificial intelligence (AI) presents a promising pathway toward realizing autonomous and self-improving wireless network services. However, resource-constrained, widely distributed, and data-... |
| Predictive Importance Sampling Based Coverage Verification for Multi-UAV Trajectory Planning | Snehashish Ghosh, Sasthi C. Ghosh | 2026-03-02 | Unmanned aerial vehicle (UAV) networks are emerging as a promising solution for ultra-reliable low-latency communication (URLLC) in next-generation wireless systems. |
| Contract-based Agentic Intent Framework for Network Slicing in O-RAN | Fransiscus Asisi Bimo, Chun-Kai Lai, Zhi-Yuan Yang, Ray-Guang Cheng | 2026-03-02 | Intent-based networking aims to simplify network operation by translating operator intents into a collection of policies, configurations, and control actions. |
| Large Language Models as Bidding Agents in Repeated HetNet Auction | Ismail Lotfi, Ali Ghrayeb, Samson Lasaulce, Merouane Debbah | 2026-03-02 | This paper investigates the integration of large language models (LLMs) as reasoning agents in repeated spectrum auctions within heterogeneous networks (HetNets). |
| Regularized Diffusion-based Contract Model for Covert Semantic Entropy Control in LAENets | Yansheng Liu, Jinbo Wen, Kun Zhu, Yang Zhang, Jiawen Kang | 2026-03-02 | Low-Altitude Economy Networks (LAENets) have emerged as a critical communication paradigm for operation-critical and regulation-aware applications, where Unmanned Aerial Vehicles (UAVs) transmit task-... |
| Energy Efficient Traffic Scheduling For Optical LEO Satellite Downlinks | Ethan Fettes, Pablo G. Madoery, Halim Yanikomeroglu, Gunes Karabulut Kurt, Abhishek Naik, Stéphane Martel | 2026-03-02 | In recent years, the number of satellites in orbit has increased rapidly, with megaconstellations like Starlink providing near-global, delay-sensitive communication services. |

cs.OS - Operating Systems

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Machine Learning (ML) library in Linux kernel | Viacheslav Dubeyko | 2026-03-02 | Linux kernel is a huge code base with enormous number of subsystems and possible configuration options that results in unmanageable complexity of elaborating an efficient configuration. |

cs.PF - Performance

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| GoldbachGPU: An Open Source GPU-Accelerated Framework for Verification of Goldbach's Conjecture | Isaac Llorente-Saguer | 2026-03-02 | We present GoldbachGPU, an open-source framework for large-scale computational verification of Goldbach's conjecture using commodity GPU hardware. |
| Fast Entropy Decoding for Sparse MVM on GPUs | Emil Schätzle, Tommaso Pegolotti, Markus Püschel | 2026-03-02 | We present a novel, practical approach to speed up sparse matrix-vector multiplication (SpMVM) on GPUs. The novel key idea is to apply lossless entropy coding to further compress the sparse matrix whe... |
