2026-03-24

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Causal AI For AMS Circuit Design: Interpretable Parameter Effects Analysis	Mohyeu Hussain, David Koblah, Reiner Dizon-Paradis, Domenic Forte	2026-03-24	下载	Analog-mixed-signal (AMS) circuits are highly non-linear and operate on continuous real-world signals, making them far more difficult to model with data-driven AI than digital blocks.
Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language Models	Mohammad Saleh Vahdatpour, Yanqing Zhang	2026-03-24	下载	The rapid deployment of machine learning across platforms from milliwatt-class TinyML devices to large language models has made energy efficiency a primary constraint for sustainable AI.
On the Vulnerability of FHE Computation to Silent Data Corruption	Jianan Mu, Ge Yu, Zhaoxuan Kan, Song Bian, Liang Kong, Zizhen Liu, Cheng Liu, Jing Ye, Huawei Li	2026-03-24	下载	Fully Homomorphic Encryption (FHE) is rapidly emerging as a promising foundation for privacy-preserving cloud services, enabling computation directly on encrypted data.
TRINE: A Token-Aware, Runtime-Adaptive FPGA Inference Engine for Multimodal AI	Hyunwoo Oh, Hanning Chen, Sanggeon Yun, Yang Ni, Suyeon Jang, Behnam Khaleghi, Fei Wen, Mohsen Imani	2026-03-24	下载	Multimodal stacks that mix ViTs, CNNs, GNNs, and transformer NLP strain embedded platforms because their compute/memory patterns diverge and hard real-time targets leave little slack.
TorR: Towards Brain-Inspired Task-Oriented Reasoning via Cache-Oriented Algorithm-Architecture Co-design	Hyunwoo Oh, SungHeon Jeong, Suyeon Jang, Hanning Chen, Sanggeon Yun, Tamoghno Das, Mohsen Imani	2026-03-24	下载	Task-oriented object detection (TOOD) atop CLIP offers open-vocabulary, prompt-driven semantics, yet dense per-window computation and heavy memory traffic hinder real-time, power-limited edge deployme...
Characterizing CPU-Induced Slowdowns in Multi-GPU LLM Inference	Euijun Chung, Yuxiao Jia, Aaron Jezghani, Hyesoon Kim	2026-03-24	下载	Large-scale machine learning workloads increasingly rely on multi-GPU systems, yet their performance is often limited by an overlooked component: the CPU.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
AetherWeave: Sybil-Resistant Robust Peer Discovery with Stake	Kaya Alpturer, Constantine Doumanidis, Aviv Zohar	2026-03-24	下载	Peer-discovery protocols within P2P networks are often vulnerable: because creating network identities is essentially free, adversaries can eclipse honest nodes or partition the overlay.
n-VM: A Multi-VM Layer-1 Architecture with Shared Identity and Token State	Jian Sheng Wang	2026-03-24	下载	Multi-chain ecosystems suffer from fragmented identity, siloed liquidity, and bridge-dependent token transfers. We present n-VM, a Layer-1 architecture that hosts n heterogeneous virtual machines as c...
LLM Inference at the Edge: Mobile, NPU, and GPU Performance Efficiency Trade-offs Under Sustained Load	Pranay Tummalapalli, Sahil Arayakandy, Ritam Pal, Kautuk Kundan	2026-03-24	下载	Deploying large language models on-device for always-on personal agents demands sustained inference from hardware tightly constrained in power, thermal envelope, and memory. We benchmark Qwen 2.5 1.
SNARE: A TRAP for Rational Players to Solve Byzantine Consensus in the 5f+1 Model	Alejandro Ranchal-Pedrosa, Benjamin Marsh	2026-03-24	下载	The TRAP protocol solves rational agreement by combining accountable consensus with a one-shot BFTCR finalization phase. We present SNARE (Scalable Nash Agreement via Reward and Exclusion), the adapta...
StepCache: Step-Level Reuse with Lightweight Verification and Selective Patching for LLM Serving	Azam Nouri	2026-03-24	下载	We address LLM serving workloads where repeated requests share a common solution structure but differ in localized constraints, such as output schema, variable names, or numeric constants.
Communication-Aware Diffusion Load Balancing for Persistently Interacting Objects	Maya Taylor, Kavitha Chandrasekar, Laxmikant V. Kale	2026-03-24	下载	Parallel applications with irregular and time-varying workloads often suffer from load imbalance. Dynamic load balancing techniques address this challenge by redistributing work during execution.
Rewriting TTS Inference Economics: Lightning V2 on Tenstorrent Achieves 4x Lower Cost Than NVIDIA L40S	Ranjith M. S., Akshat Mandloi, Sudarshan Kamath	2026-03-24	下载	Text-to-Speech (TTS) models are significantly more numerically fragile than Large Language Models (LLMs) due to their continuous waveform generation and perceptual sensitivity to small numerical pertu...
PCR: A Prefetch-Enhanced Cache Reuse System for Low-Latency RAG Serving	Wenfeng Wang, Xiaofeng Hou, Peng Tang, Hengyi Zhou, Jing Wang, Xinkai Wang, Chao Li, Minyi Guo	2026-03-24	下载	Retrieval-Augmented Generation (RAG) systems enhance the performance of large language models (LLMs) by incorporating supplementary retrieved documents, enabling more accurate and context-aware respon...
Characterizing CPU-Induced Slowdowns in Multi-GPU LLM Inference	Euijun Chung, Yuxiao Jia, Aaron Jezghani, Hyesoon Kim	2026-03-24	下载	Large-scale machine learning workloads increasingly rely on multi-GPU systems, yet their performance is often limited by an overlooked component: the CPU.
Rank-Aware Resource Scheduling for Tightly-Coupled MPI Workloads on Kubernetes	Tianfang Xie	2026-03-24	下载	Fully provisioned Message Passing Interface (MPI) parallelism achieves near-optimal wall-clock time for Computational Fluid Dynamics (CFD) solvers.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
AI-driven Intent-Based Networking Approach for Self-configuration of Next Generation Networks	Md. Kamrul Hossain, Walid Aljoby	2026-03-24	下载	Intent-Based Networking (IBN) aims to simplify operating heterogeneous infrastructures by translating high-level intents into enforceable policies and assuring compliance.
AgenticNet: Utilizing AI Coding Agents To Create Hybrid Network Experiments	Majd Latah, Kubra Kalkan	2026-03-24	下载	Traditional network experiments focus on validation through either simulation or emulation. Each approach has its own advantages and limitations.
Scalable Air-to-Ground Wireless Channel Modeling Using Environmental Context and Generative Diffusion	Jingyi Tian, Lin Cai	2026-03-24	下载	The fast motion of Low Earth Orbit (LEO) satellites causes the propagation channel to vary rapidly, and its behavior is strongly shaped by the surrounding environment, especially at low elevation angl...
Index-Based Scheduling for a Resource-Constrained Quantum Switch	Subhankar Banerjee, Stavros Mitrolaris, Sennur Ulukus	2026-03-24	下载	We consider a quantum switch with a finite number of quantum memory registers that aims to serve multipartite entanglement requests among $N$ users.
A Joint Reinforcement Learning Scheduling and Compression Framework for Teleoperated Driving	Giacomo Avanzi, Marco Giordani, Michele Zorzi	2026-03-24	下载	Teleoperated driving (TD) is envisioned as a key application of future sixth generation (6G) networks. In this paradigm, connected vehicles transmit sensor-perception data to a remote (software) drive...
What a Mesh: Formal Security Analysis of WPA3 SAE Wireless Authentication	Roberto Metere, Mario Lilli, Luca Arnaboldi, Elvinia Riccobene	2026-03-24	下载	The latest Wi-Fi security standard, IEEE 802.11, includes a secure authentication protocol called SAE, whose use is mandatory for WPA3-Personal networks.
Can NR-V2X Sidelink support A2A links?	Vittorio Todisco, Alessandro Bazzi	2026-03-24	下载	In the context of 5G, 3GPP introduced New Radio vehicle to everything (NR-V2X) for direct vehicle-to-vehicle communication. However, starting from Release 18 the focus of the standard has been expande...
PNap: Lifecycle-aware Edge Multi-state sleep for Energy Efficient MEC	Federico Giarrè, Holger Karl	2026-03-24	下载	Multi-access Edge Computings (MECs) enables low-latency services by executing applications at the network edge. To fulfill low-latency requirements of mobile users, providers have to keep multiple edg...
Modeling Edge-to-Cloud Offloading Workloads for Autonomous Vehicles	Longkun Li, Evangelos Pournaras	2026-03-24	下载	Autonomous vehicles generate large volumes of data for applications such as fleet monitoring, model retraining, and high-definition map updates.
AI Lifecycle-Aware Feasibility Framework for Split-RIC Orchestration in NTN O-RAN	Daniele Tarchi	2026-03-24	下载	Integrating Artificial Intelligence (AI) into Non-Terrestrial Networks (NTN) is constrained by the joint limits of satellite SWaP and feeder-link capacity, which directly impact O-RAN closed-loop cont...
RF-Zero-Wire: Design and Analysis of Multi-Hop Low-latency Symbol-synchronous RF Communication	Xinlei Liu, Andrey Belogaev, Jonathan Oostvogels, Bingwu Fang, Danny Hughes, Jeroen Famaey	2026-03-24	下载	The latency gap between wired and wireless networks poses a challenge in the adoption of wireless technologies in latency-sensitive scenarios.
Symbol-Synchronous Communication for Ultra-Low-Power Multi-Hop Ambient IoT Networks	Xinlei Liu, Andrey Belogaev, Jeroen Famaey	2026-03-24	下载	Ambient Internet of Things (A-IoT) devices, as a critical enabler of future green IoT networks, have attracted broad interest from both industry and academia due to their ability to operate without ba...
Digital Twin Enabled Simultaneous Learning and Modeling for UAV-assisted Secure Communications with Eavesdropping Attacks	Jieting Yuan, Songhan Zhao, Ye Xue, Yu Zhao, Bo Gu, Shimin Gong	2026-03-24	下载	This paper focuses on secure communications in UAV-assisted wireless networks, which comprise multiple legitimate UAVs (LE-UAVs) and an intelligent eavesdropping UAV (EA-UAV).

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
StepCache: Step-Level Reuse with Lightweight Verification and Selective Patching for LLM Serving	Azam Nouri	2026-03-24	下载	We address LLM serving workloads where repeated requests share a common solution structure but differ in localized constraints, such as output schema, variable names, or numeric constants.
Wayfinder: Automated Operating System Specialization	Alexander Jung, Cezar Crăciunoiu, Nikolaos Karaolidis, Hugo Lefeuvre, Daniel Oñoro Rubio, Felipe Huici, Charalampos Rotsos, Pierre Olivier	2026-03-24	下载	Specializing an OS to optimize the performance of a particular application is typically a manual process that requires great expertise. Specialization through configuration lends itself well to automa...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Numerical Kernels on a Spatial Accelerator: A Study of Tenstorrent Wormhole	Maya Taylor, Carl Pearson, Luc Berger-Vergiat, Giovanni Long, Jan Ciesko	2026-03-24	下载	As AI accelerators gain prominence, their potential for traditional scientific computing workloads remains unclear. This paper explores Tenstorrent's Wormhole architecture, a spatial computing platfor...
Communication-Aware Diffusion Load Balancing for Persistently Interacting Objects	Maya Taylor, Kavitha Chandrasekar, Laxmikant V. Kale	2026-03-24	下载	Parallel applications with irregular and time-varying workloads often suffer from load imbalance. Dynamic load balancing techniques address this challenge by redistributing work during execution.