Skip to content

2026-03-24

cs.AR - Architecture

标题作者发布日期PDF摘要
Causal AI For AMS Circuit Design: Interpretable Parameter Effects AnalysisMohyeu Hussain, David Koblah, Reiner Dizon-Paradis, Domenic Forte2026-03-24下载Analog-mixed-signal (AMS) circuits are highly non-linear and operate on continuous real-world signals, making them far more difficult to model with data-driven AI than digital blocks.
Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language ModelsMohammad Saleh Vahdatpour, Yanqing Zhang2026-03-24下载The rapid deployment of machine learning across platforms from milliwatt-class TinyML devices to large language models has made energy efficiency a primary constraint for sustainable AI.
On the Vulnerability of FHE Computation to Silent Data CorruptionJianan Mu, Ge Yu, Zhaoxuan Kan, Song Bian, Liang Kong, Zizhen Liu, Cheng Liu, Jing Ye, Huawei Li2026-03-24下载Fully Homomorphic Encryption (FHE) is rapidly emerging as a promising foundation for privacy-preserving cloud services, enabling computation directly on encrypted data.
TRINE: A Token-Aware, Runtime-Adaptive FPGA Inference Engine for Multimodal AIHyunwoo Oh, Hanning Chen, Sanggeon Yun, Yang Ni, Suyeon Jang, Behnam Khaleghi, Fei Wen, Mohsen Imani2026-03-24下载Multimodal stacks that mix ViTs, CNNs, GNNs, and transformer NLP strain embedded platforms because their compute/memory patterns diverge and hard real-time targets leave little slack.
TorR: Towards Brain-Inspired Task-Oriented Reasoning via Cache-Oriented Algorithm-Architecture Co-designHyunwoo Oh, SungHeon Jeong, Suyeon Jang, Hanning Chen, Sanggeon Yun, Tamoghno Das, Mohsen Imani2026-03-24下载Task-oriented object detection (TOOD) atop CLIP offers open-vocabulary, prompt-driven semantics, yet dense per-window computation and heavy memory traffic hinder real-time, power-limited edge deployme...
Characterizing CPU-Induced Slowdowns in Multi-GPU LLM InferenceEuijun Chung, Yuxiao Jia, Aaron Jezghani, Hyesoon Kim2026-03-24下载Large-scale machine learning workloads increasingly rely on multi-GPU systems, yet their performance is often limited by an overlooked component: the CPU.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
AetherWeave: Sybil-Resistant Robust Peer Discovery with StakeKaya Alpturer, Constantine Doumanidis, Aviv Zohar2026-03-24下载Peer-discovery protocols within P2P networks are often vulnerable: because creating network identities is essentially free, adversaries can eclipse honest nodes or partition the overlay.
n-VM: A Multi-VM Layer-1 Architecture with Shared Identity and Token StateJian Sheng Wang2026-03-24下载Multi-chain ecosystems suffer from fragmented identity, siloed liquidity, and bridge-dependent token transfers. We present n-VM, a Layer-1 architecture that hosts n heterogeneous virtual machines as c...
LLM Inference at the Edge: Mobile, NPU, and GPU Performance Efficiency Trade-offs Under Sustained LoadPranay Tummalapalli, Sahil Arayakandy, Ritam Pal, Kautuk Kundan2026-03-24下载Deploying large language models on-device for always-on personal agents demands sustained inference from hardware tightly constrained in power, thermal envelope, and memory. We benchmark Qwen 2.5 1.
SNARE: A TRAP for Rational Players to Solve Byzantine Consensus in the 5f+1 ModelAlejandro Ranchal-Pedrosa, Benjamin Marsh2026-03-24下载The TRAP protocol solves rational agreement by combining accountable consensus with a one-shot BFTCR finalization phase. We present SNARE (Scalable Nash Agreement via Reward and Exclusion), the adapta...
StepCache: Step-Level Reuse with Lightweight Verification and Selective Patching for LLM ServingAzam Nouri2026-03-24下载We address LLM serving workloads where repeated requests share a common solution structure but differ in localized constraints, such as output schema, variable names, or numeric constants.
Communication-Aware Diffusion Load Balancing for Persistently Interacting ObjectsMaya Taylor, Kavitha Chandrasekar, Laxmikant V. Kale2026-03-24下载Parallel applications with irregular and time-varying workloads often suffer from load imbalance. Dynamic load balancing techniques address this challenge by redistributing work during execution.
Rewriting TTS Inference Economics: Lightning V2 on Tenstorrent Achieves 4x Lower Cost Than NVIDIA L40SRanjith M. S., Akshat Mandloi, Sudarshan Kamath2026-03-24下载Text-to-Speech (TTS) models are significantly more numerically fragile than Large Language Models (LLMs) due to their continuous waveform generation and perceptual sensitivity to small numerical pertu...
PCR: A Prefetch-Enhanced Cache Reuse System for Low-Latency RAG ServingWenfeng Wang, Xiaofeng Hou, Peng Tang, Hengyi Zhou, Jing Wang, Xinkai Wang, Chao Li, Minyi Guo2026-03-24下载Retrieval-Augmented Generation (RAG) systems enhance the performance of large language models (LLMs) by incorporating supplementary retrieved documents, enabling more accurate and context-aware respon...
Characterizing CPU-Induced Slowdowns in Multi-GPU LLM InferenceEuijun Chung, Yuxiao Jia, Aaron Jezghani, Hyesoon Kim2026-03-24下载Large-scale machine learning workloads increasingly rely on multi-GPU systems, yet their performance is often limited by an overlooked component: the CPU.
Rank-Aware Resource Scheduling for Tightly-Coupled MPI Workloads on KubernetesTianfang Xie2026-03-24下载Fully provisioned Message Passing Interface (MPI) parallelism achieves near-optimal wall-clock time for Computational Fluid Dynamics (CFD) solvers.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
AI-driven Intent-Based Networking Approach for Self-configuration of Next Generation NetworksMd. Kamrul Hossain, Walid Aljoby2026-03-24下载Intent-Based Networking (IBN) aims to simplify operating heterogeneous infrastructures by translating high-level intents into enforceable policies and assuring compliance.
AgenticNet: Utilizing AI Coding Agents To Create Hybrid Network ExperimentsMajd Latah, Kubra Kalkan2026-03-24下载Traditional network experiments focus on validation through either simulation or emulation. Each approach has its own advantages and limitations.
Scalable Air-to-Ground Wireless Channel Modeling Using Environmental Context and Generative DiffusionJingyi Tian, Lin Cai2026-03-24下载The fast motion of Low Earth Orbit (LEO) satellites causes the propagation channel to vary rapidly, and its behavior is strongly shaped by the surrounding environment, especially at low elevation angl...
Index-Based Scheduling for a Resource-Constrained Quantum SwitchSubhankar Banerjee, Stavros Mitrolaris, Sennur Ulukus2026-03-24下载We consider a quantum switch with a finite number of quantum memory registers that aims to serve multipartite entanglement requests among NN users.
A Joint Reinforcement Learning Scheduling and Compression Framework for Teleoperated DrivingGiacomo Avanzi, Marco Giordani, Michele Zorzi2026-03-24下载Teleoperated driving (TD) is envisioned as a key application of future sixth generation (6G) networks. In this paradigm, connected vehicles transmit sensor-perception data to a remote (software) drive...
What a Mesh: Formal Security Analysis of WPA3 SAE Wireless AuthenticationRoberto Metere, Mario Lilli, Luca Arnaboldi, Elvinia Riccobene2026-03-24下载The latest Wi-Fi security standard, IEEE 802.11, includes a secure authentication protocol called SAE, whose use is mandatory for WPA3-Personal networks.
Can NR-V2X Sidelink support A2A links?Vittorio Todisco, Alessandro Bazzi2026-03-24下载In the context of 5G, 3GPP introduced New Radio vehicle to everything (NR-V2X) for direct vehicle-to-vehicle communication. However, starting from Release 18 the focus of the standard has been expande...
PNap: Lifecycle-aware Edge Multi-state sleep for Energy Efficient MECFederico Giarrè, Holger Karl2026-03-24下载Multi-access Edge Computings (MECs) enables low-latency services by executing applications at the network edge. To fulfill low-latency requirements of mobile users, providers have to keep multiple edg...
Modeling Edge-to-Cloud Offloading Workloads for Autonomous VehiclesLongkun Li, Evangelos Pournaras2026-03-24下载Autonomous vehicles generate large volumes of data for applications such as fleet monitoring, model retraining, and high-definition map updates.
AI Lifecycle-Aware Feasibility Framework for Split-RIC Orchestration in NTN O-RANDaniele Tarchi2026-03-24下载Integrating Artificial Intelligence (AI) into Non-Terrestrial Networks (NTN) is constrained by the joint limits of satellite SWaP and feeder-link capacity, which directly impact O-RAN closed-loop cont...
RF-Zero-Wire: Design and Analysis of Multi-Hop Low-latency Symbol-synchronous RF CommunicationXinlei Liu, Andrey Belogaev, Jonathan Oostvogels, Bingwu Fang, Danny Hughes, Jeroen Famaey2026-03-24下载The latency gap between wired and wireless networks poses a challenge in the adoption of wireless technologies in latency-sensitive scenarios.
Symbol-Synchronous Communication for Ultra-Low-Power Multi-Hop Ambient IoT NetworksXinlei Liu, Andrey Belogaev, Jeroen Famaey2026-03-24下载Ambient Internet of Things (A-IoT) devices, as a critical enabler of future green IoT networks, have attracted broad interest from both industry and academia due to their ability to operate without ba...
Digital Twin Enabled Simultaneous Learning and Modeling for UAV-assisted Secure Communications with Eavesdropping AttacksJieting Yuan, Songhan Zhao, Ye Xue, Yu Zhao, Bo Gu, Shimin Gong2026-03-24下载This paper focuses on secure communications in UAV-assisted wireless networks, which comprise multiple legitimate UAVs (LE-UAVs) and an intelligent eavesdropping UAV (EA-UAV).

cs.OS - Operating Systems

标题作者发布日期PDF摘要
StepCache: Step-Level Reuse with Lightweight Verification and Selective Patching for LLM ServingAzam Nouri2026-03-24下载We address LLM serving workloads where repeated requests share a common solution structure but differ in localized constraints, such as output schema, variable names, or numeric constants.
Wayfinder: Automated Operating System SpecializationAlexander Jung, Cezar Crăciunoiu, Nikolaos Karaolidis, Hugo Lefeuvre, Daniel Oñoro Rubio, Felipe Huici, Charalampos Rotsos, Pierre Olivier2026-03-24下载Specializing an OS to optimize the performance of a particular application is typically a manual process that requires great expertise. Specialization through configuration lends itself well to automa...

cs.PF - Performance

标题作者发布日期PDF摘要
Numerical Kernels on a Spatial Accelerator: A Study of Tenstorrent WormholeMaya Taylor, Carl Pearson, Luc Berger-Vergiat, Giovanni Long, Jan Ciesko2026-03-24下载As AI accelerators gain prominence, their potential for traditional scientific computing workloads remains unclear. This paper explores Tenstorrent's Wormhole architecture, a spatial computing platfor...
Communication-Aware Diffusion Load Balancing for Persistently Interacting ObjectsMaya Taylor, Kavitha Chandrasekar, Laxmikant V. Kale2026-03-24下载Parallel applications with irregular and time-varying workloads often suffer from load imbalance. Dynamic load balancing techniques address this challenge by redistributing work during execution.

基于 VitePress 构建