Appearance
2026-03-24
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Causal AI For AMS Circuit Design: Interpretable Parameter Effects Analysis | Mohyeu Hussain, David Koblah, Reiner Dizon-Paradis, Domenic Forte | 2026-03-24 | 下载 | Analog-mixed-signal (AMS) circuits are highly non-linear and operate on continuous real-world signals, making them far more difficult to model with data-driven AI than digital blocks. |
| Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language Models | Mohammad Saleh Vahdatpour, Yanqing Zhang | 2026-03-24 | 下载 | The rapid deployment of machine learning across platforms from milliwatt-class TinyML devices to large language models has made energy efficiency a primary constraint for sustainable AI. |
| On the Vulnerability of FHE Computation to Silent Data Corruption | Jianan Mu, Ge Yu, Zhaoxuan Kan, Song Bian, Liang Kong, Zizhen Liu, Cheng Liu, Jing Ye, Huawei Li | 2026-03-24 | 下载 | Fully Homomorphic Encryption (FHE) is rapidly emerging as a promising foundation for privacy-preserving cloud services, enabling computation directly on encrypted data. |
| TRINE: A Token-Aware, Runtime-Adaptive FPGA Inference Engine for Multimodal AI | Hyunwoo Oh, Hanning Chen, Sanggeon Yun, Yang Ni, Suyeon Jang, Behnam Khaleghi, Fei Wen, Mohsen Imani | 2026-03-24 | 下载 | Multimodal stacks that mix ViTs, CNNs, GNNs, and transformer NLP strain embedded platforms because their compute/memory patterns diverge and hard real-time targets leave little slack. |
| TorR: Towards Brain-Inspired Task-Oriented Reasoning via Cache-Oriented Algorithm-Architecture Co-design | Hyunwoo Oh, SungHeon Jeong, Suyeon Jang, Hanning Chen, Sanggeon Yun, Tamoghno Das, Mohsen Imani | 2026-03-24 | 下载 | Task-oriented object detection (TOOD) atop CLIP offers open-vocabulary, prompt-driven semantics, yet dense per-window computation and heavy memory traffic hinder real-time, power-limited edge deployme... |
| Characterizing CPU-Induced Slowdowns in Multi-GPU LLM Inference | Euijun Chung, Yuxiao Jia, Aaron Jezghani, Hyesoon Kim | 2026-03-24 | 下载 | Large-scale machine learning workloads increasingly rely on multi-GPU systems, yet their performance is often limited by an overlooked component: the CPU. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| AetherWeave: Sybil-Resistant Robust Peer Discovery with Stake | Kaya Alpturer, Constantine Doumanidis, Aviv Zohar | 2026-03-24 | 下载 | Peer-discovery protocols within P2P networks are often vulnerable: because creating network identities is essentially free, adversaries can eclipse honest nodes or partition the overlay. |
| n-VM: A Multi-VM Layer-1 Architecture with Shared Identity and Token State | Jian Sheng Wang | 2026-03-24 | 下载 | Multi-chain ecosystems suffer from fragmented identity, siloed liquidity, and bridge-dependent token transfers. We present n-VM, a Layer-1 architecture that hosts n heterogeneous virtual machines as c... |
| LLM Inference at the Edge: Mobile, NPU, and GPU Performance Efficiency Trade-offs Under Sustained Load | Pranay Tummalapalli, Sahil Arayakandy, Ritam Pal, Kautuk Kundan | 2026-03-24 | 下载 | Deploying large language models on-device for always-on personal agents demands sustained inference from hardware tightly constrained in power, thermal envelope, and memory. We benchmark Qwen 2.5 1. |
| SNARE: A TRAP for Rational Players to Solve Byzantine Consensus in the 5f+1 Model | Alejandro Ranchal-Pedrosa, Benjamin Marsh | 2026-03-24 | 下载 | The TRAP protocol solves rational agreement by combining accountable consensus with a one-shot BFTCR finalization phase. We present SNARE (Scalable Nash Agreement via Reward and Exclusion), the adapta... |
| StepCache: Step-Level Reuse with Lightweight Verification and Selective Patching for LLM Serving | Azam Nouri | 2026-03-24 | 下载 | We address LLM serving workloads where repeated requests share a common solution structure but differ in localized constraints, such as output schema, variable names, or numeric constants. |
| Communication-Aware Diffusion Load Balancing for Persistently Interacting Objects | Maya Taylor, Kavitha Chandrasekar, Laxmikant V. Kale | 2026-03-24 | 下载 | Parallel applications with irregular and time-varying workloads often suffer from load imbalance. Dynamic load balancing techniques address this challenge by redistributing work during execution. |
| Rewriting TTS Inference Economics: Lightning V2 on Tenstorrent Achieves 4x Lower Cost Than NVIDIA L40S | Ranjith M. S., Akshat Mandloi, Sudarshan Kamath | 2026-03-24 | 下载 | Text-to-Speech (TTS) models are significantly more numerically fragile than Large Language Models (LLMs) due to their continuous waveform generation and perceptual sensitivity to small numerical pertu... |
| PCR: A Prefetch-Enhanced Cache Reuse System for Low-Latency RAG Serving | Wenfeng Wang, Xiaofeng Hou, Peng Tang, Hengyi Zhou, Jing Wang, Xinkai Wang, Chao Li, Minyi Guo | 2026-03-24 | 下载 | Retrieval-Augmented Generation (RAG) systems enhance the performance of large language models (LLMs) by incorporating supplementary retrieved documents, enabling more accurate and context-aware respon... |
| Characterizing CPU-Induced Slowdowns in Multi-GPU LLM Inference | Euijun Chung, Yuxiao Jia, Aaron Jezghani, Hyesoon Kim | 2026-03-24 | 下载 | Large-scale machine learning workloads increasingly rely on multi-GPU systems, yet their performance is often limited by an overlooked component: the CPU. |
| Rank-Aware Resource Scheduling for Tightly-Coupled MPI Workloads on Kubernetes | Tianfang Xie | 2026-03-24 | 下载 | Fully provisioned Message Passing Interface (MPI) parallelism achieves near-optimal wall-clock time for Computational Fluid Dynamics (CFD) solvers. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| AI-driven Intent-Based Networking Approach for Self-configuration of Next Generation Networks | Md. Kamrul Hossain, Walid Aljoby | 2026-03-24 | 下载 | Intent-Based Networking (IBN) aims to simplify operating heterogeneous infrastructures by translating high-level intents into enforceable policies and assuring compliance. |
| AgenticNet: Utilizing AI Coding Agents To Create Hybrid Network Experiments | Majd Latah, Kubra Kalkan | 2026-03-24 | 下载 | Traditional network experiments focus on validation through either simulation or emulation. Each approach has its own advantages and limitations. |
| Scalable Air-to-Ground Wireless Channel Modeling Using Environmental Context and Generative Diffusion | Jingyi Tian, Lin Cai | 2026-03-24 | 下载 | The fast motion of Low Earth Orbit (LEO) satellites causes the propagation channel to vary rapidly, and its behavior is strongly shaped by the surrounding environment, especially at low elevation angl... |
| Index-Based Scheduling for a Resource-Constrained Quantum Switch | Subhankar Banerjee, Stavros Mitrolaris, Sennur Ulukus | 2026-03-24 | 下载 | We consider a quantum switch with a finite number of quantum memory registers that aims to serve multipartite entanglement requests among users. |
| A Joint Reinforcement Learning Scheduling and Compression Framework for Teleoperated Driving | Giacomo Avanzi, Marco Giordani, Michele Zorzi | 2026-03-24 | 下载 | Teleoperated driving (TD) is envisioned as a key application of future sixth generation (6G) networks. In this paradigm, connected vehicles transmit sensor-perception data to a remote (software) drive... |
| What a Mesh: Formal Security Analysis of WPA3 SAE Wireless Authentication | Roberto Metere, Mario Lilli, Luca Arnaboldi, Elvinia Riccobene | 2026-03-24 | 下载 | The latest Wi-Fi security standard, IEEE 802.11, includes a secure authentication protocol called SAE, whose use is mandatory for WPA3-Personal networks. |
| Can NR-V2X Sidelink support A2A links? | Vittorio Todisco, Alessandro Bazzi | 2026-03-24 | 下载 | In the context of 5G, 3GPP introduced New Radio vehicle to everything (NR-V2X) for direct vehicle-to-vehicle communication. However, starting from Release 18 the focus of the standard has been expande... |
| PNap: Lifecycle-aware Edge Multi-state sleep for Energy Efficient MEC | Federico Giarrè, Holger Karl | 2026-03-24 | 下载 | Multi-access Edge Computings (MECs) enables low-latency services by executing applications at the network edge. To fulfill low-latency requirements of mobile users, providers have to keep multiple edg... |
| Modeling Edge-to-Cloud Offloading Workloads for Autonomous Vehicles | Longkun Li, Evangelos Pournaras | 2026-03-24 | 下载 | Autonomous vehicles generate large volumes of data for applications such as fleet monitoring, model retraining, and high-definition map updates. |
| AI Lifecycle-Aware Feasibility Framework for Split-RIC Orchestration in NTN O-RAN | Daniele Tarchi | 2026-03-24 | 下载 | Integrating Artificial Intelligence (AI) into Non-Terrestrial Networks (NTN) is constrained by the joint limits of satellite SWaP and feeder-link capacity, which directly impact O-RAN closed-loop cont... |
| RF-Zero-Wire: Design and Analysis of Multi-Hop Low-latency Symbol-synchronous RF Communication | Xinlei Liu, Andrey Belogaev, Jonathan Oostvogels, Bingwu Fang, Danny Hughes, Jeroen Famaey | 2026-03-24 | 下载 | The latency gap between wired and wireless networks poses a challenge in the adoption of wireless technologies in latency-sensitive scenarios. |
| Symbol-Synchronous Communication for Ultra-Low-Power Multi-Hop Ambient IoT Networks | Xinlei Liu, Andrey Belogaev, Jeroen Famaey | 2026-03-24 | 下载 | Ambient Internet of Things (A-IoT) devices, as a critical enabler of future green IoT networks, have attracted broad interest from both industry and academia due to their ability to operate without ba... |
| Digital Twin Enabled Simultaneous Learning and Modeling for UAV-assisted Secure Communications with Eavesdropping Attacks | Jieting Yuan, Songhan Zhao, Ye Xue, Yu Zhao, Bo Gu, Shimin Gong | 2026-03-24 | 下载 | This paper focuses on secure communications in UAV-assisted wireless networks, which comprise multiple legitimate UAVs (LE-UAVs) and an intelligent eavesdropping UAV (EA-UAV). |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| StepCache: Step-Level Reuse with Lightweight Verification and Selective Patching for LLM Serving | Azam Nouri | 2026-03-24 | 下载 | We address LLM serving workloads where repeated requests share a common solution structure but differ in localized constraints, such as output schema, variable names, or numeric constants. |
| Wayfinder: Automated Operating System Specialization | Alexander Jung, Cezar Crăciunoiu, Nikolaos Karaolidis, Hugo Lefeuvre, Daniel Oñoro Rubio, Felipe Huici, Charalampos Rotsos, Pierre Olivier | 2026-03-24 | 下载 | Specializing an OS to optimize the performance of a particular application is typically a manual process that requires great expertise. Specialization through configuration lends itself well to automa... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Numerical Kernels on a Spatial Accelerator: A Study of Tenstorrent Wormhole | Maya Taylor, Carl Pearson, Luc Berger-Vergiat, Giovanni Long, Jan Ciesko | 2026-03-24 | 下载 | As AI accelerators gain prominence, their potential for traditional scientific computing workloads remains unclear. This paper explores Tenstorrent's Wormhole architecture, a spatial computing platfor... |
| Communication-Aware Diffusion Load Balancing for Persistently Interacting Objects | Maya Taylor, Kavitha Chandrasekar, Laxmikant V. Kale | 2026-03-24 | 下载 | Parallel applications with irregular and time-varying workloads often suffer from load imbalance. Dynamic load balancing techniques address this challenge by redistributing work during execution. |