2026-03-10

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Unifying Logical and Physical Layout Representations via Heterogeneous Graphs for Circuit Congestion Prediction	Runbang Hu, Bo Fang, Bingzhe Li, Yuede Ji	2026-03-10	下载	As Very Large Scale Integration (VLSI) designs continue to scale in size and complexity, layout verification has become a central challenge in modern Electronic Design Automation (EDA) workflows.
Hardware Efficient Approximate Convolution with Tunable Error Tolerance for CNNs	Vishal Shashidhar, Anupam Kumari, Roy P Paily	2026-03-10	下载	Modern CNNs' high computational demands hinder edge deployment, as traditional ``hard'' sparsity (skipping mathematical zeros) loses effectiveness in deep layers or with smooth activations like Tanh.
Pooling Engram Conditional Memory in Large Language Models using CXL	Ruiyang Ma, Teng Ma, Zhiyuan Su, Hantian Zha, Xinpeng Zhao, Xuchun Shang, Xingrui Yi, Zheng Liu, Zhu Cao, An Wu, Zhichong Dou, Ziqian Liu, Daikang Kuang, Guojie Luo	2026-03-10	下载	Engram conditional memory has emerged as a promising component for LLMs by decoupling static knowledge lookup from dynamic computation. Since Engram exhibits sparse access patterns and supports prefet...
Nemo: A Low-Write-Amplification Cache for Tiny Objects on Log-Structured Flash Devices	Xufeng Yang, Tingting Tan, Jingxin Hu, Congming Gao, Mingyang Liu, Tianyang Jiang, Jian Chen, Linbo Long, Yina Lv, Jiwu Shu	2026-03-10	下载	Modern storage systems predominantly use flash-based SSDs as a cache layer due to their favorable performance and cost efficiency. However, in tiny-object workloads, existing flash cache designs still...
TrainDeeploy: Hardware-Accelerated Parameter-Efficient Fine-Tuning of Small Transformer Models at the Extreme Edge	Run Wang, Victor J. B. Jung, Philip Wiese, Francesco Conti, Alessio Burrello, Luca Benini	2026-03-10	下载	On-device tuning of deep neural networks enables long-term adaptation at the edge while preserving data privacy. However, the high computational and memory demands of backpropagation pose significant ...
DendroNN: Dendrocentric Neural Networks for Energy-Efficient Classification of Event-Based Data	Jann Krausse, Zhe Su, Kyrus Mama, Maryada, Klaus Knobloch, Giacomo Indiveri, Jürgen Becker	2026-03-10	下载	Spatiotemporal information is at the core of diverse sensory processing and computational tasks. Feed-forward spiking neural networks can be used to solve these tasks while offering potential benefits...
Wrong Code, Right Structure: Learning Netlist Representations from Imperfect LLM-Generated RTL	Siyang Cai, Cangyuan Li, Yinhe Han, Ying Wang	2026-03-10	下载	Learning effective netlist representations is fundamentally constrained by the scarcity of labeled datasets, as real designs are protected by Intellectual Property (IP) and costly to annotate.
Two Teachers Better Than One: Hardware-Physics Co-Guided Distributed Scientific Machine Learning	Yuchen Yuan, Junhuan Yang, Hao Wan, Yipei Liu, Hanhan Wu, Youzuo Lin, Lei Yang	2026-03-10	下载	Scientific machine learning (SciML) is increasingly applied to in-field processing, controlling, and monitoring; however, wide-area sensing, real-time demands, and strict energy and reliability constr...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
ACE Runtime - A ZKP-Native Blockchain Runtime with Sub-Second Cryptographic Finality	Jian Sheng Wang	2026-03-10	下载	Existing high performance blockchains verify one signature per transaction on the critical path, which creates O(N) verification cost, high hardware pressure, and difficult post quantum migration.
The Bureaucracy of Speed: Structural Equivalence Between Memory Consistency Models and Multi-Agent Authorization Revocation	Vladyslav Parakhin	2026-03-10	下载	The temporal assumptions underpinning conventional Identity and Access Management collapse under agentic execution regimes. A sixty-second revocation window permits on the order of $6 \times 10^3$ una...
Rate-Distortion Bounds for Heterogeneous Random Fields on Finite Lattices	Sujata Sinha, Vishwas Rao, Robert Underwood, David Lenz, Sheng Di, Franck Cappello, Lingjia Liu	2026-03-10	下载	Since Shannon's foundational work, rate-distortion theory has defined the fundamental limits of lossy compression. Classical results, derived for memoryless and stationary ergodic sources in the asymp...
Ensuring Data Freshness in Multi-Rate Task Chains Scheduling	José Luis Conradi Hoffmann, Antônio Augusto Fröhlich	2026-03-10	下载	In safety-critical autonomous systems, data freshness presents a fundamental design challenge. While the Logical Execution Time (LET) paradigm ensures compositional determinism, it often does so at th...
Pooling Engram Conditional Memory in Large Language Models using CXL	Ruiyang Ma, Teng Ma, Zhiyuan Su, Hantian Zha, Xinpeng Zhao, Xuchun Shang, Xingrui Yi, Zheng Liu, Zhu Cao, An Wu, Zhichong Dou, Ziqian Liu, Daikang Kuang, Guojie Luo	2026-03-10	下载	Engram conditional memory has emerged as a promising component for LLMs by decoupling static knowledge lookup from dynamic computation. Since Engram exhibits sparse access patterns and supports prefet...
Multi-DNN Inference of Sparse Models on Edge SoCs	Jiawei Luo, Di Wu, Simon Dobson, Blesson Varghese	2026-03-10	下载	Modern edge applications increasingly require multi-DNN inference systems to execute tasks on heterogeneous processors, gaining performance from both concurrent execution and from matching each model ...
Randomized Distributed Function Computation (RDFC): Ultra-Efficient Semantic Communication Applications to Privacy	Onur Günlü	2026-03-10	下载	We establish the randomized distributed function computation (RDFC) framework, in which a sender transmits just enough information for a receiver to generate a randomized function of the input data.
Case Study: Performance Analysis of a Virtualized XRootD Frontend in Large-Scale WAN Transfers	J M da Silva, M A Costa, R L Iope	2026-03-10	下载	This paper presents a detailed case study of the T2_BR_SPRACE storage frontend architecture and its observed performance in high-intensity data transfers.
Compiler-First State Space Duality and Portable $O(1)$ Autoregressive Caching for Inference	Cosmo Santoni	2026-03-10	下载	State-space model releases are typically coupled to fused CUDA and Triton kernels, inheriting a hard dependency on NVIDIA hardware. We show that Mamba-2's state space duality algorithm -- diagonal sta...
Flash-KMeans: Fast and Memory-Efficient Exact K-Means	Shuo Yang, Haocheng Xi, Yilong Zhao, Muyang Li, Xiaoze Fan, Jintao Zhang, Han Cai, Yujun Lin, Xiuyu Li, Kurt Keutzer, Song Han, Chenfeng Xu, Ion Stoica	2026-03-10	下载	$k$ -means has historically been positioned primarily as an offline processing primitive, typically used for dataset organization or embedding preprocessing rather than as a first-class component in on...
PIM-SHERPA: Software Method for On-device LLM Inference by Resolving PIM Memory Attribute and Layout Inconsistencies	Sunjung Lee, Sanghoon Cha, Hyeonsu Kim, Seungwoo Seo, Yuhwan Ro, Sukhan Lee, Byeongho Kim, Yongjun Park, Kyomin Sohn, Seungwon Lee, Jaehoon Yu	2026-03-10	下载	On-device deployments of large language models (LLMs) are rapidly proliferating across mobile and edge platforms. LLM inference comprises a compute-intensive prefill phase and a memory bandwidth-inten...
Hierarchical Observe-Orient-Decide-Act Enabled UAV Swarms in Uncertain Environments: Frameworks, Potentials, and Challenges	Ziye Jia, Yao Wu, Qihui Wu, Lijun He, Qiuming Zhu, Fuhui Zhou, Zhu Han	2026-03-10	下载	Unmanned aerial vehicle (UAV) swarms are increasingly explored for their potentials in various applications such as surveillance, disaster response, and military.
Nezha: A Key-Value Separated Distributed Store with Optimized Raft Integration	Yangyang Wang, Yucong Dong, Ziqian Cheng, Zichen Xu	2026-03-10	下载	Distributed key-value stores are widely adopted to support elastic big data applications, leveraging purpose-built consensus algorithms like Raft to ensure data consistency.
Accelerating High-Order Finite Element Simulations at Extreme Scale with FP64 Tensor Cores	Jiqun Tu, Ian Karlin, John Camier, Veselin Dobrev, Tzanio Kolev, Stefan Henneking, Omar Ghattas	2026-03-10	下载	Finite element simulations play a critical role in a wide range of applications, from automotive design to tsunami modeling and computational electromagnetics.
Two Teachers Better Than One: Hardware-Physics Co-Guided Distributed Scientific Machine Learning	Yuchen Yuan, Junhuan Yang, Hao Wan, Yipei Liu, Hanhan Wu, Youzuo Lin, Lei Yang	2026-03-10	下载	Scientific machine learning (SciML) is increasingly applied to in-field processing, controlling, and monitoring; however, wide-area sensing, real-time demands, and strict energy and reliability constr...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Fly-PRAC: Packet Recovery for Random Linear Network Coding	Hosein K. Nazari, Stefan Senk, Peyman Pahlevani, Juan A. Cabrera, Frank H. P. Fitzek	2026-03-10	下载	Network Coding (NC) is a compelling solution for increasing network efficiency. However, it discards corrupted packets and cannot achieve optimal performance in noisy communications.
Performance Evaluation of Delay Tolerant Network Protocols to Improve Nepal Earthquake Rescue Communications	Xiaofei Liu, Milena Radenkovic	2026-03-10	下载	In the fields of disaster rescue and communication in extreme environments, Delay Tolerant Network (DTN) has become an important technology due to its "store-carry-forward" mechanism.
Towards Flexible Spectrum Access: Data-Driven Insights into Spectrum Demand	Mohamad Alkadamani, Amir Ghasemi, Halim Yanikomeroglu	2026-03-10	下载	In the diverse landscape of 6G networks, where wireless connectivity demands surge and spectrum resources remain limited, flexible spectrum access becomes paramount.
Role Classification of Hosts within Enterprise Networks Based on Connection Patterns	Godfrey Tan, Massimiliano Poletto, John Guttag, Frans Kaashoek	2026-03-10	下载	Role classification involves grouping hosts into related roles. It exposes the logical structure of a network, simplifies network management tasks such as policy checking and network segmentation, and...
The 802.11 MAC protocol leads to inefficient equilibria	Godfrey Tan, John Guttag	2026-03-10	下载	Wireless local area networks (WLANs) based on the family of 802.11 technologies are becoming ubiquitous. These technologies support multiple data transmission rates.
A Graph-Based Approach to Spectrum Demand Prediction Using Hierarchical Attention Networks	Mohamad Alkadamani, Halim Yanikomeroglu, Amir Ghasemi	2026-03-10	下载	The surge in wireless connectivity demand, coupled with the finite nature of spectrum resources, compels the development of efficient spectrum management approaches.
PixelConfig: Longitudinal Measurement and Reverse-Engineering of Meta Pixel Configurations	Abdullah Ghani, Yash Vekaria, Zubair Shafiq	2026-03-10	下载	Tracking pixels are used to optimize online ad campaigns through personalization, re-targeting, and conversion tracking. Past research has primarily focused on detecting the prevalence of tracking pix...
PPO-Based Hybrid Optimization for RIS-Assisted Semantic Vehicular Edge Computing	Wei Feng, Jingbo Zhang, Qiong Wu, Pingyi Fan, Qiang Fan	2026-03-10	下载	To support latency-sensitive Internet of Vehicles (IoV) applications amidst dynamic environments and intermittent links, this paper proposes a Reconfigurable Intelligent Surface (RIS)-aided semantic-a...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Ensuring Data Freshness in Multi-Rate Task Chains Scheduling	José Luis Conradi Hoffmann, Antônio Augusto Fröhlich	2026-03-10	下载	In safety-critical autonomous systems, data freshness presents a fundamental design challenge. While the Logical Execution Time (LET) paradigm ensures compositional determinism, it often does so at th...
FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation	Yinpeng Wu, Yitong Chen, Lixiang Wang, Jinyu Gu, Zhichao Hua, Yubin Xia	2026-03-10	下载	Device-side Large Language Models (LLMs) have witnessed explosive growth, offering higher privacy and availability compared to cloud-side LLMs.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Multi-DNN Inference of Sparse Models on Edge SoCs	Jiawei Luo, Di Wu, Simon Dobson, Blesson Varghese	2026-03-10	下载	Modern edge applications increasingly require multi-DNN inference systems to execute tasks on heterogeneous processors, gaining performance from both concurrent execution and from matching each model ...
Compiler-First State Space Duality and Portable $O(1)$ Autoregressive Caching for Inference	Cosmo Santoni	2026-03-10	下载	State-space model releases are typically coupled to fused CUDA and Triton kernels, inheriting a hard dependency on NVIDIA hardware. We show that Mamba-2's state space duality algorithm -- diagonal sta...
Dynamic Precision Math Engine for Linear Algebra and Trigonometry Acceleration on Xtensa LX6 Microcontrollers	Elian Alfonso Lopez Preciado	2026-03-10	下载	Low-cost embedded processors such as the ESP32 (Xtensa LX6, 32-bit dual-core, 240 MHz) are increasingly used in edge computing applications that require real-time physical simulation, sensor fusion, a...
Accelerating High-Order Finite Element Simulations at Extreme Scale with FP64 Tensor Cores	Jiqun Tu, Ian Karlin, John Camier, Veselin Dobrev, Tzanio Kolev, Stefan Henneking, Omar Ghattas	2026-03-10	下载	Finite element simulations play a critical role in a wide range of applications, from automotive design to tsunami modeling and computational electromagnetics.