Appearance
2026-03-10
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Unifying Logical and Physical Layout Representations via Heterogeneous Graphs for Circuit Congestion Prediction | Runbang Hu, Bo Fang, Bingzhe Li, Yuede Ji | 2026-03-10 | 下载 | As Very Large Scale Integration (VLSI) designs continue to scale in size and complexity, layout verification has become a central challenge in modern Electronic Design Automation (EDA) workflows. |
| Hardware Efficient Approximate Convolution with Tunable Error Tolerance for CNNs | Vishal Shashidhar, Anupam Kumari, Roy P Paily | 2026-03-10 | 下载 | Modern CNNs' high computational demands hinder edge deployment, as traditional ``hard'' sparsity (skipping mathematical zeros) loses effectiveness in deep layers or with smooth activations like Tanh. |
| Pooling Engram Conditional Memory in Large Language Models using CXL | Ruiyang Ma, Teng Ma, Zhiyuan Su, Hantian Zha, Xinpeng Zhao, Xuchun Shang, Xingrui Yi, Zheng Liu, Zhu Cao, An Wu, Zhichong Dou, Ziqian Liu, Daikang Kuang, Guojie Luo | 2026-03-10 | 下载 | Engram conditional memory has emerged as a promising component for LLMs by decoupling static knowledge lookup from dynamic computation. Since Engram exhibits sparse access patterns and supports prefet... |
| Nemo: A Low-Write-Amplification Cache for Tiny Objects on Log-Structured Flash Devices | Xufeng Yang, Tingting Tan, Jingxin Hu, Congming Gao, Mingyang Liu, Tianyang Jiang, Jian Chen, Linbo Long, Yina Lv, Jiwu Shu | 2026-03-10 | 下载 | Modern storage systems predominantly use flash-based SSDs as a cache layer due to their favorable performance and cost efficiency. However, in tiny-object workloads, existing flash cache designs still... |
| TrainDeeploy: Hardware-Accelerated Parameter-Efficient Fine-Tuning of Small Transformer Models at the Extreme Edge | Run Wang, Victor J. B. Jung, Philip Wiese, Francesco Conti, Alessio Burrello, Luca Benini | 2026-03-10 | 下载 | On-device tuning of deep neural networks enables long-term adaptation at the edge while preserving data privacy. However, the high computational and memory demands of backpropagation pose significant ... |
| DendroNN: Dendrocentric Neural Networks for Energy-Efficient Classification of Event-Based Data | Jann Krausse, Zhe Su, Kyrus Mama, Maryada, Klaus Knobloch, Giacomo Indiveri, Jürgen Becker | 2026-03-10 | 下载 | Spatiotemporal information is at the core of diverse sensory processing and computational tasks. Feed-forward spiking neural networks can be used to solve these tasks while offering potential benefits... |
| Wrong Code, Right Structure: Learning Netlist Representations from Imperfect LLM-Generated RTL | Siyang Cai, Cangyuan Li, Yinhe Han, Ying Wang | 2026-03-10 | 下载 | Learning effective netlist representations is fundamentally constrained by the scarcity of labeled datasets, as real designs are protected by Intellectual Property (IP) and costly to annotate. |
| Two Teachers Better Than One: Hardware-Physics Co-Guided Distributed Scientific Machine Learning | Yuchen Yuan, Junhuan Yang, Hao Wan, Yipei Liu, Hanhan Wu, Youzuo Lin, Lei Yang | 2026-03-10 | 下载 | Scientific machine learning (SciML) is increasingly applied to in-field processing, controlling, and monitoring; however, wide-area sensing, real-time demands, and strict energy and reliability constr... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| ACE Runtime - A ZKP-Native Blockchain Runtime with Sub-Second Cryptographic Finality | Jian Sheng Wang | 2026-03-10 | 下载 | Existing high performance blockchains verify one signature per transaction on the critical path, which creates O(N) verification cost, high hardware pressure, and difficult post quantum migration. |
| The Bureaucracy of Speed: Structural Equivalence Between Memory Consistency Models and Multi-Agent Authorization Revocation | Vladyslav Parakhin | 2026-03-10 | 下载 | The temporal assumptions underpinning conventional Identity and Access Management collapse under agentic execution regimes. A sixty-second revocation window permits on the order of una... |
| Rate-Distortion Bounds for Heterogeneous Random Fields on Finite Lattices | Sujata Sinha, Vishwas Rao, Robert Underwood, David Lenz, Sheng Di, Franck Cappello, Lingjia Liu | 2026-03-10 | 下载 | Since Shannon's foundational work, rate-distortion theory has defined the fundamental limits of lossy compression. Classical results, derived for memoryless and stationary ergodic sources in the asymp... |
| Ensuring Data Freshness in Multi-Rate Task Chains Scheduling | José Luis Conradi Hoffmann, Antônio Augusto Fröhlich | 2026-03-10 | 下载 | In safety-critical autonomous systems, data freshness presents a fundamental design challenge. While the Logical Execution Time (LET) paradigm ensures compositional determinism, it often does so at th... |
| Pooling Engram Conditional Memory in Large Language Models using CXL | Ruiyang Ma, Teng Ma, Zhiyuan Su, Hantian Zha, Xinpeng Zhao, Xuchun Shang, Xingrui Yi, Zheng Liu, Zhu Cao, An Wu, Zhichong Dou, Ziqian Liu, Daikang Kuang, Guojie Luo | 2026-03-10 | 下载 | Engram conditional memory has emerged as a promising component for LLMs by decoupling static knowledge lookup from dynamic computation. Since Engram exhibits sparse access patterns and supports prefet... |
| Multi-DNN Inference of Sparse Models on Edge SoCs | Jiawei Luo, Di Wu, Simon Dobson, Blesson Varghese | 2026-03-10 | 下载 | Modern edge applications increasingly require multi-DNN inference systems to execute tasks on heterogeneous processors, gaining performance from both concurrent execution and from matching each model ... |
| Randomized Distributed Function Computation (RDFC): Ultra-Efficient Semantic Communication Applications to Privacy | Onur Günlü | 2026-03-10 | 下载 | We establish the randomized distributed function computation (RDFC) framework, in which a sender transmits just enough information for a receiver to generate a randomized function of the input data. |
| Case Study: Performance Analysis of a Virtualized XRootD Frontend in Large-Scale WAN Transfers | J M da Silva, M A Costa, R L Iope | 2026-03-10 | 下载 | This paper presents a detailed case study of the T2_BR_SPRACE storage frontend architecture and its observed performance in high-intensity data transfers. |
| Compiler-First State Space Duality and Portable Autoregressive Caching for Inference | Cosmo Santoni | 2026-03-10 | 下载 | State-space model releases are typically coupled to fused CUDA and Triton kernels, inheriting a hard dependency on NVIDIA hardware. We show that Mamba-2's state space duality algorithm -- diagonal sta... |
| Flash-KMeans: Fast and Memory-Efficient Exact K-Means | Shuo Yang, Haocheng Xi, Yilong Zhao, Muyang Li, Xiaoze Fan, Jintao Zhang, Han Cai, Yujun Lin, Xiuyu Li, Kurt Keutzer, Song Han, Chenfeng Xu, Ion Stoica | 2026-03-10 | 下载 | -means has historically been positioned primarily as an offline processing primitive, typically used for dataset organization or embedding preprocessing rather than as a first-class component in on... |
| PIM-SHERPA: Software Method for On-device LLM Inference by Resolving PIM Memory Attribute and Layout Inconsistencies | Sunjung Lee, Sanghoon Cha, Hyeonsu Kim, Seungwoo Seo, Yuhwan Ro, Sukhan Lee, Byeongho Kim, Yongjun Park, Kyomin Sohn, Seungwon Lee, Jaehoon Yu | 2026-03-10 | 下载 | On-device deployments of large language models (LLMs) are rapidly proliferating across mobile and edge platforms. LLM inference comprises a compute-intensive prefill phase and a memory bandwidth-inten... |
| Hierarchical Observe-Orient-Decide-Act Enabled UAV Swarms in Uncertain Environments: Frameworks, Potentials, and Challenges | Ziye Jia, Yao Wu, Qihui Wu, Lijun He, Qiuming Zhu, Fuhui Zhou, Zhu Han | 2026-03-10 | 下载 | Unmanned aerial vehicle (UAV) swarms are increasingly explored for their potentials in various applications such as surveillance, disaster response, and military. |
| Nezha: A Key-Value Separated Distributed Store with Optimized Raft Integration | Yangyang Wang, Yucong Dong, Ziqian Cheng, Zichen Xu | 2026-03-10 | 下载 | Distributed key-value stores are widely adopted to support elastic big data applications, leveraging purpose-built consensus algorithms like Raft to ensure data consistency. |
| Accelerating High-Order Finite Element Simulations at Extreme Scale with FP64 Tensor Cores | Jiqun Tu, Ian Karlin, John Camier, Veselin Dobrev, Tzanio Kolev, Stefan Henneking, Omar Ghattas | 2026-03-10 | 下载 | Finite element simulations play a critical role in a wide range of applications, from automotive design to tsunami modeling and computational electromagnetics. |
| Two Teachers Better Than One: Hardware-Physics Co-Guided Distributed Scientific Machine Learning | Yuchen Yuan, Junhuan Yang, Hao Wan, Yipei Liu, Hanhan Wu, Youzuo Lin, Lei Yang | 2026-03-10 | 下载 | Scientific machine learning (SciML) is increasingly applied to in-field processing, controlling, and monitoring; however, wide-area sensing, real-time demands, and strict energy and reliability constr... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Fly-PRAC: Packet Recovery for Random Linear Network Coding | Hosein K. Nazari, Stefan Senk, Peyman Pahlevani, Juan A. Cabrera, Frank H. P. Fitzek | 2026-03-10 | 下载 | Network Coding (NC) is a compelling solution for increasing network efficiency. However, it discards corrupted packets and cannot achieve optimal performance in noisy communications. |
| Performance Evaluation of Delay Tolerant Network Protocols to Improve Nepal Earthquake Rescue Communications | Xiaofei Liu, Milena Radenkovic | 2026-03-10 | 下载 | In the fields of disaster rescue and communication in extreme environments, Delay Tolerant Network (DTN) has become an important technology due to its "store-carry-forward" mechanism. |
| Towards Flexible Spectrum Access: Data-Driven Insights into Spectrum Demand | Mohamad Alkadamani, Amir Ghasemi, Halim Yanikomeroglu | 2026-03-10 | 下载 | In the diverse landscape of 6G networks, where wireless connectivity demands surge and spectrum resources remain limited, flexible spectrum access becomes paramount. |
| Role Classification of Hosts within Enterprise Networks Based on Connection Patterns | Godfrey Tan, Massimiliano Poletto, John Guttag, Frans Kaashoek | 2026-03-10 | 下载 | Role classification involves grouping hosts into related roles. It exposes the logical structure of a network, simplifies network management tasks such as policy checking and network segmentation, and... |
| The 802.11 MAC protocol leads to inefficient equilibria | Godfrey Tan, John Guttag | 2026-03-10 | 下载 | Wireless local area networks (WLANs) based on the family of 802.11 technologies are becoming ubiquitous. These technologies support multiple data transmission rates. |
| A Graph-Based Approach to Spectrum Demand Prediction Using Hierarchical Attention Networks | Mohamad Alkadamani, Halim Yanikomeroglu, Amir Ghasemi | 2026-03-10 | 下载 | The surge in wireless connectivity demand, coupled with the finite nature of spectrum resources, compels the development of efficient spectrum management approaches. |
| PixelConfig: Longitudinal Measurement and Reverse-Engineering of Meta Pixel Configurations | Abdullah Ghani, Yash Vekaria, Zubair Shafiq | 2026-03-10 | 下载 | Tracking pixels are used to optimize online ad campaigns through personalization, re-targeting, and conversion tracking. Past research has primarily focused on detecting the prevalence of tracking pix... |
| PPO-Based Hybrid Optimization for RIS-Assisted Semantic Vehicular Edge Computing | Wei Feng, Jingbo Zhang, Qiong Wu, Pingyi Fan, Qiang Fan | 2026-03-10 | 下载 | To support latency-sensitive Internet of Vehicles (IoV) applications amidst dynamic environments and intermittent links, this paper proposes a Reconfigurable Intelligent Surface (RIS)-aided semantic-a... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Ensuring Data Freshness in Multi-Rate Task Chains Scheduling | José Luis Conradi Hoffmann, Antônio Augusto Fröhlich | 2026-03-10 | 下载 | In safety-critical autonomous systems, data freshness presents a fundamental design challenge. While the Logical Execution Time (LET) paradigm ensures compositional determinism, it often does so at th... |
| FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation | Yinpeng Wu, Yitong Chen, Lixiang Wang, Jinyu Gu, Zhichao Hua, Yubin Xia | 2026-03-10 | 下载 | Device-side Large Language Models (LLMs) have witnessed explosive growth, offering higher privacy and availability compared to cloud-side LLMs. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Multi-DNN Inference of Sparse Models on Edge SoCs | Jiawei Luo, Di Wu, Simon Dobson, Blesson Varghese | 2026-03-10 | 下载 | Modern edge applications increasingly require multi-DNN inference systems to execute tasks on heterogeneous processors, gaining performance from both concurrent execution and from matching each model ... |
| Compiler-First State Space Duality and Portable Autoregressive Caching for Inference | Cosmo Santoni | 2026-03-10 | 下载 | State-space model releases are typically coupled to fused CUDA and Triton kernels, inheriting a hard dependency on NVIDIA hardware. We show that Mamba-2's state space duality algorithm -- diagonal sta... |
| Dynamic Precision Math Engine for Linear Algebra and Trigonometry Acceleration on Xtensa LX6 Microcontrollers | Elian Alfonso Lopez Preciado | 2026-03-10 | 下载 | Low-cost embedded processors such as the ESP32 (Xtensa LX6, 32-bit dual-core, 240 MHz) are increasingly used in edge computing applications that require real-time physical simulation, sensor fusion, a... |
| Accelerating High-Order Finite Element Simulations at Extreme Scale with FP64 Tensor Cores | Jiqun Tu, Ian Karlin, John Camier, Veselin Dobrev, Tzanio Kolev, Stefan Henneking, Omar Ghattas | 2026-03-10 | 下载 | Finite element simulations play a critical role in a wide range of applications, from automotive design to tsunami modeling and computational electromagnetics. |