2026-02-11

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
Metastable Dynamical Computing with Energy Landscapes: A Primer	Christian Z. Pratt, Kyle J. Ray, James P. Crutchfield	2026-02-11	Download	Smartphones, laptops, and data centers are CMOS-based technologies that ushered our world into the information age of the 21st century. Despite their advantages for scalable computing, their implement...
A 16 nm 1.60TOPS/W High Utilization DNN Accelerator with 3D Spatial Data Reuse and Efficient Shared Memory Access	Xiaoling Yi, Ryan Antonio, Yunhao Deng, Fanchen Kong, Joren Dumoulin, Jun Yin, Marian Verhelst	2026-02-11	Download	Achieving high compute utilization across a wide range of AI workloads is crucial for the efficiency of versatile DNN accelerators. This paper presents the Voltra chip and its utilization-optimised DN...
HiFloat4 Format for Language Model Inference	Yuanyong Luo, Jing Huang, Yu Cheng, Ziwei Yu, Kaihua Tang, Xinda Ma, Xin Wang, Anping Tong, Guipeng Hu, Yun Xu, Mehran Taghian, Peng Wu, Guanglin Li, Yunke Peng, Tianchi Hu, Minqi Chen, Michael Bi Mi, Hu Liu, Xiping Zhou, Junsong Wang, Qiang Lin, Heng Liao	2026-02-11	Download	This paper introduces HiFloat4 (HiF4), a block floating-point data format tailored for deep learning. Each HiF4 unit packs 64 4-bit elements with 32 bits of shared scaling metadata, averaging 4.
Reed-Muller Error-Correction Code Encoder for SFQ-to-CMOS Interface Circuits	Yerzhan Mustafa, Berker Peköz, Selçuk Köse	2026-02-11	Download	Data transmission from superconducting digital electronics such as single flux quantum (SFQ) logic to semiconductor (CMOS) circuits is subject to bit errors due to, e.g.
Vulnerabilities in Partial TEE-Shielded LLM Inference with Precomputed Noise	Abhishek Saini, Haolin Jiang, Hang Liu	2026-02-11	Download	The deployment of large language models (LLMs) on third-party devices requires new ways to protect model intellectual property. While Trusted Execution Environments (TEEs) offer a promising solution, ...
From Buffers to Registers: Unlocking Fine-Grained FlashAttention with Hybrid-Bonded 3D NPU Co-Design	Jinxin Yu, Yudong Pan, Mengdi Wang, Huawei Li, Yinhe Han, Xiaowei Li, Ying Wang	2026-02-11	Download	Transformer-based models dominate modern AI workloads but exacerbate memory bottlenecks due to their quadratic attention complexity and ever-growing model sizes.
Fault Tolerant Design of IGZO-based Binary Search ADCs	Paula Carolina Lozano Duarte, Sule Ozev, Mehdi Tahoori	2026-02-11	Download	Thin-film technologies such as Indium Gallium Zinc Oxide (IGZO) enable Flexible Electronics (FE) for emerging applications in wearable sensing, personal health monitoring, and large-area systems.
LOREN: Low Rank-Based Code-Rate Adaptation in Neural Receivers	Bram Van Bolderik, Vlado Menkovski, Sonia Heemstra de Groot, Manil Dev Gomony	2026-02-11	Download	Neural network based receivers have recently demonstrated superior system-level performance compared to traditional receivers. However, their practicality is limited by high memory and power requireme...
DRAMPyML: A Formal Description of DRAM Protocols with Timed Petri Nets	Derek Christ, Thomas Zimmermann, Philippe Barbie, Dmitri Saberi, Yao Yin, Matthias Jung	2026-02-11	Download	The JEDEC committee defines various domain-specific DRAM standards. These standards feature increasingly complex and evolving protocol specifications, which are detailed in timing diagrams and command...
Scaling Routers with In-Package Optics and High-Bandwidth Memories	Isaac Keslassy, Ilay Yavlovich, Jose Yallouz, Tzu-Chien Hsueh, Yeshaiahu Fainman, Bill Lin	2026-02-11	Download	This paper aims to apply two major scaling transformations from the computing packaging industry to internet routers: the heterogeneous integration of high-bandwidth memories (HBMs) and chiplets, as w...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Real Life Is Uncertain. Consensus Should Be Too!	Reginald Frank, Soujanya Ponnapalli, Octavio Lomeli, Neil Giridharan, Marcos K Aguilera, Natacha Crooks	2026-02-11	Download	Modern distributed systems rely on consensus protocols to build a fault-tolerant-core upon which they can build applications. Consensus protocols are correct under a specific failure model, where up t...
Min-Sum Uniform Coverage Problem by Autonomous Mobile Robots	Animesh Maiti, Abhinav Chakraborty, Bibhuti Das, Subhash Bhagat, Krishnendu Mukhopadhyaya	2026-02-11	Download	We study the \textit{min-sum uniform coverage} problem for a swarm of $n$ mobile robots on a given finite line segment and on a circle having finite positive radius, where the circle is given as an in...
Fine-Tuning GPT-5 for GPU Kernel Generation	Ali Tehrani, Yahya Emara, Essam Wissam, Wojciech Paluch, Waleed Atallah, Łukasz Dudziak, Mohamed S. Abdelfattah	2026-02-11	Download	Developing efficient GPU kernels is essential for scaling modern AI systems, yet it remains a complex task due to intricate hardware architectures and the need for specialized optimization expertise.
Ask the Expert: Collaborative Inference for Vision Transformers with Near-Edge Accelerators	Hao Liu, Suhaib A. Fahmy	2026-02-11	Download	Deploying Vision Transformers on edge devices is challenging due to their high computational complexity, while full offloading to cloud resources presents significant latency overheads.
BOute: Cost-Efficient LLM Serving with Heterogeneous LLMs and GPUs via Multi-Objective Bayesian Optimization	Youhe Jiang, Fangcheng Fu, Eiko Yoneki	2026-02-11	Download	The rapid growth of large language model (LLM) deployments has made cost-efficient serving systems essential. Recent efforts to enhance system cost-efficiency adopt two main perspectives: (i) An algor...
Computing Least Fixed Points with Overwrite Semantics in Parallel and Distributed Systems	Vijay K. Garg, Rohan Garg	2026-02-11	Download	We present methods to compute least fixed points of multiple monotone inflationary functions in parallel and distributed settings. While the classic Knaster-Tarski theorem addresses a single function ...
Authenticated Workflows: A Systems Approach to Protecting Agentic AI	Mohan Rajagopalan, Vinay Rao	2026-02-11	Download	Agentic AI systems automate enterprise workflows but existing defenses--guardrails, semantic filters--are probabilistic and routinely bypassed.
Chamfer-Linkage for Hierarchical Agglomerative Clustering	Kishen N Gowda, Willem Fletcher, MohammadHossein Bateni, Laxman Dhulipala, D Ellis Hershkowitz, Rajesh Jayaram, Jakub Łącki	2026-02-11	Download	Hierarchical Agglomerative Clustering (HAC) is a widely-used clustering method based on repeatedly merging the closest pair of clusters, where inter-cluster distances are determined by a linkage funct...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Multi Layer Protection Against Low Rate DDoS Attacks in Containerized Systems	Ahmad Fareed, Bilal Al Habib, Anne Pepita Francis	2026-02-11	Download	Low rate Distributed Denial of Service DDoS attacks have emerged as a major threat to containerized cloud infrastructures. Due to their low traffic volumes, these attacks can be difficult to detect an...
WHEREIS: IP Address Registration Geo-Consistency	Robert Beverly, Amreesh Phokeer, Oliver Gasser	2026-02-11	Download	The five Regional Internet Registries (RIRs) provide the critical function of IP address resource delegation and registration. The accuracy of registration data directly impacts Internet operation, m...
A Robust Optimization Approach for Regenerator Placement in Fault-Tolerant Networks Under Discrete Cost Uncertainty	Mohammad Khosravi, Setareh Maghsudi	2026-02-11	Download	We focus on robust, survivable communication networks, where network links and nodes are affected by an uncertainty set. In this sense, any network links might fail.
AI Infrastructure Sovereignty	Sergio Cruzes	2026-02-11	Download	Artificial intelligence has shifted from a software-centric discipline to an infrastructure-driven system. Large-scale training and inference increasingly depend on tightly coupled data centers, high-...
Less is More: The Dilution Effect in Multi-Link Wireless Sensing	Karim Khamaisi, Bruno Rodrigues	2026-02-11	Download	Wireless sensing approaches promise to transform smart infrastructures into privacy-preserving motion detectors, yet commercial adoption remains limited.
Security, Privacy and System-Level Resillience of 6G End-to-End System: Hexa-X-II Perspective	Pawani Porambage, Diego Lopez, Antonio Pastor, Bin Han, José María Jorquera Valero, Manuel Gil Pérez, Noelia Pérez Palma, Antonio Skarmeta, Prajnamaya Dass, Stefan Köpsell, Sonika Ujjwal, Javier José Díaz Rivera, Pol Alemany, Raul Muñoz, Jafar Mohammadi, Chaitanya Aggarwal, Betul Guvenc Paltun, Ferhat Karakoc	2026-02-11	Download	The sixth generation (6G) of mobile networks are being developed to overcome limitations in previous generations and meet emerging user demands.
Supercharging Packet-level Network Simulation of Large Model Training via Memoization and Fast-Forwarding	Fei Long, Kaihui Gao, Li Chen, Dan Li, Yiwei Zhang, Fei Gui, Yitao Xing, Wenjia Wei, Bingyang Liu	2026-02-11	Download	Packet-level discrete-event simulation (PLDES) is a prevalent tool for evaluating detailed performance of large model training. Although PLDES offers high fidelity and generality, its slow performance...
SplitCom: Communication-efficient Split Federated Fine-tuning of LLMs via Temporal Compression	Tao Li, Yulin Tang, Yiyang Song, Cong Wu, Xihui Liu, Pan Li, Xianhao Chen	2026-02-11	Download	Federated fine-tuning of on-device large language models (LLMs) mitigates privacy concerns by preventing raw data sharing. However, the intensive computational and memory demands pose significant chal...
Predictive-State Communication: Innovation Coding and Reconciliation under Delay	Ozgur Ercetin, Mohaned Chraiti	2026-02-11	Download	Shannon theory models communication as the reliable transfer of symbol sequences, with performance governed by capacity and rate-distortion limits.
Scaling Routers with In-Package Optics and High-Bandwidth Memories	Isaac Keslassy, Ilay Yavlovich, Jose Yallouz, Tzu-Chien Hsueh, Yeshaiahu Fainman, Bill Lin	2026-02-11	Download	This paper aims to apply two major scaling transformations from the computing packaging industry to internet routers: the heterogeneous integration of high-bandwidth memories (HBMs) and chiplets, as w...
To Reconfigure or Not to Reconfigure: Optimizing All-to-All Collectives in Circuit-Switched Photonic Interconnects	Anchengcheng Zhou, Vamsi Addanki, Maria Apostolaki	2026-02-11	Download	All-to-all collective communication is a core primitive in distributed machine learning and high-performance computing. At the server scale, the communication demands of these workloads are increasing...

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
Hardening the OSv Unikernel with Efficient Address Randomization: Design and Performance Evaluation	Alex Wollman, John Hastings	2026-02-11	Download	Unikernels are single-purpose library operating systems that run the kernel and application in one address space, but often omit security mitigations such as address space layout randomization (ASLR).

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Resource-Efficient RGB-Only Action Recognition for Edge Deployment	Dongsik Yoon, Jongeun Kim, Dayeon Lee	2026-02-11	Download	Action recognition on edge devices poses stringent constraints on latency, memory, storage, and power consumption. While auxiliary modalities such as skeleton and depth information can enhance recogni...
Supercharging Packet-level Network Simulation of Large Model Training via Memoization and Fast-Forwarding	Fei Long, Kaihui Gao, Li Chen, Dan Li, Yiwei Zhang, Fei Gui, Yitao Xing, Wenjia Wei, Bingyang Liu	2026-02-11	Download	Packet-level discrete-event simulation (PLDES) is a prevalent tool for evaluating detailed performance of large model training. Although PLDES offers high fidelity and generality, its slow performance...