2026-02-11
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Metastable Dynamical Computing with Energy Landscapes: A Primer | Christian Z. Pratt, Kyle J. Ray, James P. Crutchfield | 2026-02-11 | Download | Smartphones, laptops, and data centers are CMOS-based technologies that ushered our world into the information age of the 21st century. Despite their advantages for scalable computing, their implement... |
| A 16 nm 1.60TOPS/W High Utilization DNN Accelerator with 3D Spatial Data Reuse and Efficient Shared Memory Access | Xiaoling Yi, Ryan Antonio, Yunhao Deng, Fanchen Kong, Joren Dumoulin, Jun Yin, Marian Verhelst | 2026-02-11 | Download | Achieving high compute utilization across a wide range of AI workloads is crucial for the efficiency of versatile DNN accelerators. This paper presents the Voltra chip and its utilization-optimised DN... |
| HiFloat4 Format for Language Model Inference | Yuanyong Luo, Jing Huang, Yu Cheng, Ziwei Yu, Kaihua Tang, Xinda Ma, Xin Wang, Anping Tong, Guipeng Hu, Yun Xu, Mehran Taghian, Peng Wu, Guanglin Li, Yunke Peng, Tianchi Hu, Minqi Chen, Michael Bi Mi, Hu Liu, Xiping Zhou, Junsong Wang, Qiang Lin, Heng Liao | 2026-02-11 | Download | This paper introduces HiFloat4 (HiF4), a block floating-point data format tailored for deep learning. Each HiF4 unit packs 64 4-bit elements with 32 bits of shared scaling metadata, averaging 4. |
| Reed-Muller Error-Correction Code Encoder for SFQ-to-CMOS Interface Circuits | Yerzhan Mustafa, Berker Peköz, Selçuk Köse | 2026-02-11 | Download | Data transmission from superconducting digital electronics such as single flux quantum (SFQ) logic to semiconductor (CMOS) circuits is subject to bit errors due to, e.g. |
| Vulnerabilities in Partial TEE-Shielded LLM Inference with Precomputed Noise | Abhishek Saini, Haolin Jiang, Hang Liu | 2026-02-11 | Download | The deployment of large language models (LLMs) on third-party devices requires new ways to protect model intellectual property. While Trusted Execution Environments (TEEs) offer a promising solution, ... |
| From Buffers to Registers: Unlocking Fine-Grained FlashAttention with Hybrid-Bonded 3D NPU Co-Design | Jinxin Yu, Yudong Pan, Mengdi Wang, Huawei Li, Yinhe Han, Xiaowei Li, Ying Wang | 2026-02-11 | Download | Transformer-based models dominate modern AI workloads but exacerbate memory bottlenecks due to their quadratic attention complexity and ever-growing model sizes. |
| Fault Tolerant Design of IGZO-based Binary Search ADCs | Paula Carolina Lozano Duarte, Sule Ozev, Mehdi Tahoori | 2026-02-11 | Download | Thin-film technologies such as Indium Gallium Zinc Oxide (IGZO) enable Flexible Electronics (FE) for emerging applications in wearable sensing, personal health monitoring, and large-area systems. |
| LOREN: Low Rank-Based Code-Rate Adaptation in Neural Receivers | Bram Van Bolderik, Vlado Menkovski, Sonia Heemstra de Groot, Manil Dev Gomony | 2026-02-11 | Download | Neural network based receivers have recently demonstrated superior system-level performance compared to traditional receivers. However, their practicality is limited by high memory and power requireme... |
| DRAMPyML: A Formal Description of DRAM Protocols with Timed Petri Nets | Derek Christ, Thomas Zimmermann, Philippe Barbie, Dmitri Saberi, Yao Yin, Matthias Jung | 2026-02-11 | Download | The JEDEC committee defines various domain-specific DRAM standards. These standards feature increasingly complex and evolving protocol specifications, which are detailed in timing diagrams and command... |
| Scaling Routers with In-Package Optics and High-Bandwidth Memories | Isaac Keslassy, Ilay Yavlovich, Jose Yallouz, Tzu-Chien Hsueh, Yeshaiahu Fainman, Bill Lin | 2026-02-11 | Download | This paper aims to apply two major scaling transformations from the computing packaging industry to internet routers: the heterogeneous integration of high-bandwidth memories (HBMs) and chiplets, as w... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Real Life Is Uncertain. Consensus Should Be Too! | Reginald Frank, Soujanya Ponnapalli, Octavio Lomeli, Neil Giridharan, Marcos K Aguilera, Natacha Crooks | 2026-02-11 | Download | Modern distributed systems rely on consensus protocols to build a fault-tolerant core upon which they can build applications. Consensus protocols are correct under a specific failure model, where up t... |
| Min-Sum Uniform Coverage Problem by Autonomous Mobile Robots | Animesh Maiti, Abhinav Chakraborty, Bibhuti Das, Subhash Bhagat, Krishnendu Mukhopadhyaya | 2026-02-11 | Download | We study the *min-sum uniform coverage* problem for a swarm of mobile robots on a given finite line segment and on a circle having finite positive radius, where the circle is given as an in... |
| Fine-Tuning GPT-5 for GPU Kernel Generation | Ali Tehrani, Yahya Emara, Essam Wissam, Wojciech Paluch, Waleed Atallah, Łukasz Dudziak, Mohamed S. Abdelfattah | 2026-02-11 | Download | Developing efficient GPU kernels is essential for scaling modern AI systems, yet it remains a complex task due to intricate hardware architectures and the need for specialized optimization expertise. |
| Ask the Expert: Collaborative Inference for Vision Transformers with Near-Edge Accelerators | Hao Liu, Suhaib A. Fahmy | 2026-02-11 | Download | Deploying Vision Transformers on edge devices is challenging due to their high computational complexity, while full offloading to cloud resources presents significant latency overheads. |
| BOute: Cost-Efficient LLM Serving with Heterogeneous LLMs and GPUs via Multi-Objective Bayesian Optimization | Youhe Jiang, Fangcheng Fu, Eiko Yoneki | 2026-02-11 | Download | The rapid growth of large language model (LLM) deployments has made cost-efficient serving systems essential. Recent efforts to enhance system cost-efficiency adopt two main perspectives: (i) An algor... |
| Computing Least Fixed Points with Overwrite Semantics in Parallel and Distributed Systems | Vijay K. Garg, Rohan Garg | 2026-02-11 | Download | We present methods to compute least fixed points of multiple monotone inflationary functions in parallel and distributed settings. While the classic Knaster-Tarski theorem addresses a single function ... |
| Authenticated Workflows: A Systems Approach to Protecting Agentic AI | Mohan Rajagopalan, Vinay Rao | 2026-02-11 | Download | Agentic AI systems automate enterprise workflows, but existing defenses (guardrails, semantic filters) are probabilistic and routinely bypassed. |
| Chamfer-Linkage for Hierarchical Agglomerative Clustering | Kishen N Gowda, Willem Fletcher, MohammadHossein Bateni, Laxman Dhulipala, D Ellis Hershkowitz, Rajesh Jayaram, Jakub Łącki | 2026-02-11 | Download | Hierarchical Agglomerative Clustering (HAC) is a widely-used clustering method based on repeatedly merging the closest pair of clusters, where inter-cluster distances are determined by a linkage funct... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Multi Layer Protection Against Low Rate DDoS Attacks in Containerized Systems | Ahmad Fareed, Bilal Al Habib, Anne Pepita Francis | 2026-02-11 | Download | Low rate Distributed Denial of Service (DDoS) attacks have emerged as a major threat to containerized cloud infrastructures. Due to their low traffic volumes, these attacks can be difficult to detect an... |
| WHEREIS: IP Address Registration Geo-Consistency | Robert Beverly, Amreesh Phokeer, Oliver Gasser | 2026-02-11 | Download | The five Regional Internet Registries (RIRs) provide the critical function of IP address resource delegation and registration. The accuracy of registration data directly impacts Internet operation, m... |
| A Robust Optimization Approach for Regenerator Placement in Fault-Tolerant Networks Under Discrete Cost Uncertainty | Mohammad Khosravi, Setareh Maghsudi | 2026-02-11 | Download | We focus on robust, survivable communication networks, where network links and nodes are affected by an uncertainty set. In this sense, any network link might fail. |
| AI Infrastructure Sovereignty | Sergio Cruzes | 2026-02-11 | Download | Artificial intelligence has shifted from a software-centric discipline to an infrastructure-driven system. Large-scale training and inference increasingly depend on tightly coupled data centers, high-... |
| Less is More: The Dilution Effect in Multi-Link Wireless Sensing | Karim Khamaisi, Bruno Rodrigues | 2026-02-11 | Download | Wireless sensing approaches promise to transform smart infrastructures into privacy-preserving motion detectors, yet commercial adoption remains limited. |
| Security, Privacy and System-Level Resilience of 6G End-to-End System: Hexa-X-II Perspective | Pawani Porambage, Diego Lopez, Antonio Pastor, Bin Han, José María Jorquera Valero, Manuel Gil Pérez, Noelia Pérez Palma, Antonio Skarmeta, Prajnamaya Dass, Stefan Köpsell, Sonika Ujjwal, Javier José Díaz Rivera, Pol Alemany, Raul Muñoz, Jafar Mohammadi, Chaitanya Aggarwal, Betul Guvenc Paltun, Ferhat Karakoc | 2026-02-11 | Download | The sixth generation (6G) of mobile networks is being developed to overcome limitations in previous generations and meet emerging user demands. |
| Supercharging Packet-level Network Simulation of Large Model Training via Memoization and Fast-Forwarding | Fei Long, Kaihui Gao, Li Chen, Dan Li, Yiwei Zhang, Fei Gui, Yitao Xing, Wenjia Wei, Bingyang Liu | 2026-02-11 | Download | Packet-level discrete-event simulation (PLDES) is a prevalent tool for evaluating detailed performance of large model training. Although PLDES offers high fidelity and generality, its slow performance... |
| SplitCom: Communication-efficient Split Federated Fine-tuning of LLMs via Temporal Compression | Tao Li, Yulin Tang, Yiyang Song, Cong Wu, Xihui Liu, Pan Li, Xianhao Chen | 2026-02-11 | Download | Federated fine-tuning of on-device large language models (LLMs) mitigates privacy concerns by preventing raw data sharing. However, the intensive computational and memory demands pose significant chal... |
| Predictive-State Communication: Innovation Coding and Reconciliation under Delay | Ozgur Ercetin, Mohaned Chraiti | 2026-02-11 | Download | Shannon theory models communication as the reliable transfer of symbol sequences, with performance governed by capacity and rate-distortion limits. |
| Scaling Routers with In-Package Optics and High-Bandwidth Memories | Isaac Keslassy, Ilay Yavlovich, Jose Yallouz, Tzu-Chien Hsueh, Yeshaiahu Fainman, Bill Lin | 2026-02-11 | Download | This paper aims to apply two major scaling transformations from the computing packaging industry to internet routers: the heterogeneous integration of high-bandwidth memories (HBMs) and chiplets, as w... |
| To Reconfigure or Not to Reconfigure: Optimizing All-to-All Collectives in Circuit-Switched Photonic Interconnects | Anchengcheng Zhou, Vamsi Addanki, Maria Apostolaki | 2026-02-11 | Download | All-to-all collective communication is a core primitive in distributed machine learning and high-performance computing. At the server scale, the communication demands of these workloads are increasing... |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Hardening the OSv Unikernel with Efficient Address Randomization: Design and Performance Evaluation | Alex Wollman, John Hastings | 2026-02-11 | Download | Unikernels are single-purpose library operating systems that run the kernel and application in one address space, but often omit security mitigations such as address space layout randomization (ASLR). |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Resource-Efficient RGB-Only Action Recognition for Edge Deployment | Dongsik Yoon, Jongeun Kim, Dayeon Lee | 2026-02-11 | Download | Action recognition on edge devices poses stringent constraints on latency, memory, storage, and power consumption. While auxiliary modalities such as skeleton and depth information can enhance recogni... |
| Supercharging Packet-level Network Simulation of Large Model Training via Memoization and Fast-Forwarding | Fei Long, Kaihui Gao, Li Chen, Dan Li, Yiwei Zhang, Fei Gui, Yitao Xing, Wenjia Wei, Bingyang Liu | 2026-02-11 | Download | Packet-level discrete-event simulation (PLDES) is a prevalent tool for evaluating detailed performance of large model training. Although PLDES offers high fidelity and generality, its slow performance... |