2026-01-16

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
NuRedact: Non-Uniform eFPGA Architecture for Low-Overhead and Secure IP Redaction	Voktho Das, Kimia Azar, Hadi Kamali	2026-01-16	Download	While logic locking has been extensively studied as a countermeasure against integrated circuit (IC) supply chain threats, recent research has shifted toward reconfigurable-based redaction techniques,...
Bench4HLS: End-to-End Evaluation of LLMs in High-Level Synthesis Code Generation	M Zafir Sadik Khan, Kimia Azar, Hadi Kamali	2026-01-16	Download	In the last two years, large language models (LLMs) have shown strong capabilities in code generation, including hardware design at register-transfer level (RTL).
Continuous-Flow Data-Rate-Aware CNN Inference on FPGA	Tobias Habermann, Michael Mecik, Zhenyu Wang, César David Vera, Martin Kumm, Mario Garrido	2026-01-16	Download	Among hardware accelerators for deep-learning inference, data flow implementations offer low latency and high throughput capabilities. In these architectures, each neuron is mapped to a dedicated hard...
IMS: Intelligent Hardware Monitoring System for Secure SoCs	Wadid Foudhaili, Aykut Rencber, Anouar Nechi, Rainer Buchty, Mladen Berekovic, Andres Gomez, Saleh Mulhem	2026-01-16	Download	In the modern Systems-on-Chip (SoC), the Advanced eXtensible Interface (AXI) protocol exhibits security vulnerabilities, enabling partial or complete denial-of-service (DoS) through protocol-violation...
InterPUF: Distributed Authentication via Physically Unclonable Functions and Multi-party Computation for Reconfigurable Interposers	Ishraq Tashdid, Tasnuva Farheen, Sazadur Rahman	2026-01-16	Download	Modern system-in-package (SiP) platforms increasingly adopt reconfigurable interposers to enable plug-and-play chiplet integration across heterogeneous multi-vendor ecosystems.
OpenACM: An Open-Source SRAM-Based Approximate CiM Compiler	Yiqi Zhou, JunHao Ma, Xingyang Li, Yule Sheng, Yue Yuan, Yikai Wang, Bochang Wang, Yiheng Wu, Shan Shen, Wei Xing, Daying Sun, Li Li, Zhiqiang Xiao	2026-01-16	Download	The rise of data-intensive AI workloads has exacerbated the "memory wall" bottleneck. Digital Compute-in-Memory (DCiM) using SRAM offers a scalable solution, but its vast design space makes manual d...
RidgeWalker: Perfectly Pipelined Graph Random Walks on FPGAs	Hongshi Tan, Yao Chen, Xinyu Chen, Qizhen Zhang, Cheng Chen, Weng-Fai Wong, Bingsheng He	2026-01-16	Download	Graph Random Walks (GRWs) offer efficient approximations of key graph properties and have been widely adopted in many applications. However, GRW workloads are notoriously difficult to accelerate due t...
SwiftKV: An Edge-Oriented Attention Algorithm and Multi-Head Accelerator for Fast, Efficient LLM Decoding	Junming Zhang, Qinyan Zhang, Huajun Sun, Feiyang Gao, Sheng Hu, Rui Nie, Xiangshui Miao	2026-01-16	Download	Edge acceleration for large language models is crucial for their widespread application; however, achieving fast attention inference and efficient decoding on resource-constrained edge accelerators re...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
RAPID-Serve: Resource-efficient and Accelerated P/D Intra-GPU Disaggregation	Amna Masood, Pratishtha Gaur, Nuwan Jayasena	2026-01-16	Download	Two widely adopted techniques for LLM inference serving systems today are hybrid batching and disaggregated serving. A hybrid batch combines prefill and decode tokens of different requests in the same...
SIVF: GPU-Resident IVF Index for Streaming Vector Search	Dongfang Zhao	2026-01-16	Download	GPU-accelerated Inverted File (IVF) index is one of the industry standards for large-scale vector search but relies on static VRAM layouts that hinder real-time mutability.
Nixie: Efficient, Transparent Temporal Multiplexing for Consumer GPUs	Yechen Xu, Yifei Wang, Nathanael Ren, Yiran Chen, Danyang Zhuo	2026-01-16	Download	Consumer machines are increasingly running large ML workloads such as large language models (LLMs), text-to-image generation, and interactive image editing.
Space-Optimal, Computation-Optimal, Topology-Agnostic, Throughput-Scalable Causal Delivery through Hybrid Buffering	Paulo Sérgio Almeida	2026-01-16	Download	Message delivery respecting causal ordering (causal delivery) is one of the most classic and widely useful abstractions for inter-process communication in a distributed system.
DecHW: Heterogeneous Decentralized Federated Learning Exploiting Second-Order Information	Adnan Ahmad, Chiara Boldrini, Lorenzo Valerio, Andrea Passarella, Marco Conti	2026-01-16	Download	Decentralized Federated Learning (DFL) is a serverless collaborative machine learning paradigm where devices collaborate directly with neighbouring devices to exchange model information for learning a...
Konflux: Optimized Function Fusion for Serverless Applications	Niklas Kowallik, Trever Schirmer, David Bermbach	2026-01-16	Download	Function-as-a-Service (FaaS) has become a central paradigm in serverless cloud computing, yet optimizing FaaS deployments remains challenging.
HALO: Semantic-Aware Distributed LLM Inference in Lossy Edge Network	Peirong Zheng, Wenchao Xu, Haozhao Wang, Jinyu Chen, Xuemin Shen	2026-01-16	Download	The deployment of large language models' (LLMs) inference at the edge can facilitate prompt service responsiveness while protecting user privacy.
AFLL: Real-time Load Stabilization for MMO Game Servers Based on Circular Causality Learning	Shinsuk Kang, Youngjae Kim	2026-01-16	Download	Massively Multiplayer Online (MMO) game servers must handle thousands of simultaneous players while maintaining sub-100ms response times. When server load exceeds capacity, traditional approaches eith...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Inter-Cell Interference Rejection Based on Ultrawideband Walsh-Domain Wireless Autoencoding	Rodney Martinez Alonso, Cel Thys, Cedric Dehos, Yuneisy Esthela Garcia Guzman, Sofie Pollin	2026-01-16	Download	This paper proposes a novel technique for rejecting partial-in-band inter-cell interference (ICI) in ultrawideband communication systems. We present the design of an end-to-end wireless autoencoder ar...
Age-Based Scheduling for a Memory-Constrained Quantum Switch	Stavros Mitrolaris, Subhankar Banerjee, Sennur Ulukus	2026-01-16	Download	In a time-slotted system, we study the problem of scheduling multipartite entanglement requests in a quantum switch with a finite number of quantum memory registers.
Convergence Properties of Good Quantum Codes for Classical Communication	Alptug Aytekin, Mohamed Nomeir, Lei Hu, Sennur Ulukus	2026-01-16	Download	An important part of the information theory folklore had been about the output statistics of codes that achieve the capacity and how the empirical distributions compare to the output distributions ind...
Indoor Neutral-Host Networks Over Shared Spectrum and Shared Infrastructure: A Comparison Study of Real-World Deployments	Joshua Roy Palathinkal, Muhammad Iqbal Rochman, Vanlin Sathya, Mehmet Yavuz, Monisha Ghosh	2026-01-16	Download	Indoor high-capacity connectivity is frequently constrained by significant building penetration loss and the inherent uplink power limitations of a typical outdoor macro-cell deployment.
X-raying the arXiv: A Large-Scale Analysis of arXiv Submissions' Source Files	Giovanni Apruzzese, Aurore Fass	2026-01-16	Download	arXiv is the largest open-access repository for scientific literature. When submitting a paper, authors upload the manuscript's source files, from which the final PDF is compiled.
A Survey on Mapping Digital Systems with Bill of Materials: Development, Practices, and Challenges	Shuai Zhang, Minzhao Lyu, Hassan Habibi Gharakheili	2026-01-16	Download	Modern digital ecosystems, spanning software, hardware, learning models, datasets, and cryptographic products, continue to grow in complexity, making it difficult for organizations to understand and m...
HALO: Semantic-Aware Distributed LLM Inference in Lossy Edge Network	Peirong Zheng, Wenchao Xu, Haozhao Wang, Jinyu Chen, Xuemin Shen	2026-01-16	Download	The deployment of large language models' (LLMs) inference at the edge can facilitate prompt service responsiveness while protecting user privacy.
AFLL: Real-time Load Stabilization for MMO Game Servers Based on Circular Causality Learning	Shinsuk Kang, Youngjae Kim	2026-01-16	Download	Massively Multiplayer Online (MMO) game servers must handle thousands of simultaneous players while maintaining sub-100ms response times. When server load exceeds capacity, traditional approaches eith...
Fundamental Limits of Quantum Semantic Communication via Sheaf Cohomology	Christo Kurisummoottil Thomas, Mingzhe Chen	2026-01-16	Download	Semantic communication (SC) enables bandwidth-efficient coordination in multi-agent systems by transmitting meaning rather than raw bits. However, when agents employ heterogeneous sensing modalities a...

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
Nixie: Efficient, Transparent Temporal Multiplexing for Consumer GPUs	Yechen Xu, Yifei Wang, Nathanael Ren, Yiran Chen, Danyang Zhuo	2026-01-16	Download	Consumer machines are increasingly running large ML workloads such as large language models (LLMs), text-to-image generation, and interactive image editing.

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Offline Reinforcement-Learning-Based Power Control for Application-Agnostic Energy Efficiency	Akhilesh Raj, Swann Perarnau, Aniruddha Gokhale, Solomon Bekele Abera	2026-01-16	Download	Energy efficiency has become an integral aspect of modern computing infrastructure design, impacting the performance, cost, scalability, and durability of production systems.
AFLL: Real-time Load Stabilization for MMO Game Servers Based on Circular Causality Learning	Shinsuk Kang, Youngjae Kim	2026-01-16	Download	Massively Multiplayer Online (MMO) game servers must handle thousands of simultaneous players while maintaining sub-100ms response times. When server load exceeds capacity, traditional approaches eith...