2026-01-16
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| NuRedact: Non-Uniform eFPGA Architecture for Low-Overhead and Secure IP Redaction | Voktho Das, Kimia Azar, Hadi Kamali | 2026-01-16 | Download | While logic locking has been extensively studied as a countermeasure against integrated circuit (IC) supply chain threats, recent research has shifted toward reconfigurable-based redaction techniques,... |
| Bench4HLS: End-to-End Evaluation of LLMs in High-Level Synthesis Code Generation | M Zafir Sadik Khan, Kimia Azar, Hadi Kamali | 2026-01-16 | Download | In the last two years, large language models (LLMs) have shown strong capabilities in code generation, including hardware design at register-transfer level (RTL). |
| Continuous-Flow Data-Rate-Aware CNN Inference on FPGA | Tobias Habermann, Michael Mecik, Zhenyu Wang, César David Vera, Martin Kumm, Mario Garrido | 2026-01-16 | Download | Among hardware accelerators for deep-learning inference, data flow implementations offer low latency and high throughput capabilities. In these architectures, each neuron is mapped to a dedicated hard... |
| IMS: Intelligent Hardware Monitoring System for Secure SoCs | Wadid Foudhaili, Aykut Rencber, Anouar Nechi, Rainer Buchty, Mladen Berekovic, Andres Gomez, Saleh Mulhem | 2026-01-16 | Download | In modern Systems-on-Chip (SoCs), the Advanced eXtensible Interface (AXI) protocol exhibits security vulnerabilities, enabling partial or complete denial-of-service (DoS) through protocol-violation... |
| InterPUF: Distributed Authentication via Physically Unclonable Functions and Multi-party Computation for Reconfigurable Interposers | Ishraq Tashdid, Tasnuva Farheen, Sazadur Rahman | 2026-01-16 | Download | Modern system-in-package (SiP) platforms increasingly adopt reconfigurable interposers to enable plug-and-play chiplet integration across heterogeneous multi-vendor ecosystems. |
| OpenACM: An Open-Source SRAM-Based Approximate CiM Compiler | Yiqi Zhou, JunHao Ma, Xingyang Li, Yule Sheng, Yue Yuan, Yikai Wang, Bochang Wang, Yiheng Wu, Shan Shen, Wei Xing, Daying Sun, Li Li, Zhiqiang Xiao | 2026-01-16 | Download | The rise of data-intensive AI workloads has exacerbated the "memory wall" bottleneck. Digital Compute-in-Memory (DCiM) using SRAM offers a scalable solution, but its vast design space makes manual d... |
| RidgeWalker: Perfectly Pipelined Graph Random Walks on FPGAs | Hongshi Tan, Yao Chen, Xinyu Chen, Qizhen Zhang, Cheng Chen, Weng-Fai Wong, Bingsheng He | 2026-01-16 | Download | Graph Random Walks (GRWs) offer efficient approximations of key graph properties and have been widely adopted in many applications. However, GRW workloads are notoriously difficult to accelerate due t... |
| SwiftKV: An Edge-Oriented Attention Algorithm and Multi-Head Accelerator for Fast, Efficient LLM Decoding | Junming Zhang, Qinyan Zhang, Huajun Sun, Feiyang Gao, Sheng Hu, Rui Nie, Xiangshui Miao | 2026-01-16 | Download | Edge acceleration for large language models is crucial for their widespread application; however, achieving fast attention inference and efficient decoding on resource-constrained edge accelerators re... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| RAPID-Serve: Resource-efficient and Accelerated P/D Intra-GPU Disaggregation | Amna Masood, Pratishtha Gaur, Nuwan Jayasena | 2026-01-16 | Download | Two widely adopted techniques for LLM inference serving systems today are hybrid batching and disaggregated serving. A hybrid batch combines prefill and decode tokens of different requests in the same... |
| SIVF: GPU-Resident IVF Index for Streaming Vector Search | Dongfang Zhao | 2026-01-16 | Download | GPU-accelerated Inverted File (IVF) index is one of the industry standards for large-scale vector search but relies on static VRAM layouts that hinder real-time mutability. |
| Nixie: Efficient, Transparent Temporal Multiplexing for Consumer GPUs | Yechen Xu, Yifei Wang, Nathanael Ren, Yiran Chen, Danyang Zhuo | 2026-01-16 | Download | Consumer machines are increasingly running large ML workloads such as large language models (LLMs), text-to-image generation, and interactive image editing. |
| Space-Optimal, Computation-Optimal, Topology-Agnostic, Throughput-Scalable Causal Delivery through Hybrid Buffering | Paulo Sérgio Almeida | 2026-01-16 | Download | Message delivery respecting causal ordering (causal delivery) is one of the most classic and widely useful abstractions for inter-process communication in a distributed system. |
| DecHW: Heterogeneous Decentralized Federated Learning Exploiting Second-Order Information | Adnan Ahmad, Chiara Boldrini, Lorenzo Valerio, Andrea Passarella, Marco Conti | 2026-01-16 | Download | Decentralized Federated Learning (DFL) is a serverless collaborative machine learning paradigm where devices collaborate directly with neighbouring devices to exchange model information for learning a... |
| Konflux: Optimized Function Fusion for Serverless Applications | Niklas Kowallik, Trever Schirmer, David Bermbach | 2026-01-16 | Download | Function-as-a-Service (FaaS) has become a central paradigm in serverless cloud computing, yet optimizing FaaS deployments remains challenging. |
| HALO: Semantic-Aware Distributed LLM Inference in Lossy Edge Network | Peirong Zheng, Wenchao Xu, Haozhao Wang, Jinyu Chen, Xuemin Shen | 2026-01-16 | Download | Deploying large language model (LLM) inference at the edge can facilitate prompt service responsiveness while protecting user privacy. |
| AFLL: Real-time Load Stabilization for MMO Game Servers Based on Circular Causality Learning | Shinsuk Kang, Youngjae Kim | 2026-01-16 | Download | Massively Multiplayer Online (MMO) game servers must handle thousands of simultaneous players while maintaining sub-100ms response times. When server load exceeds capacity, traditional approaches eith... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Inter-Cell Interference Rejection Based on Ultrawideband Walsh-Domain Wireless Autoencoding | Rodney Martinez Alonso, Cel Thys, Cedric Dehos, Yuneisy Esthela Garcia Guzman, Sofie Pollin | 2026-01-16 | Download | This paper proposes a novel technique for rejecting partial-in-band inter-cell interference (ICI) in ultrawideband communication systems. We present the design of an end-to-end wireless autoencoder ar... |
| Age-Based Scheduling for a Memory-Constrained Quantum Switch | Stavros Mitrolaris, Subhankar Banerjee, Sennur Ulukus | 2026-01-16 | Download | In a time-slotted system, we study the problem of scheduling multipartite entanglement requests in a quantum switch with a finite number of quantum memory registers. |
| Convergence Properties of Good Quantum Codes for Classical Communication | Alptug Aytekin, Mohamed Nomeir, Lei Hu, Sennur Ulukus | 2026-01-16 | Download | An important part of the information theory folklore had been about the output statistics of codes that achieve the capacity and how the empirical distributions compare to the output distributions ind... |
| Indoor Neutral-Host Networks Over Shared Spectrum and Shared Infrastructure: A Comparison Study of Real-World Deployments | Joshua Roy Palathinkal, Muhammad Iqbal Rochman, Vanlin Sathya, Mehmet Yavuz, Monisha Ghosh | 2026-01-16 | Download | Indoor high-capacity connectivity is frequently constrained by significant building penetration loss and the inherent uplink power limitations of a typical outdoor macro-cell deployment. |
| X-raying the arXiv: A Large-Scale Analysis of arXiv Submissions' Source Files | Giovanni Apruzzese, Aurore Fass | 2026-01-16 | Download | arXiv is the largest open-access repository for scientific literature. When submitting a paper, authors upload the manuscript's source files, from which the final PDF is compiled. |
| A Survey on Mapping Digital Systems with Bill of Materials: Development, Practices, and Challenges | Shuai Zhang, Minzhao Lyu, Hassan Habibi Gharakheili | 2026-01-16 | Download | Modern digital ecosystems, spanning software, hardware, learning models, datasets, and cryptographic products, continue to grow in complexity, making it difficult for organizations to understand and m... |
| HALO: Semantic-Aware Distributed LLM Inference in Lossy Edge Network | Peirong Zheng, Wenchao Xu, Haozhao Wang, Jinyu Chen, Xuemin Shen | 2026-01-16 | Download | Deploying large language model (LLM) inference at the edge can facilitate prompt service responsiveness while protecting user privacy. |
| AFLL: Real-time Load Stabilization for MMO Game Servers Based on Circular Causality Learning | Shinsuk Kang, Youngjae Kim | 2026-01-16 | Download | Massively Multiplayer Online (MMO) game servers must handle thousands of simultaneous players while maintaining sub-100ms response times. When server load exceeds capacity, traditional approaches eith... |
| Fundamental Limits of Quantum Semantic Communication via Sheaf Cohomology | Christo Kurisummoottil Thomas, Mingzhe Chen | 2026-01-16 | Download | Semantic communication (SC) enables bandwidth-efficient coordination in multi-agent systems by transmitting meaning rather than raw bits. However, when agents employ heterogeneous sensing modalities a... |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Nixie: Efficient, Transparent Temporal Multiplexing for Consumer GPUs | Yechen Xu, Yifei Wang, Nathanael Ren, Yiran Chen, Danyang Zhuo | 2026-01-16 | Download | Consumer machines are increasingly running large ML workloads such as large language models (LLMs), text-to-image generation, and interactive image editing. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Offline Reinforcement-Learning-Based Power Control for Application-Agnostic Energy Efficiency | Akhilesh Raj, Swann Perarnau, Aniruddha Gokhale, Solomon Bekele Abera | 2026-01-16 | Download | Energy efficiency has become an integral aspect of modern computing infrastructure design, impacting the performance, cost, scalability, and durability of production systems. |
| AFLL: Real-time Load Stabilization for MMO Game Servers Based on Circular Causality Learning | Shinsuk Kang, Youngjae Kim | 2026-01-16 | Download | Massively Multiplayer Online (MMO) game servers must handle thousands of simultaneous players while maintaining sub-100ms response times. When server load exceeds capacity, traditional approaches eith... |