Skip to content

2026-01-16

cs.AR - Architecture

标题作者发布日期PDF摘要
NuRedact: Non-Uniform eFPGA Architecture for Low-Overhead and Secure IP RedactionVoktho Das, Kimia Azar, Hadi Kamali2026-01-16下载While logic locking has been extensively studied as a countermeasure against integrated circuit (IC) supply chain threats, recent research has shifted toward reconfigurable-based redaction techniques,...
Bench4HLS: End-to-End Evaluation of LLMs in High-Level Synthesis Code GenerationM Zafir Sadik Khan, Kimia Azar, Hadi Kamali2026-01-16下载In last two years, large language models (LLMs) have shown strong capabilities in code generation, including hardware design at register-transfer level (RTL).
Continuous-Flow Data-Rate-Aware CNN Inference on FPGATobias Habermann, Michael Mecik, Zhenyu Wang, César David Vera, Martin Kumm, Mario Garrido2026-01-16下载Among hardware accelerators for deep-learning inference, data flow implementations offer low latency and high throughput capabilities. In these architectures, each neuron is mapped to a dedicated hard...
IMS: Intelligent Hardware Monitoring System for Secure SoCsWadid Foudhaili, Aykut Rencber, Anouar Nechi, Rainer Buchty, Mladen Berekovic, Andres Gomez, Saleh Mulhem2026-01-16下载In the modern Systems-on-Chip (SoC), the Advanced eXtensible Interface (AXI) protocol exhibits security vulnerabilities, enabling partial or complete denial-of-service (DoS) through protocol-violation...
InterPUF: Distributed Authentication via Physically Unclonable Functions and Multi-party Computation for Reconfigurable InterposersIshraq Tashdid, Tasnuva Farheen, Sazadur Rahman2026-01-16下载Modern system-in-package (SiP) platforms increasingly adopt reconfigurable interposers to enable plug-and-play chiplet integration across heterogeneous multi-vendor ecosystems.
OpenACM: An Open-Source SRAM-Based Approximate CiM CompilerYiqi Zhou, JunHao Ma, Xingyang Li, Yule Sheng, Yue Yuan, Yikai Wang, Bochang Wang, Yiheng Wu, Shan Shen, Wei Xing, Daying Sun, Li Li, Zhiqiang Xiao2026-01-16下载The rise of data-intensive AI workloads has exacerbated the ``memory wall'' bottleneck. Digital Compute-in-Memory (DCiM) using SRAM offers a scalable solution, but its vast design space makes manual d...
RidgeWalker: Perfectly Pipelined Graph Random Walks on FPGAsHongshi Tan, Yao Chen, Xinyu Chen, Qizhen Zhang, Cheng Chen, Weng-Fai Wong, Bingsheng He2026-01-16下载Graph Random Walks (GRWs) offer efficient approximations of key graph properties and have been widely adopted in many applications. However, GRW workloads are notoriously difficult to accelerate due t...
SwiftKV: An Edge-Oriented Attention Algorithm and Multi-Head Accelerator for Fast, Efficient LLM DecodingJunming Zhang, Qinyan Zhang, Huajun Sun, Feiyang Gao, Sheng Hu, Rui Nie, Xiangshui Miao2026-01-16下载Edge acceleration for large language models is crucial for their widespread application; however, achieving fast attention inference and efficient decoding on resource-constrained edge accelerators re...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
RAPID-Serve: Resource-efficient and Accelerated P/D Intra-GPU DisaggregationAmna Masood, Pratishtha Gaur, Nuwan Jayasena2026-01-16下载Two widely adopted techniques for LLM inference serving systems today are hybrid batching and disaggregated serving. A hybrid batch combines prefill and decode tokens of different requests in the same...
SIVF: GPU-Resident IVF Index for Streaming Vector SearchDongfang Zhao2026-01-16下载GPU-accelerated Inverted File (IVF) index is one of the industry standards for large-scale vector search but relies on static VRAM layouts that hinder real-time mutability.
Nixie: Efficient, Transparent Temporal Multiplexing for Consumer GPUsYechen Xu, Yifei Wang, Nathanael Ren, Yiran Chen, Danyang Zhuo2026-01-16下载Consumer machines are increasingly running large ML workloads such as large language models (LLMs), text-to-image generation, and interactive image editing.
Space-Optimal, Computation-Optimal, Topology-Agnostic, Throughput-Scalable Causal Delivery through Hybrid BufferingPaulo Sérgio Almeida2026-01-16下载Message delivery respecting causal ordering (causal delivery) is one of the most classic and widely useful abstraction for inter-process communication in a distributed system.
DecHW: Heterogeneous Decentralized Federated Learning Exploiting Second-Order InformationAdnan Ahmad, Chiara Boldrini, Lorenzo Valerio, Andrea Passarella, Marco Conti2026-01-16下载Decentralized Federated Learning (DFL) is a serverless collaborative machine learning paradigm where devices collaborate directly with neighbouring devices to exchange model information for learning a...
Konflux: Optimized Function Fusion for Serverless ApplicationsNiklas Kowallik, Trever Schirmer, David Bermbach2026-01-16下载Function-as-a-Service (FaaS) has become a central paradigm in serverless cloud computing, yet optimizing FaaS deployments remains challenging.
HALO: Semantic-Aware Distributed LLM Inference in Lossy Edge NetworkPeirong Zheng, Wenchao Xu, Haozhao Wang, Jinyu Chen, Xuemin Shen2026-01-16下载The deployment of large language models' (LLMs) inference at the edge can facilitate prompt service responsiveness while protecting user privacy.
AFLL: Real-time Load Stabilization for MMO Game Servers Based on Circular Causality LearningShinsuk Kang, Youngjae Kim2026-01-16下载Massively Multiplayer Online (MMO) game servers must handle thousands of simultaneous players while maintaining sub-100ms response times. When server load exceeds capacity, traditional approaches eith...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Inter-Cell Interference Rejection Based on Ultrawideband Walsh-Domain Wireless AutoencodingRodney Martinez Alonso, Cel Thys, Cedric Dehos, Yuneisy Esthela Garcia Guzman, Sofie Pollin2026-01-16下载This paper proposes a novel technique for rejecting partial-in-band inter-cell interference (ICI) in ultrawideband communication systems. We present the design of an end-to-end wireless autoencoder ar...
Age-Based Scheduling for a Memory-Constrained Quantum SwitchStavros Mitrolaris, Subhankar Banerjee, Sennur Ulukus2026-01-16下载In a time-slotted system, we study the problem of scheduling multipartite entanglement requests in a quantum switch with a finite number of quantum memory registers.
Convergence Properties of Good Quantum Codes for Classical CommunicationAlptug Aytekin, Mohamed Nomeir, Lei Hu, Sennur Ulukus2026-01-16下载An important part of the information theory folklore had been about the output statistics of codes that achieve the capacity and how the empirical distributions compare to the output distributions ind...
Indoor Neutral-Host Networks Over Shared Spectrum and Shared Infrastructure: A Comparison Study of Real-World DeploymentsJoshua Roy Palathinkal, Muhammad Iqbal Rochman, Vanlin Sathya, Mehmet Yavuz, Monisha Ghosh2026-01-16下载Indoor high-capacity connectivity is frequently constrained by significant building penetration loss and the inherent uplink power limitations of a typical outdoor macro-cell deployment.
X-raying the arXiv: A Large-Scale Analysis of arXiv Submissions' Source FilesGiovanni Apruzzese, Aurore Fass2026-01-16下载arXiv is the largest open-access repository for scientific literature. When submitting a paper, authors upload the manuscript's source files, from which the final PDF is compiled.
A Survey on Mapping Digital Systems with Bill of Materials: Development, Practices, and ChallengesShuai Zhang, Minzhao Lyu, Hassan Habibi Gharakheili2026-01-16下载Modern digital ecosystems, spanning software, hardware, learning models, datasets, and cryptographic products, continue to grow in complexity, making it difficult for organizations to understand and m...
HALO: Semantic-Aware Distributed LLM Inference in Lossy Edge NetworkPeirong Zheng, Wenchao Xu, Haozhao Wang, Jinyu Chen, Xuemin Shen2026-01-16下载The deployment of large language models' (LLMs) inference at the edge can facilitate prompt service responsiveness while protecting user privacy.
AFLL: Real-time Load Stabilization for MMO Game Servers Based on Circular Causality LearningShinsuk Kang, Youngjae Kim2026-01-16下载Massively Multiplayer Online (MMO) game servers must handle thousands of simultaneous players while maintaining sub-100ms response times. When server load exceeds capacity, traditional approaches eith...
Fundamental Limits of Quantum Semantic Communication via Sheaf CohomologyChristo Kurisummoottil Thomas, Mingzhe Chen2026-01-16下载Semantic communication (SC) enables bandwidth-efficient coordination in multi-agent systems by transmitting meaning rather than raw bits. However, when agents employ heterogeneous sensing modalities a...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Nixie: Efficient, Transparent Temporal Multiplexing for Consumer GPUsYechen Xu, Yifei Wang, Nathanael Ren, Yiran Chen, Danyang Zhuo2026-01-16下载Consumer machines are increasingly running large ML workloads such as large language models (LLMs), text-to-image generation, and interactive image editing.

cs.PF - Performance

标题作者发布日期PDF摘要
Offline Reinforcement-Learning-Based Power Control for Application-Agnostic Energy EfficiencyAkhilesh Raj, Swann Perarnau, Aniruddha Gokhale, Solomon Bekele Abera2026-01-16下载Energy efficiency has become an integral aspect of modern computing infrastructure design, impacting the performance, cost, scalability, and durability of production systems.
AFLL: Real-time Load Stabilization for MMO Game Servers Based on Circular Causality LearningShinsuk Kang, Youngjae Kim2026-01-16下载Massively Multiplayer Online (MMO) game servers must handle thousands of simultaneous players while maintaining sub-100ms response times. When server load exceeds capacity, traditional approaches eith...

基于 VitePress 构建