2026-01-14

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Enhancing LUT-based Deep Neural Networks Inference through Architecture and Connectivity Optimization	Binglei Lou, Ruilin Wu, Philip Leong	2026-01-14	下载	Deploying deep neural networks (DNNs) on resource-constrained edge devices such as FPGAs requires a careful balance among latency, power, and hardware resource usage, while maintaining high accuracy.
Late Breaking Results: Quamba-SE: Soft-edge Quantizer for Activations in State Space Models	Yizhi Chen, Ahmed Hemani	2026-01-14	下载	We propose Quamba-SE, a soft-edge quantizer for State Space Model (SSM) activation quantization. Unlike existing methods, using standard INT8 operation, Quamba-SE employs three adaptive scales: high-p...
Relational Hoare Logic for High-Level Synthesis of Hardware Accelerators	Izumi Tanaka, Ken Sakayori, Shinya Takamaeda-Yamazaki, Naoki Kobayashi	2026-01-14	下载	High-level synthesis (HLS) is a powerful tool for developing efficient hardware accelerators that rely on specialized memory systems to achieve sufficient on-chip data reuse and off-chip bandwidth uti...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
QFed: Parameter-Compact Quantum-Classical Federated Learning	Samar Abdelghani, Soumaya Cherkaoui	2026-01-14	下载	Organizations and enterprises across domains such as healthcare, finance, and scientific research are increasingly required to extract collective intelligence from distributed, siloed datasets while a...
AI-NativeBench: An Open-Source White-Box Agentic Benchmark Suite for AI-Native Systems	Zirui Wang, Guangba Yu, Michael R. Lyu	2026-01-14	下载	The transition from Cloud-Native to AI-Native architectures is fundamentally reshaping software engineering, replacing deterministic microservices with probabilistic agentic services.
Network-Based Quantum Computing: an efficient design framework for many-small-node distributed fault-tolerant quantum computing	Soshun Naito, Yasunari Suzuki, Yuuki Tokunaga	2026-01-14	下载	In fault-tolerant quantum computing, a large number of physical qubits are required to construct a single logical qubit, and a single quantum node may be able to hold only a small number of logical qu...
High-Performance Serverless Computing: A Systematic Literature Review on Serverless for HPC, AI, and Big Data	Valerio Besozzi, Matteo Della Bartola, Patrizio Dazzi, Marco Danelutto	2026-01-14	下载	The widespread deployment of large-scale, compute-intensive applications such as high-performance computing, artificial intelligence, and big data is leading to convergence between cloud and high-perf...
Cluster Workload Allocation: Semantic Soft Affinity Using Natural Language Processing	Leszek Sliwko, Jolanta Mizeria-Pietraszko	2026-01-14	下载	Cluster workload allocation often requires complex configurations, creating a usability gap. This paper introduces a semantic, intent-driven scheduling paradigm for cluster systems using Natural Langu...
LatencyPrism: Online Non-intrusive Latency Sculpting for SLO-Guaranteed LLM Inference	Yin Du, Jiayi Ren, Xiayu Sun, Tianyao Zhou, Haizhu Zhou, Ruiyan Ma, Danyang Zhang	2026-01-14	下载	LLM inference latency critically determines user experience and operational costs, directly impacting throughput under SLO constraints. Even brief latency spikes degrade service quality despite accept...
Optimizing View Change for Byzantine Fault Tolerance in Parallel Consensus	Yifei Xie, Btissam Er-Rahmadi, Xiao Chen, Tiejun Ma, Jane Hillston	2026-01-14	下载	The parallel Byzantine Fault Tolerant (BFT) protocol is viewed as a promising solution to address the consensus scalability issue of the permissioned blockchain.
DP-FedSOFIM: Differentially Private Federated Stochastic Optimization using Regularized Fisher Information Matrix	Sidhant Nair, Tanmay Sen, Mrinmay Sen, Sayantan Banerjee	2026-01-14	下载	Differentially private federated learning (DP-FL) often suffers from slow convergence under tight privacy budgets because the noise required for privacy preservation degrades gradient quality.
Transaction-Driven Dynamic Reconfiguration for Certificate-Based Payment Systems	Lingkang Shangguan	2026-01-14	下载	We present a transaction-driven dynamic reconfiguration protocol in Modern payment systems based on Byzantine Consistent Broadcast which can achieve high performance by avoiding global transaction ord...
A Machine Learning Approach Towards Runtime Optimisation of Matrix Multiplication	Yufan Xia, Marco De La Pierre, Amanda S. Barnard, Giuseppe Maria Junior Barca	2026-01-14	下载	The GEneral Matrix Multiplication (GEMM) is one of the essential algorithms in scientific computing. Single-thread GEMM implementations are well-optimised with techniques like blocking and autotuning.
Lean Clients, Full Accuracy: Hybrid Zeroth- and First-Order Split Federated Learning	Zhoubin Kou, Zihan Chen, Jing Yang, Cong Shen	2026-01-14	下载	Split Federated Learning (SFL) enables collaborative training between resource-constrained edge devices and a compute-rich server. Communication overhead is a central issue in SFL and can be mitigated...
Probabilistic Computers for MIMO Detection: From Sparsification to 2D Parallel Tempering	M Mahmudul Hasan Sajeeb, Corentin Delacour, Kevin Callahan-Coray, Sanjay Seshan, Tathagata Srimani, Kerem Y. Camsari	2026-01-14	下载	Probabilistic computers built from p-bits offer a promising path for combinatorial optimization, but the dense connectivity required by real-world problems scales poorly in hardware.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
A Novel Contrastive Loss for Zero-Day Network Intrusion Detection	Jack Wilkie, Hanan Hindy, Craig Michie, Christos Tachtatzis, James Irvine, Robert Atkinson	2026-01-14	下载	Machine learning has achieved state-of-the-art results in network intrusion detection; however, its performance significantly degrades when confronted by a new attack class -- a zero-day attack.
FairShare: Auditable Geographic Fairness for Multi-Operator LEO Spectrum Sharing	Seyed Bagher Hashemi Natanzi, Hossein Mohammadi, Vuk Marojevic, Bo Tang	2026-01-14	下载	Dynamic spectrum sharing (DSS) among multi-operator low Earth orbit (LEO) mega-constellations is essential for coexistence, yet prevailing policies focus almost exclusively on interference mitigation,...
UAV-enabled Computing Power Networks: Design and Performance Analysis under Energy Constraints	Yiqin Deng, Zhengru Fang, Senkang Hu, Yanan Ma, Xiaoyu Guo, Haixia Zhang, Yuguang Fang	2026-01-14	下载	This paper presents an innovative framework that boosts computing power by utilizing ubiquitous computing power distribution and enabling higher computing node accessibility via adaptive UAV positioni...
Lean Clients, Full Accuracy: Hybrid Zeroth- and First-Order Split Federated Learning	Zhoubin Kou, Zihan Chen, Jing Yang, Cong Shen	2026-01-14	下载	Split Federated Learning (SFL) enables collaborative training between resource-constrained edge devices and a compute-rich server. Communication overhead is a central issue in SFL and can be mitigated...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
LatencyPrism: Online Non-intrusive Latency Sculpting for SLO-Guaranteed LLM Inference	Yin Du, Jiayi Ren, Xiayu Sun, Tianyao Zhou, Haizhu Zhou, Ruiyan Ma, Danyang Zhang	2026-01-14	下载	LLM inference latency critically determines user experience and operational costs, directly impacting throughput under SLO constraints. Even brief latency spikes degrade service quality despite accept...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Private LLM Inference on Consumer Blackwell GPUs: A Practical Guide for Cost-Effective Local Deployment in SMEs	Jonathan Knoop, Hendrik Holtmann	2026-01-14	下载	SMEs increasingly seek alternatives to cloud LLM APIs, which raise data privacy concerns. Dedicated cloud GPU instances offer improved privacy but with limited guarantees and ongoing costs, while prof...
AI-NativeBench: An Open-Source White-Box Agentic Benchmark Suite for AI-Native Systems	Zirui Wang, Guangba Yu, Michael R. Lyu	2026-01-14	下载	The transition from Cloud-Native to AI-Native architectures is fundamentally reshaping software engineering, replacing deterministic microservices with probabilistic agentic services.