2026-01-14

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Enhancing LUT-based Deep Neural Networks Inference through Architecture and Connectivity Optimization | Binglei Lou, Ruilin Wu, Philip Leong | 2026-01-14 | Download | Deploying deep neural networks (DNNs) on resource-constrained edge devices such as FPGAs requires a careful balance among latency, power, and hardware resource usage, while maintaining high accuracy. |
| Late Breaking Results: Quamba-SE: Soft-edge Quantizer for Activations in State Space Models | Yizhi Chen, Ahmed Hemani | 2026-01-14 | Download | We propose Quamba-SE, a soft-edge quantizer for State Space Model (SSM) activation quantization. Unlike existing methods, using standard INT8 operation, Quamba-SE employs three adaptive scales: high-p... |
| Relational Hoare Logic for High-Level Synthesis of Hardware Accelerators | Izumi Tanaka, Ken Sakayori, Shinya Takamaeda-Yamazaki, Naoki Kobayashi | 2026-01-14 | Download | High-level synthesis (HLS) is a powerful tool for developing efficient hardware accelerators that rely on specialized memory systems to achieve sufficient on-chip data reuse and off-chip bandwidth uti... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| QFed: Parameter-Compact Quantum-Classical Federated Learning | Samar Abdelghani, Soumaya Cherkaoui | 2026-01-14 | Download | Organizations and enterprises across domains such as healthcare, finance, and scientific research are increasingly required to extract collective intelligence from distributed, siloed datasets while a... |
| AI-NativeBench: An Open-Source White-Box Agentic Benchmark Suite for AI-Native Systems | Zirui Wang, Guangba Yu, Michael R. Lyu | 2026-01-14 | Download | The transition from Cloud-Native to AI-Native architectures is fundamentally reshaping software engineering, replacing deterministic microservices with probabilistic agentic services. |
| Network-Based Quantum Computing: an efficient design framework for many-small-node distributed fault-tolerant quantum computing | Soshun Naito, Yasunari Suzuki, Yuuki Tokunaga | 2026-01-14 | Download | In fault-tolerant quantum computing, a large number of physical qubits are required to construct a single logical qubit, and a single quantum node may be able to hold only a small number of logical qu... |
| High-Performance Serverless Computing: A Systematic Literature Review on Serverless for HPC, AI, and Big Data | Valerio Besozzi, Matteo Della Bartola, Patrizio Dazzi, Marco Danelutto | 2026-01-14 | Download | The widespread deployment of large-scale, compute-intensive applications such as high-performance computing, artificial intelligence, and big data is leading to convergence between cloud and high-perf... |
| Cluster Workload Allocation: Semantic Soft Affinity Using Natural Language Processing | Leszek Sliwko, Jolanta Mizeria-Pietraszko | 2026-01-14 | Download | Cluster workload allocation often requires complex configurations, creating a usability gap. This paper introduces a semantic, intent-driven scheduling paradigm for cluster systems using Natural Langu... |
| LatencyPrism: Online Non-intrusive Latency Sculpting for SLO-Guaranteed LLM Inference | Yin Du, Jiayi Ren, Xiayu Sun, Tianyao Zhou, Haizhu Zhou, Ruiyan Ma, Danyang Zhang | 2026-01-14 | Download | LLM inference latency critically determines user experience and operational costs, directly impacting throughput under SLO constraints. Even brief latency spikes degrade service quality despite accept... |
| Optimizing View Change for Byzantine Fault Tolerance in Parallel Consensus | Yifei Xie, Btissam Er-Rahmadi, Xiao Chen, Tiejun Ma, Jane Hillston | 2026-01-14 | Download | The parallel Byzantine Fault Tolerant (BFT) protocol is viewed as a promising solution to address the consensus scalability issue of the permissioned blockchain. |
| DP-FedSOFIM: Differentially Private Federated Stochastic Optimization using Regularized Fisher Information Matrix | Sidhant Nair, Tanmay Sen, Mrinmay Sen, Sayantan Banerjee | 2026-01-14 | Download | Differentially private federated learning (DP-FL) often suffers from slow convergence under tight privacy budgets because the noise required for privacy preservation degrades gradient quality. |
| Transaction-Driven Dynamic Reconfiguration for Certificate-Based Payment Systems | Lingkang Shangguan | 2026-01-14 | Download | We present a transaction-driven dynamic reconfiguration protocol in modern payment systems based on Byzantine Consistent Broadcast which can achieve high performance by avoiding global transaction ord... |
| A Machine Learning Approach Towards Runtime Optimisation of Matrix Multiplication | Yufan Xia, Marco De La Pierre, Amanda S. Barnard, Giuseppe Maria Junior Barca | 2026-01-14 | Download | The GEneral Matrix Multiplication (GEMM) is one of the essential algorithms in scientific computing. Single-thread GEMM implementations are well-optimised with techniques like blocking and autotuning. |
| Lean Clients, Full Accuracy: Hybrid Zeroth- and First-Order Split Federated Learning | Zhoubin Kou, Zihan Chen, Jing Yang, Cong Shen | 2026-01-14 | Download | Split Federated Learning (SFL) enables collaborative training between resource-constrained edge devices and a compute-rich server. Communication overhead is a central issue in SFL and can be mitigated... |
| Probabilistic Computers for MIMO Detection: From Sparsification to 2D Parallel Tempering | M Mahmudul Hasan Sajeeb, Corentin Delacour, Kevin Callahan-Coray, Sanjay Seshan, Tathagata Srimani, Kerem Y. Camsari | 2026-01-14 | Download | Probabilistic computers built from p-bits offer a promising path for combinatorial optimization, but the dense connectivity required by real-world problems scales poorly in hardware. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A Novel Contrastive Loss for Zero-Day Network Intrusion Detection | Jack Wilkie, Hanan Hindy, Craig Michie, Christos Tachtatzis, James Irvine, Robert Atkinson | 2026-01-14 | Download | Machine learning has achieved state-of-the-art results in network intrusion detection; however, its performance significantly degrades when confronted by a new attack class -- a zero-day attack. |
| FairShare: Auditable Geographic Fairness for Multi-Operator LEO Spectrum Sharing | Seyed Bagher Hashemi Natanzi, Hossein Mohammadi, Vuk Marojevic, Bo Tang | 2026-01-14 | Download | Dynamic spectrum sharing (DSS) among multi-operator low Earth orbit (LEO) mega-constellations is essential for coexistence, yet prevailing policies focus almost exclusively on interference mitigation,... |
| UAV-enabled Computing Power Networks: Design and Performance Analysis under Energy Constraints | Yiqin Deng, Zhengru Fang, Senkang Hu, Yanan Ma, Xiaoyu Guo, Haixia Zhang, Yuguang Fang | 2026-01-14 | Download | This paper presents an innovative framework that boosts computing power by utilizing ubiquitous computing power distribution and enabling higher computing node accessibility via adaptive UAV positioni... |
| Lean Clients, Full Accuracy: Hybrid Zeroth- and First-Order Split Federated Learning | Zhoubin Kou, Zihan Chen, Jing Yang, Cong Shen | 2026-01-14 | Download | Split Federated Learning (SFL) enables collaborative training between resource-constrained edge devices and a compute-rich server. Communication overhead is a central issue in SFL and can be mitigated... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| LatencyPrism: Online Non-intrusive Latency Sculpting for SLO-Guaranteed LLM Inference | Yin Du, Jiayi Ren, Xiayu Sun, Tianyao Zhou, Haizhu Zhou, Ruiyan Ma, Danyang Zhang | 2026-01-14 | Download | LLM inference latency critically determines user experience and operational costs, directly impacting throughput under SLO constraints. Even brief latency spikes degrade service quality despite accept... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Private LLM Inference on Consumer Blackwell GPUs: A Practical Guide for Cost-Effective Local Deployment in SMEs | Jonathan Knoop, Hendrik Holtmann | 2026-01-14 | Download | SMEs increasingly seek alternatives to cloud LLM APIs, which raise data privacy concerns. Dedicated cloud GPU instances offer improved privacy but with limited guarantees and ongoing costs, while prof... |
| AI-NativeBench: An Open-Source White-Box Agentic Benchmark Suite for AI-Native Systems | Zirui Wang, Guangba Yu, Michael R. Lyu | 2026-01-14 | Download | The transition from Cloud-Native to AI-Native architectures is fundamentally reshaping software engineering, replacing deterministic microservices with probabilistic agentic services. |