2026-01-21

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
A Hybrid Residue Floating Numerical Architecture with Formal Error Bounds for High Throughput FPGA Computation	Mostafa Darvishi	2026-01-21	下载	Floating point arithmetic is costly on FPGA platforms due to wide datapaths, normalization, and carry propagation, motivating alternative numerical representations that improve throughput and efficien...
Pipeline Automation Framework for Reusable High-throughput Network Applications on FPGA	Jean Bruant, Pierre-Henri Horrein, Olivier Muller, Frédéric Pétrot	2026-01-21	下载	In a context of ever-growing worldwide communication traffic, cloud service providers aim at deploying scalable infrastructures to address heterogeneous needs.
SynPerf: A Hybrid Analytical-ML Framework for GPU Performance Prediction	Kaixuan Zhang, Yunfan Cui, Shuhao Zhang, Chutong Ding, Shiyou Qian, Luping Wang, Jian Cao, Guangtao Xue, Cheng Huang, Guodong Yang, Liping Zhang	2026-01-21	下载	The rapid expansion of Transformer-based large language models has dramatically increased the need for high-performance GPUs. As a result, there is growing demand for fast, accurate, and widely genera...
Analog-to-Stochastic Converter Using Magnetic Tunnel Junction Devices for Vision Chips	Naoya Onizawa, Daisaku Katagiri, Warren J. Gross, Takahiro Hanyu	2026-01-21	下载	This paper introduces an analog-to-stochastic converter using a magnetic tunnel junction (MTJ) device for vision chips based on stochastic computation.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Securing LLM-as-a-Service for Small Businesses: An Industry Case Study of a Distributed Chatbot Deployment Platform	Jiazhu Xie, Bowen Li, Heyu Fu, Chong Gao, Ziqi Xu, Fengling Han	2026-01-21	下载	Large Language Model (LLM)-based question-answering systems offer significant potential for automating customer support and internal knowledge access in small businesses, yet their practical deploymen...
Predictor-Free and Hardware-Aware Federated Neural Architecture Search via Pareto-Guided Supernet Training	Bostan Khan, Masoud Daneshtalab	2026-01-21	下载	Federated Neural Architecture Search (FedNAS) aims to automate model design for privacy-preserving Federated Learning (FL) but currently faces two critical bottlenecks: unguided supernet training that...
RadixMLP -- Intra-batch Deduplication for Causal Transformers	Michael Feil, Julius Lipp	2026-01-21	下载	Batch inference workloads for causal transformer models frequently process sequences that share common prefixes, such as system prompts, few-shot examples, or shared queries.
Parallel Collaborative ADMM Privacy Computing and Adaptive GPU Acceleration for Distributed Edge Networks	Mengchun Xia, Zhicheng Dong, Donghong Cai, Fang Fang, Lisheng Fan, Pingzhi Fan	2026-01-21	下载	Distributed computing has been widely applied in distributed edge networks for reducing the processing burden of high-dimensional data centralization, where a high-dimensional computational task is de...
Application-level observability for adaptive Edge to Cloud continuum systems	Kaddour Sidi, Daniel Balouek, Baptiste Jonglez	2026-01-21	下载	Modern Edge-to-Cloud (E2C) systems require fine-grained observability to ensure adaptive behavior and compliance with performance objectives across heterogeneous and dynamic environments.
AlertGuardian: Intelligent Alert Life-Cycle Management for Large-scale Cloud Systems	Guangba Yu, Genting Mai, Rui Wang, Ruipeng Li, Pengfei Chen, Long Pan, Ruijie Xu	2026-01-21	下载	Alerts are critical for detecting anomalies in large-scale cloud systems, ensuring reliability and user experience. However, current systems generate overwhelming volumes of alerts, degrading operatio...
Optimizing FaaS Platforms for MCP-enabled Agentic Workflows	Varad Kulkarni, Vaibhav Jha, Nikhil Reddy, Anand Eswaran, Praveen Jayachandran, Yogesh Simmhan	2026-01-21	下载	Agentic workflows that use autonomous AI Agents powered by Large Language Models (LLMs) and Model Context Protocol (MCP) servers is rapidly rising.
On Distributed Quantum Computing with Distributed Fan-Out Operations	Seng W. Loke	2026-01-21	下载	We compare different circuits implementing distributed versions of quantum computations, using entangled pairs only, and using distributed fan-out operations (using GHZ states).
Beyond Denial-of-Service: The Puppeteer's Attack for Fine-Grained Control in Ranking-Based Federated Learning	Zhihao Chen, Zirui Gong, Jianting Ning, Yanjun Zhang, Leo Yu Zhang	2026-01-21	下载	Federated Rank Learning (FRL) is a promising Federated Learning (FL) paradigm designed to be resilient against model poisoning attacks due to its discrete, ranking-based update mechanism.
Specifying and Verifying RDMA Synchronisation (Extended Version)	Guillaume Ambal, Max Stupple, Brijesh Dongol, Azalea Raad	2026-01-21	下载	Remote direct memory access (RDMA) allows a machine to directly read from and write to the memory of remote machine, enabling high-throughput, low-latency data transfer.
Exploiting Spot Instances for Time-Critical Cloud Workloads Using Optimal Randomized Strategies	Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman	2026-01-21	下载	This paper addresses the challenge of deadline-aware online scheduling for jobs in hybrid cloud environments, where jobs may run on either cost-effective but unreliable spot instances or more expensiv...
Exploring Performance-Productivity Trade-offs in AMT Runtimes: A Task Bench Study of Itoyori, ItoyoriFBC, HPX, and MPI	Torben R. Lahnor, Mia Reitz, Jonas Posner, Patrick Diehl	2026-01-21	下载	Asynchronous Many-Task (AMT) runtimes offer a productive alternative to the Message Passing Interface (MPI). However, the diverse AMT landscape makes fair comparisons challenging.
Agent Identity URI Scheme: Topology-Independent Naming and Capability-Based Discovery for Multi-Agent Systems	Roland R. Rodriguez	2026-01-21	下载	Multi-agent systems face a fundamental architectural flaw: agent identity is bound to network location. When agents migrate between providers, scale across instances, or federate across organizations,...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Resource Allocation and Sharing for UAV-Assisted Integrated TN-NTN with Multi-Connectivity	Abd Ullah Khan, Wali Ullah Khan, Haejoon Jung, Hyundong Shin	2026-01-21	下载	Unmanned aerial vehicles (UAVs) with multi-connectivity (MC) capabilities efficiently and reliably transfer data between terrestrial networks (TNs) and non-terrestrial networks (NTNs).
Economic feasibility of virtual operators in 5G via network slicing	Erwin J. Sacoto-Cabrera, Luis Guijarro, Jose R. Vidal, Vicent Pla	2026-01-21	下载	The provision of services by more than one operator over a common network infrastructure, as enabled by 5G network slicing, is analyzed. Two business models to be implemented by a network operator, wh...
5G NR Non-Terrestrial Networks: Open Challenges for Full-Stack Protocol Design	Francesco Rossato, Mattia Figaro, Alessandro Traspadini, Takayuki Shimizu, Chinmay Mahabal, Sanjeewa Herath, Chunghan Lee, Dogan Kutay Pekcan, Michele Zorzi, Marco Giordani	2026-01-21	下载	As 5th generation (5G) networks continue to evolve, there is a growing interest toward the integration of Terrestrial Networks (TNs) and Non-Terrestrial Networks (NTNs).
Exploiting Spot Instances for Time-Critical Cloud Workloads Using Optimal Randomized Strategies	Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman	2026-01-21	下载	This paper addresses the challenge of deadline-aware online scheduling for jobs in hybrid cloud environments, where jobs may run on either cost-effective but unreliable spot instances or more expensiv...
Holmes: An Evidence-Grounded LLM Agent for Auditable DDoS Investigation in Cloud Networks	Haodong Chen, Ziheng Zhang, Jinghui Jiang, Qiang Su, Qiao Xiang	2026-01-21	下载	Cloud environments face frequent DDoS threats due to centralized resources and broad attack surfaces. Modern cloud-native DDoS attacks further evolve rapidly and often blend multi-vector strategies, c...
Close-enough general routing problem for multiple unmanned aerial vehicles in monitoring missions	Huan Liu, Michel Gendreau, Binjie Xu, Guohua Wu, Yi Gu	2026-01-21	下载	In this paper, we introduce a close-enough multi-UAV general routing problem (CEMUAVGRP) where a fleet of homogeneous UAVs conduct monitoring tasks containing nodes, each of which has its disk neighbo...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
DeLog: An Efficient Log Compression Framework with Pattern Signature Synthesis	Siyu Yu, Yifan Wu, Junjielong Xu, Ying Fu, Ning Wang, Maoyin Liu, Pancheng Jiang, Xiang Zhang, Tong Jia, Pinjia He, Ying Li	2026-01-21	下载	Parser-based log compression, which separates static templates from dynamic variables, is a promising approach to exploit the unique structure of log data.
WebAssembly Based Portable and Secure Sensor Interface for Internet of Things	Botong Ou, Baijian Yang	2026-01-21	下载	As the expansion of IoT connectivity continues to provide quality-of-life improvements around the world, they simultaneously introduce increasing privacy and security concerns.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Reliability by design: quantifying and eliminating fabrication risk in LLMs. From generative to consultative AI: a comparative analysis in the legal domain and lessons for high-stakes knowledge bases	Alex Dantart	2026-01-21	下载	This paper examines how to make large language models reliable for high-stakes legal work by reducing hallucinations. It distinguishes three AI paradigms: (1) standalone generative models ("creative o...
SynPerf: A Hybrid Analytical-ML Framework for GPU Performance Prediction	Kaixuan Zhang, Yunfan Cui, Shuhao Zhang, Chutong Ding, Shiyou Qian, Luping Wang, Jian Cao, Guangtao Xue, Cheng Huang, Guodong Yang, Liping Zhang	2026-01-21	下载	The rapid expansion of Transformer-based large language models has dramatically increased the need for high-performance GPUs. As a result, there is growing demand for fast, accurate, and widely genera...
Exploiting Spot Instances for Time-Critical Cloud Workloads Using Optimal Randomized Strategies	Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman	2026-01-21	下载	This paper addresses the challenge of deadline-aware online scheduling for jobs in hybrid cloud environments, where jobs may run on either cost-effective but unreliable spot instances or more expensiv...