
2026-01-21

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A Hybrid Residue Floating Numerical Architecture with Formal Error Bounds for High Throughput FPGA Computation | Mostafa Darvishi | 2026-01-21 | Download | Floating point arithmetic is costly on FPGA platforms due to wide datapaths, normalization, and carry propagation, motivating alternative numerical representations that improve throughput and efficien... |
| Pipeline Automation Framework for Reusable High-throughput Network Applications on FPGA | Jean Bruant, Pierre-Henri Horrein, Olivier Muller, Frédéric Pétrot | 2026-01-21 | Download | In a context of ever-growing worldwide communication traffic, cloud service providers aim at deploying scalable infrastructures to address heterogeneous needs. |
| SynPerf: A Hybrid Analytical-ML Framework for GPU Performance Prediction | Kaixuan Zhang, Yunfan Cui, Shuhao Zhang, Chutong Ding, Shiyou Qian, Luping Wang, Jian Cao, Guangtao Xue, Cheng Huang, Guodong Yang, Liping Zhang | 2026-01-21 | Download | The rapid expansion of Transformer-based large language models has dramatically increased the need for high-performance GPUs. As a result, there is growing demand for fast, accurate, and widely genera... |
| Analog-to-Stochastic Converter Using Magnetic Tunnel Junction Devices for Vision Chips | Naoya Onizawa, Daisaku Katagiri, Warren J. Gross, Takahiro Hanyu | 2026-01-21 | Download | This paper introduces an analog-to-stochastic converter using a magnetic tunnel junction (MTJ) device for vision chips based on stochastic computation. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Securing LLM-as-a-Service for Small Businesses: An Industry Case Study of a Distributed Chatbot Deployment Platform | Jiazhu Xie, Bowen Li, Heyu Fu, Chong Gao, Ziqi Xu, Fengling Han | 2026-01-21 | Download | Large Language Model (LLM)-based question-answering systems offer significant potential for automating customer support and internal knowledge access in small businesses, yet their practical deploymen... |
| Predictor-Free and Hardware-Aware Federated Neural Architecture Search via Pareto-Guided Supernet Training | Bostan Khan, Masoud Daneshtalab | 2026-01-21 | Download | Federated Neural Architecture Search (FedNAS) aims to automate model design for privacy-preserving Federated Learning (FL) but currently faces two critical bottlenecks: unguided supernet training that... |
| RadixMLP -- Intra-batch Deduplication for Causal Transformers | Michael Feil, Julius Lipp | 2026-01-21 | Download | Batch inference workloads for causal transformer models frequently process sequences that share common prefixes, such as system prompts, few-shot examples, or shared queries. |
| Parallel Collaborative ADMM Privacy Computing and Adaptive GPU Acceleration for Distributed Edge Networks | Mengchun Xia, Zhicheng Dong, Donghong Cai, Fang Fang, Lisheng Fan, Pingzhi Fan | 2026-01-21 | Download | Distributed computing has been widely applied in distributed edge networks for reducing the processing burden of high-dimensional data centralization, where a high-dimensional computational task is de... |
| Application-level observability for adaptive Edge to Cloud continuum systems | Kaddour Sidi, Daniel Balouek, Baptiste Jonglez | 2026-01-21 | Download | Modern Edge-to-Cloud (E2C) systems require fine-grained observability to ensure adaptive behavior and compliance with performance objectives across heterogeneous and dynamic environments. |
| AlertGuardian: Intelligent Alert Life-Cycle Management for Large-scale Cloud Systems | Guangba Yu, Genting Mai, Rui Wang, Ruipeng Li, Pengfei Chen, Long Pan, Ruijie Xu | 2026-01-21 | Download | Alerts are critical for detecting anomalies in large-scale cloud systems, ensuring reliability and user experience. However, current systems generate overwhelming volumes of alerts, degrading operatio... |
| Optimizing FaaS Platforms for MCP-enabled Agentic Workflows | Varad Kulkarni, Vaibhav Jha, Nikhil Reddy, Anand Eswaran, Praveen Jayachandran, Yogesh Simmhan | 2026-01-21 | Download | Agentic workflows that use autonomous AI Agents powered by Large Language Models (LLMs) and Model Context Protocol (MCP) servers is rapidly rising. |
| On Distributed Quantum Computing with Distributed Fan-Out Operations | Seng W. Loke | 2026-01-21 | Download | We compare different circuits implementing distributed versions of quantum computations, using entangled pairs only, and using distributed fan-out operations (using GHZ states). |
| Beyond Denial-of-Service: The Puppeteer's Attack for Fine-Grained Control in Ranking-Based Federated Learning | Zhihao Chen, Zirui Gong, Jianting Ning, Yanjun Zhang, Leo Yu Zhang | 2026-01-21 | Download | Federated Rank Learning (FRL) is a promising Federated Learning (FL) paradigm designed to be resilient against model poisoning attacks due to its discrete, ranking-based update mechanism. |
| Specifying and Verifying RDMA Synchronisation (Extended Version) | Guillaume Ambal, Max Stupple, Brijesh Dongol, Azalea Raad | 2026-01-21 | Download | Remote direct memory access (RDMA) allows a machine to directly read from and write to the memory of remote machine, enabling high-throughput, low-latency data transfer. |
| Exploiting Spot Instances for Time-Critical Cloud Workloads Using Optimal Randomized Strategies | Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman | 2026-01-21 | Download | This paper addresses the challenge of deadline-aware online scheduling for jobs in hybrid cloud environments, where jobs may run on either cost-effective but unreliable spot instances or more expensiv... |
| Exploring Performance-Productivity Trade-offs in AMT Runtimes: A Task Bench Study of Itoyori, ItoyoriFBC, HPX, and MPI | Torben R. Lahnor, Mia Reitz, Jonas Posner, Patrick Diehl | 2026-01-21 | Download | Asynchronous Many-Task (AMT) runtimes offer a productive alternative to the Message Passing Interface (MPI). However, the diverse AMT landscape makes fair comparisons challenging. |
| Agent Identity URI Scheme: Topology-Independent Naming and Capability-Based Discovery for Multi-Agent Systems | Roland R. Rodriguez | 2026-01-21 | Download | Multi-agent systems face a fundamental architectural flaw: agent identity is bound to network location. When agents migrate between providers, scale across instances, or federate across organizations,... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Resource Allocation and Sharing for UAV-Assisted Integrated TN-NTN with Multi-Connectivity | Abd Ullah Khan, Wali Ullah Khan, Haejoon Jung, Hyundong Shin | 2026-01-21 | Download | Unmanned aerial vehicles (UAVs) with multi-connectivity (MC) capabilities efficiently and reliably transfer data between terrestrial networks (TNs) and non-terrestrial networks (NTNs). |
| Economic feasibility of virtual operators in 5G via network slicing | Erwin J. Sacoto-Cabrera, Luis Guijarro, Jose R. Vidal, Vicent Pla | 2026-01-21 | Download | The provision of services by more than one operator over a common network infrastructure, as enabled by 5G network slicing, is analyzed. Two business models to be implemented by a network operator, wh... |
| 5G NR Non-Terrestrial Networks: Open Challenges for Full-Stack Protocol Design | Francesco Rossato, Mattia Figaro, Alessandro Traspadini, Takayuki Shimizu, Chinmay Mahabal, Sanjeewa Herath, Chunghan Lee, Dogan Kutay Pekcan, Michele Zorzi, Marco Giordani | 2026-01-21 | Download | As 5th generation (5G) networks continue to evolve, there is a growing interest toward the integration of Terrestrial Networks (TNs) and Non-Terrestrial Networks (NTNs). |
| Exploiting Spot Instances for Time-Critical Cloud Workloads Using Optimal Randomized Strategies | Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman | 2026-01-21 | Download | This paper addresses the challenge of deadline-aware online scheduling for jobs in hybrid cloud environments, where jobs may run on either cost-effective but unreliable spot instances or more expensiv... |
| Holmes: An Evidence-Grounded LLM Agent for Auditable DDoS Investigation in Cloud Networks | Haodong Chen, Ziheng Zhang, Jinghui Jiang, Qiang Su, Qiao Xiang | 2026-01-21 | Download | Cloud environments face frequent DDoS threats due to centralized resources and broad attack surfaces. Modern cloud-native DDoS attacks further evolve rapidly and often blend multi-vector strategies, c... |
| Close-enough general routing problem for multiple unmanned aerial vehicles in monitoring missions | Huan Liu, Michel Gendreau, Binjie Xu, Guohua Wu, Yi Gu | 2026-01-21 | Download | In this paper, we introduce a close-enough multi-UAV general routing problem (CEMUAVGRP) where a fleet of homogeneous UAVs conduct monitoring tasks containing nodes, each of which has its disk neighbo... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| DeLog: An Efficient Log Compression Framework with Pattern Signature Synthesis | Siyu Yu, Yifan Wu, Junjielong Xu, Ying Fu, Ning Wang, Maoyin Liu, Pancheng Jiang, Xiang Zhang, Tong Jia, Pinjia He, Ying Li | 2026-01-21 | Download | Parser-based log compression, which separates static templates from dynamic variables, is a promising approach to exploit the unique structure of log data. |
| WebAssembly Based Portable and Secure Sensor Interface for Internet of Things | Botong Ou, Baijian Yang | 2026-01-21 | Download | As the expansion of IoT connectivity continues to provide quality-of-life improvements around the world, they simultaneously introduce increasing privacy and security concerns. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Reliability by design: quantifying and eliminating fabrication risk in LLMs. From generative to consultative AI: a comparative analysis in the legal domain and lessons for high-stakes knowledge bases | Alex Dantart | 2026-01-21 | Download | This paper examines how to make large language models reliable for high-stakes legal work by reducing hallucinations. It distinguishes three AI paradigms: (1) standalone generative models ("creative o... |
| SynPerf: A Hybrid Analytical-ML Framework for GPU Performance Prediction | Kaixuan Zhang, Yunfan Cui, Shuhao Zhang, Chutong Ding, Shiyou Qian, Luping Wang, Jian Cao, Guangtao Xue, Cheng Huang, Guodong Yang, Liping Zhang | 2026-01-21 | Download | The rapid expansion of Transformer-based large language models has dramatically increased the need for high-performance GPUs. As a result, there is growing demand for fast, accurate, and widely genera... |
| Exploiting Spot Instances for Time-Critical Cloud Workloads Using Optimal Randomized Strategies | Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman | 2026-01-21 | Download | This paper addresses the challenge of deadline-aware online scheduling for jobs in hybrid cloud environments, where jobs may run on either cost-effective but unreliable spot instances or more expensiv... |
