Appearance
2026-01-21
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Hybrid Residue Floating Numerical Architecture with Formal Error Bounds for High Throughput FPGA Computation | Mostafa Darvishi | 2026-01-21 | 下载 | Floating point arithmetic is costly on FPGA platforms due to wide datapaths, normalization, and carry propagation, motivating alternative numerical representations that improve throughput and efficien... |
| Pipeline Automation Framework for Reusable High-throughput Network Applications on FPGA | Jean Bruant, Pierre-Henri Horrein, Olivier Muller, Frédéric Pétrot | 2026-01-21 | 下载 | In a context of ever-growing worldwide communication traffic, cloud service providers aim at deploying scalable infrastructures to address heterogeneous needs. |
| SynPerf: A Hybrid Analytical-ML Framework for GPU Performance Prediction | Kaixuan Zhang, Yunfan Cui, Shuhao Zhang, Chutong Ding, Shiyou Qian, Luping Wang, Jian Cao, Guangtao Xue, Cheng Huang, Guodong Yang, Liping Zhang | 2026-01-21 | 下载 | The rapid expansion of Transformer-based large language models has dramatically increased the need for high-performance GPUs. As a result, there is growing demand for fast, accurate, and widely genera... |
| Analog-to-Stochastic Converter Using Magnetic Tunnel Junction Devices for Vision Chips | Naoya Onizawa, Daisaku Katagiri, Warren J. Gross, Takahiro Hanyu | 2026-01-21 | 下载 | This paper introduces an analog-to-stochastic converter using a magnetic tunnel junction (MTJ) device for vision chips based on stochastic computation. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Securing LLM-as-a-Service for Small Businesses: An Industry Case Study of a Distributed Chatbot Deployment Platform | Jiazhu Xie, Bowen Li, Heyu Fu, Chong Gao, Ziqi Xu, Fengling Han | 2026-01-21 | 下载 | Large Language Model (LLM)-based question-answering systems offer significant potential for automating customer support and internal knowledge access in small businesses, yet their practical deploymen... |
| Predictor-Free and Hardware-Aware Federated Neural Architecture Search via Pareto-Guided Supernet Training | Bostan Khan, Masoud Daneshtalab | 2026-01-21 | 下载 | Federated Neural Architecture Search (FedNAS) aims to automate model design for privacy-preserving Federated Learning (FL) but currently faces two critical bottlenecks: unguided supernet training that... |
| RadixMLP -- Intra-batch Deduplication for Causal Transformers | Michael Feil, Julius Lipp | 2026-01-21 | 下载 | Batch inference workloads for causal transformer models frequently process sequences that share common prefixes, such as system prompts, few-shot examples, or shared queries. |
| Parallel Collaborative ADMM Privacy Computing and Adaptive GPU Acceleration for Distributed Edge Networks | Mengchun Xia, Zhicheng Dong, Donghong Cai, Fang Fang, Lisheng Fan, Pingzhi Fan | 2026-01-21 | 下载 | Distributed computing has been widely applied in distributed edge networks for reducing the processing burden of high-dimensional data centralization, where a high-dimensional computational task is de... |
| Application-level observability for adaptive Edge to Cloud continuum systems | Kaddour Sidi, Daniel Balouek, Baptiste Jonglez | 2026-01-21 | 下载 | Modern Edge-to-Cloud (E2C) systems require fine-grained observability to ensure adaptive behavior and compliance with performance objectives across heterogeneous and dynamic environments. |
| AlertGuardian: Intelligent Alert Life-Cycle Management for Large-scale Cloud Systems | Guangba Yu, Genting Mai, Rui Wang, Ruipeng Li, Pengfei Chen, Long Pan, Ruijie Xu | 2026-01-21 | 下载 | Alerts are critical for detecting anomalies in large-scale cloud systems, ensuring reliability and user experience. However, current systems generate overwhelming volumes of alerts, degrading operatio... |
| Optimizing FaaS Platforms for MCP-enabled Agentic Workflows | Varad Kulkarni, Vaibhav Jha, Nikhil Reddy, Anand Eswaran, Praveen Jayachandran, Yogesh Simmhan | 2026-01-21 | 下载 | Agentic workflows that use autonomous AI Agents powered by Large Language Models (LLMs) and Model Context Protocol (MCP) servers is rapidly rising. |
| On Distributed Quantum Computing with Distributed Fan-Out Operations | Seng W. Loke | 2026-01-21 | 下载 | We compare different circuits implementing distributed versions of quantum computations, using entangled pairs only, and using distributed fan-out operations (using GHZ states). |
| Beyond Denial-of-Service: The Puppeteer's Attack for Fine-Grained Control in Ranking-Based Federated Learning | Zhihao Chen, Zirui Gong, Jianting Ning, Yanjun Zhang, Leo Yu Zhang | 2026-01-21 | 下载 | Federated Rank Learning (FRL) is a promising Federated Learning (FL) paradigm designed to be resilient against model poisoning attacks due to its discrete, ranking-based update mechanism. |
| Specifying and Verifying RDMA Synchronisation (Extended Version) | Guillaume Ambal, Max Stupple, Brijesh Dongol, Azalea Raad | 2026-01-21 | 下载 | Remote direct memory access (RDMA) allows a machine to directly read from and write to the memory of remote machine, enabling high-throughput, low-latency data transfer. |
| Exploiting Spot Instances for Time-Critical Cloud Workloads Using Optimal Randomized Strategies | Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman | 2026-01-21 | 下载 | This paper addresses the challenge of deadline-aware online scheduling for jobs in hybrid cloud environments, where jobs may run on either cost-effective but unreliable spot instances or more expensiv... |
| Exploring Performance-Productivity Trade-offs in AMT Runtimes: A Task Bench Study of Itoyori, ItoyoriFBC, HPX, and MPI | Torben R. Lahnor, Mia Reitz, Jonas Posner, Patrick Diehl | 2026-01-21 | 下载 | Asynchronous Many-Task (AMT) runtimes offer a productive alternative to the Message Passing Interface (MPI). However, the diverse AMT landscape makes fair comparisons challenging. |
| Agent Identity URI Scheme: Topology-Independent Naming and Capability-Based Discovery for Multi-Agent Systems | Roland R. Rodriguez | 2026-01-21 | 下载 | Multi-agent systems face a fundamental architectural flaw: agent identity is bound to network location. When agents migrate between providers, scale across instances, or federate across organizations,... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Resource Allocation and Sharing for UAV-Assisted Integrated TN-NTN with Multi-Connectivity | Abd Ullah Khan, Wali Ullah Khan, Haejoon Jung, Hyundong Shin | 2026-01-21 | 下载 | Unmanned aerial vehicles (UAVs) with multi-connectivity (MC) capabilities efficiently and reliably transfer data between terrestrial networks (TNs) and non-terrestrial networks (NTNs). |
| Economic feasibility of virtual operators in 5G via network slicing | Erwin J. Sacoto-Cabrera, Luis Guijarro, Jose R. Vidal, Vicent Pla | 2026-01-21 | 下载 | The provision of services by more than one operator over a common network infrastructure, as enabled by 5G network slicing, is analyzed. Two business models to be implemented by a network operator, wh... |
| 5G NR Non-Terrestrial Networks: Open Challenges for Full-Stack Protocol Design | Francesco Rossato, Mattia Figaro, Alessandro Traspadini, Takayuki Shimizu, Chinmay Mahabal, Sanjeewa Herath, Chunghan Lee, Dogan Kutay Pekcan, Michele Zorzi, Marco Giordani | 2026-01-21 | 下载 | As 5th generation (5G) networks continue to evolve, there is a growing interest toward the integration of Terrestrial Networks (TNs) and Non-Terrestrial Networks (NTNs). |
| Exploiting Spot Instances for Time-Critical Cloud Workloads Using Optimal Randomized Strategies | Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman | 2026-01-21 | 下载 | This paper addresses the challenge of deadline-aware online scheduling for jobs in hybrid cloud environments, where jobs may run on either cost-effective but unreliable spot instances or more expensiv... |
| Holmes: An Evidence-Grounded LLM Agent for Auditable DDoS Investigation in Cloud Networks | Haodong Chen, Ziheng Zhang, Jinghui Jiang, Qiang Su, Qiao Xiang | 2026-01-21 | 下载 | Cloud environments face frequent DDoS threats due to centralized resources and broad attack surfaces. Modern cloud-native DDoS attacks further evolve rapidly and often blend multi-vector strategies, c... |
| Close-enough general routing problem for multiple unmanned aerial vehicles in monitoring missions | Huan Liu, Michel Gendreau, Binjie Xu, Guohua Wu, Yi Gu | 2026-01-21 | 下载 | In this paper, we introduce a close-enough multi-UAV general routing problem (CEMUAVGRP) where a fleet of homogeneous UAVs conduct monitoring tasks containing nodes, each of which has its disk neighbo... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| DeLog: An Efficient Log Compression Framework with Pattern Signature Synthesis | Siyu Yu, Yifan Wu, Junjielong Xu, Ying Fu, Ning Wang, Maoyin Liu, Pancheng Jiang, Xiang Zhang, Tong Jia, Pinjia He, Ying Li | 2026-01-21 | 下载 | Parser-based log compression, which separates static templates from dynamic variables, is a promising approach to exploit the unique structure of log data. |
| WebAssembly Based Portable and Secure Sensor Interface for Internet of Things | Botong Ou, Baijian Yang | 2026-01-21 | 下载 | As the expansion of IoT connectivity continues to provide quality-of-life improvements around the world, they simultaneously introduce increasing privacy and security concerns. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Reliability by design: quantifying and eliminating fabrication risk in LLMs. From generative to consultative AI: a comparative analysis in the legal domain and lessons for high-stakes knowledge bases | Alex Dantart | 2026-01-21 | 下载 | This paper examines how to make large language models reliable for high-stakes legal work by reducing hallucinations. It distinguishes three AI paradigms: (1) standalone generative models ("creative o... |
| SynPerf: A Hybrid Analytical-ML Framework for GPU Performance Prediction | Kaixuan Zhang, Yunfan Cui, Shuhao Zhang, Chutong Ding, Shiyou Qian, Luping Wang, Jian Cao, Guangtao Xue, Cheng Huang, Guodong Yang, Liping Zhang | 2026-01-21 | 下载 | The rapid expansion of Transformer-based large language models has dramatically increased the need for high-performance GPUs. As a result, there is growing demand for fast, accurate, and widely genera... |
| Exploiting Spot Instances for Time-Critical Cloud Workloads Using Optimal Randomized Strategies | Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman | 2026-01-21 | 下载 | This paper addresses the challenge of deadline-aware online scheduling for jobs in hybrid cloud environments, where jobs may run on either cost-effective but unreliable spot instances or more expensiv... |