2026-03-30

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| ARCS: Autoregressive Circuit Synthesis with Topology-Aware Graph Attention and Spec Conditioning | Tushar Dhananjay Pathak | 2026-03-30 | Download | This paper presents ARCS (Autoregressive Circuit Synthesis), a system for amortized analog circuit generation that produces complete, SPICE-simulatable designs (topology and component values) in milli... |
| Differentiable Initialization-Accelerated CPU-GPU Hybrid Combinatorial Scheduling | Mingju Liu, Jiaqi Yin, Alvaro Velasquez, Cunxi Yu | 2026-03-30 | Download | This paper presents a hybrid CPU-GPU framework for solving combinatorial scheduling problems formulated as Integer Linear Programming (ILP). While scheduling underpins many optimization tasks in compu... |
| Physical Design of UET-RVMCU: A Streamlined Open-Source RISC-V Microcontroller | Abdullah Azhar, Uneeb Kamal, Wajid Ali, Saad Gillani, Dr Suleman Sami Qazi | 2026-03-30 | Download | This paper presents the design and physical implementation of UET-RVMCU, a lightweight RISC-V microcontroller derived from the UETRV-PCore. Aimed at creating an accessible and flexible open-source RIS... |
| Loop Control Management in Tightly Coupled Processor Arrays (TCPAs) | Dominik Walter, Frank Hannig, Jürgen Teich | 2026-03-30 | Download | Multidimensional loop kernels often suffer from control overhead that can dominate execution time on parallel loop accelerators. Tightly Coupled Processor Arrays (TCPAs) offload loop control to a glob... |
| AceleradorSNN: A Neuromorphic Cognitive System Integrating Spiking Neural Networks and Dynamic Image Signal Processing on FPGA | Daniel Gutierrez, Ruben Martinez, Leyre Arnedo, Antonio Cuesta, Soukaina El Hamry | 2026-03-30 | Download | The demand for high-speed, low-latency, and energy-efficient object detection in autonomous systems -- such as advanced driver-assistance systems (ADAS), unmanned aerial vehicles (UAVs), and Industry ... |
| OptINC: Optical In-Network-Computing for Scalable Distributed Learning | Sijie Fei, Grace Li Zhang, Bing Li, Ulf Schlichtmann | 2026-03-30 | Download | Distributed learning is widely used for training large models on large datasets by distributing parts of the model or dataset across multiple devices and aggregating the computed results for subsequen... |
| A Switch-Centric In-Network Architecture for Accelerating LLM Inference in Shared-Memory Network | Aojie Jiang, Kang Zhu, Zhiheng Zhang, Zhengxu Su, Juntao Liu, Yuan Du, Li Du | 2026-03-30 | Download | In-network computing techniques, exemplified by NVLink Sharp (NVLS), offer a promising approach to addressing the communication bottlenecks in LLM inference by offloading collective operations, such a... |
| AXON: An Automated Netlist Optimization Framework for High-Speed Adders | Tiantian Yang, Xuanle Ren, Qingdian Wan, Qi Meng | 2026-03-30 | Download | Adders are fundamental building blocks in modern digital systems, and their performance, power, and area (PPA) directly impact system efficiency. |
| MCPT-Solver: An Monte Carlo Algorithm Solver Using MTJ Devices for Particle Transport Problems | Siqing Fu, Lizhou Wu, Tiejun Li, Xuchao Xie, Chunyuan Zhang, Sheng Ma, Jianmin Zhang, Yuhan Tang, Jixuan Tang | 2026-03-30 | Download | Monte Carlo particle transport problems play a vital role in scientific computing, but solving them on existing von Neumann architectures suffers from random branching and irregular memory access, caus... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| SteelDB: Diagnosing Kernel-Space Bottlenecks in Cloud OLTP Databases | Mitsumasa Kondo | 2026-03-30 | Download | Modern cloud OLTP databases have sought performance primarily through user-space optimization - separating storage and compute layers, or distributing transactions across multiple nodes using consensu... |
| Building the Palmetto API: Adding granular permissions and caching to the Slurm REST API without sacrificing compatibility | Ben Godfrey, Doug Dawson | 2026-03-30 | Download | The development of administrative and computational research tools requires reliable programmatic interfaces with the cluster scheduler. The Research Computing and Data (RCD) team at Clemson Universit... |
| Wherefore Art Thou? Provenance-Guided Automatic Online Debugging with Lumos | Jingyuan Chen, Lei Zhang, Leon Schuermann, Gongqi Huang, Ravi Netravali, Amit Levy | 2026-03-30 | Download | Debugging distributed systems in production is inevitable and hard. Myriad interactions between concurrent components in modern, complex and large-scale systems cause non-deterministic bugs that offli... |
| Understand and Accelerate Memory Processing Pipeline for Disaggregated LLM Inference | Zifan He, Rui Ma, Yizhou Sun, Jason Cong | 2026-03-30 | Download | Modern large language models (LLMs) increasingly depend on efficient long-context processing and generation mechanisms, including sparse attention, retrieval-augmented generation (RAG), and compresse... |
| BitSov: A Composable Bitcoin-Native Architecture for Sovereign Internet Infrastructure | Oliver Aleksander Larsen, Rasmus Thorsen Larsen, Mahyar T. Moghaddam | 2026-03-30 | Download | Today's internet concentrates identity, payments, communication, and content hosting under a small number of corporate intermediaries, creating single points of failure, enabling censorship, and extra... |
| GPU-Accelerated Optimization of Transformer-Based Neural Networks for Real-Time Inference | Soutrik Mukherjee, Sangwhan Cha | 2026-03-30 | Download | This paper presents the design and evaluation of a GPU-accelerated inference pipeline for transformer models using NVIDIA TensorRT with mixed-precision optimization. |
| FL-PBM: Pre-Training Backdoor Mitigation for Federated Learning | Osama Wehbi, Sarhad Arisdakessian, Omar Abdel Wahab, Azzam Mourad, Hadi Otrok, Jamal Bentahar | 2026-03-30 | Download | Backdoor attacks pose a significant threat to the integrity and reliability of Artificial Intelligence (AI) models, enabling adversaries to manipulate model behavior by injecting poisoned data with hi... |
| Mitigating Backdoor Attacks in Federated Learning Using PPA and MiniMax Game Theory | Osama Wehbi, Sarhad Arisdakessian, Omar Abdel Wahab, Anderson Avila, Azzam Mourad, Hadi Otrok | 2026-03-30 | Download | Federated Learning (FL) is witnessing wider adoption due to its ability to benefit from large amounts of scattered data while preserving privacy. |
| Sublogarithmic Distributed Vertex Coloring with Optimal Number of Colors | Maxime Flin, Magnús M. Halldórsson, Manuel Jakob, Yannic Maus | 2026-03-30 | Download | For any Δ, let k_Δ be the maximum integer k such that (k+1)(k+2) ≤ Δ. We give a distributed LOCAL algorithm that, given an integer k < k_Δ, computes a valid (Δ-k)-coloring if one exists. |
| Trust-Aware Routing for Distributed Generative AI Inference at the Edge | Chanh Nguyen, Erik Elmroth | 2026-03-30 | Download | Emerging deployments of Generative AI increasingly execute inference across decentralized and heterogeneous edge devices rather than on a single trusted server. |
| FeDMRA: Federated Incremental Learning with Dynamic Memory Replay Allocation | Tiantian Wang, Xiang Xiang, Simon S. Du | 2026-03-30 | Download | In federated healthcare systems, Federated Class-Incremental Learning (FCIL) has emerged as a key paradigm, enabling continuous adaptive model learning among distributed clients while safeguarding dat... |
| Warp-STAR: High-performance, Differentiable GPU-Accelerated Static Timing Analysis through Warp-oriented Parallel Orchestration | En-Ming Huang, Shih-Hao Hung | 2026-03-30 | Download | Static timing analysis (STA) is crucial for Electronic Design Automation (EDA) flows but remains a computational bottleneck. While existing GPU-based STA engines are faster than CPU, they suffer from ... |
| Key-Embedded Privacy for Decentralized AI in Biomedical Omics | Rongyu Zhang, Hongyu Dong, Gaole Dai, Ziqi Qiao, Shenli Zheng, Yuan Zhang, Aosong Cheng, Xiaowei Chi, Jincai Luo, Pin Li, Li Du, Dan Wang, Yuan Du, Xudong Xing, Jianxu Chen, Shanghang Zhang | 2026-03-30 | Download | The rapid adoption of data-driven methods in biomedicine has intensified concerns over privacy, governance, and regulation, limiting raw data sharing and hindering the assembly of representative cohor... |
| Pre-Deployment Complexity Estimation for Federated Perception Systems | KMA Solaiman, Shafkat Islam, Ruy de Oliveira, Bharat Bhargava | 2026-03-30 | Download | Edge AI systems increasingly rely on federated learning to train perception models in distributed, privacy-preserving, and resource-constrained environments. |
| Efficient Counting and Simulation in Content-Oblivious Rings | Jérémie Chalopin, Yi-Jun Chang, Giuseppe Antonio Di Luna, Haoran Zhou | 2026-03-30 | Download | In the content-oblivious (CO) model (proposed by Censor-Hillel et al.), processes inhabit an asynchronous network and communicate only by exchanging pulses. |
| Varuna: Enabling Failure-Type Aware RDMA Failover | Xiaoyang Wang, Yongkun Li, Lulu Yao, Guoli Wei, Longcheng Yang, Yinlong Xu, Weiqing Kong, Weiguang Wang, Peng Dong, Bingyang Liu | 2026-03-30 | Download | RDMA link failures can render connections temporarily unavailable, causing both performance degradation and significant recovery overhead. To tolerate such failures, production datacenters assign each... |
| YUHENG-OS: A Cloud-Native Space Cluster Operating System | Jin Zhang, Jiachen Sun, Kai Liu, Linling Kuang, Jianhua Lu | 2026-03-30 | Download | As industry and academia continue to advance spaceborne computing and communication capabilities, the formation of cloud-native space clusters (CNSCs) has become an increasingly evident trend. |
| ITQ3_S: High-Fidelity 3-bit LLM Inference via Interleaved Ternary Quantization with Rotation-Domain Smoothing | Edward J. Yoon | 2026-03-30 | Download | We present ITQ3_S (Interleaved Ternary Quantization -- Specialized), a novel 3-bit weight quantization format for LLMs integrating TurboQuant (TQ), a rotation-domain strategy based on the Fast Walsh-H... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Embeddings of Nation-Level Social Networks | Tanzir Pial, Flavio Hafner, Dakota Handzlik, Enamul Hassan, Lucas Sage, Ana Macanovic, Tom Emery, Arnout van de Rijt, Steven Skiena | 2026-03-30 | Download | Full nation-scale social networks are now emerging from countries such as the Netherlands and Denmark, but these networks present challenging technical issues in working with large, multiplex, time-de... |
| Iran's January 2026 Internet Shutdown: Public Data, Censorship Methods, and Circumvention Techniques | Giuseppe Aceto, Valerio Persico, Antonio Pescapè | 2026-03-30 | Download | This paper analyzes the Internet shutdown that occurred in Iran in January 2026 in the context of protests, focusing on its impact on the country's digital communication infrastructure and on informat... |
| Study of Post Quantum status of Widely Used Protocols | Tushin Mallick, Ashish Kundu, Ramana Kompella | 2026-03-30 | Download | The advent of quantum computing poses significant threats to classical public-key cryptographic primitives such as RSA and elliptic-curve cryptography. |
| BitSov: A Composable Bitcoin-Native Architecture for Sovereign Internet Infrastructure | Oliver Aleksander Larsen, Rasmus Thorsen Larsen, Mahyar T. Moghaddam | 2026-03-30 | Download | Today's internet concentrates identity, payments, communication, and content hosting under a small number of corporate intermediaries, creating single points of failure, enabling censorship, and extra... |
| A Techno-Economic Framework for Cost Modeling and Revenue Opportunities in Open and Programmable AI-RAN | Gabriele Gemmi, Michele Polese, Tommaso Melodia | 2026-03-30 | Download | The large-scale deployment of 5G networks has not delivered the expected return on investment for mobile network operators, raising concerns about the economic viability of future 6G rollouts. |
| How Many Qubits Can Be Teleported? Scalability of Fidelity-Constrained Quantum Applications | Oscar Adamuz-Hinojosa, Jonathan Prados-Garzon, Sara Vaquero-Gil, Juan M. Lopez-Soler | 2026-03-30 | Download | Quantum networks (QNs) enable the transfer of qubits between distant nodes using quantum teleportation, which reproduces a qubit state at a remote location by consuming a shared Bell pair. |
| Performance Analysis of 5G RAN Slicing Deployment Options in Industry 4.0 Factories | Oscar Adamuz-Hinojosa, Abdelhilah Abdeselam, Pablo Muñoz, Pablo Ameigeiras, Juan M. Lopez-Soler | 2026-03-30 | Download | This paper studies Radio Access Network (RAN) slicing strategies for 5G Industry 4.0 networks with ultra-reliable low-latency communication (uRLLC) requirements. |
| Trust-Aware Routing for Distributed Generative AI Inference at the Edge | Chanh Nguyen, Erik Elmroth | 2026-03-30 | Download | Emerging deployments of Generative AI increasingly execute inference across decentralized and heterogeneous edge devices rather than on a single trusted server. |
| Shy Guys: A Light-Weight Approach to Detecting Robots on Websites | Rémi Van Boxem, Tom Barbette, Cristel Pelsser, Ramin Sadre | 2026-03-30 | Download | Automated bots now account for roughly half of all web requests, and an increasing number deliberately spoof their identity to either evade detection or to not respect robots.txt. |
| From Simulation to Deep Learning: Survey on Network Performance Modeling Approaches | Carlos Güemes-Palau, Miquel Ferriol-Galmés, Jordi Paillisse-Vilanova, Pere Barlet-Ros, Albert Cabellos-Aparicio | 2026-03-30 | Download | Network performance modeling is a field that predates early computer networks and the beginning of the Internet. It aims to predict the traffic performance of packet flows in a given network. |
| Age of Incorrect Information for Generic Discrete-Time Markov Sources | Konstantinos Bountrogiannis, Anthony Ephremides, Panagiotis Tsakalides, George Tzagkarakis | 2026-03-30 | Download | This work introduces a framework for analyzing the Age of Incorrect Information (AoII) in a real-time monitoring system with a generic discrete-time Markov source. |
| Leaf-centric Logical Topology Design for OCS-based GPU Clusters | Xinchi Han, Weihao Jiang, Yingming Mao, Yike Liu, Zhuoran Liu, Yongxi Lv, Peirui Cao, Zhuotao Liu, Ximeng Liu, Xinbing Wang, Changbo Wu, Zihan Zhu, Wu Dongchao, Yang Jian, Zhang Zhanbang, Yuansen Chen, Shizhen Zhao | 2026-03-30 | Download | Recent years have witnessed the growing deployment of optical circuit switches (OCS) in commercial GPU clusters (e.g., Google A3 GPU cluster) optimized for machine learning (ML) workloads. |
| A Survey on AI for 6G: Challenges and Opportunities | Constantina Chatzieleftheriou, Eirini Liotou | 2026-03-30 | Download | As wireless communication evolves, each generation of networks brings new technologies that change how we connect and interact. Artificial Intelligence (AI) is becoming crucial in shaping the future o... |
| Beyond Traffic Matrix: DELTA -- A DAG-Aware OCS Logical Topology Optimization for AIDCs | Niangen Ye, Jingya Liu, Weiqiang Sun, Weisheng Hu | 2026-03-30 | Download | The rapid scaling of large language models (LLMs) exacerbates communication bottlenecks in AI data centers (AIDCs). To overcome this, optical circuit switches (OCS) are increasingly adopted for their ... |
| Varuna: Enabling Failure-Type Aware RDMA Failover | Xiaoyang Wang, Yongkun Li, Lulu Yao, Guoli Wei, Longcheng Yang, Yinlong Xu, Weiqing Kong, Weiguang Wang, Peng Dong, Bingyang Liu | 2026-03-30 | Download | RDMA link failures can render connections temporarily unavailable, causing both performance degradation and significant recovery overhead. To tolerate such failures, production datacenters assign each... |
| YUHENG-OS: A Cloud-Native Space Cluster Operating System | Jin Zhang, Jiachen Sun, Kai Liu, Linling Kuang, Jianhua Lu | 2026-03-30 | Download | As industry and academia continue to advance spaceborne computing and communication capabilities, the formation of cloud-native space clusters (CNSCs) has become an increasingly evident trend. |
| Adaptive Multi-Dimensional Coordinated Comprehensive Routing Scheme for IoV | Ruixing Ren, Minqi Tao, Junhui Zhao, Qiuping Li, Xiaoke Sun | 2026-03-30 | Download | The characteristics of high-speed node movement and dynamic topology changes pose great challenges to the design of internet of vehicles (IoV) routing protocols. |
| Beyond Message Passing: A Semantic View of Agent Communication Protocols | Dun Yuan, Fuyuan Lyu, Ye Yuan, Weixu Zhang, Bowei He, Jiayi Geng, Linfeng Du, Zipeng Sun, Yankai Chen, Changjiang Han, Jikun Kang, Alex Chen, Haolun Wu, Xue Liu | 2026-03-30 | Download | Agent communication protocols are becoming critical infrastructure for large language model (LLM) systems that must use tools, coordinate with other agents, and operate across heterogeneous environmen... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| SteelDB: Diagnosing Kernel-Space Bottlenecks in Cloud OLTP Databases | Mitsumasa Kondo | 2026-03-30 | Download | Modern cloud OLTP databases have sought performance primarily through user-space optimization - separating storage and compute layers, or distributing transactions across multiple nodes using consensu... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| CirrusBench: Evaluating LLM-based Agents Beyond Correctness in Real-World Cloud Service Environments | Yi Yu, Guangquan Hu, Chenghuang Shen, Xingyan Liu, Jing Gu, Hangyi Sun, Junzhuo Ma, Weiting Liu, Jianfeng Liu, Mingyue Pu, Yu Wang, Zhengdong Xiao, Rui Xie, Longjiu Luo, Qianrong Wang, Gurong Cui, Honglin Qiao, Wenlian Lu | 2026-03-30 | Download | The increasing agentic capabilities of Large Language Models (LLMs) have enabled their deployment in real-world applications, such as cloud services, where customer-assistant interactions exhibit high... |