2026-03-30
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| ARCS: Autoregressive Circuit Synthesis with Topology-Aware Graph Attention and Spec Conditioning | Tushar Dhananjay Pathak | 2026-03-30 | Download | This paper presents ARCS (Autoregressive Circuit Synthesis), a system for amortized analog circuit generation that produces complete, SPICE-simulatable designs (topology and component values) in milli... |
| Differentiable Initialization-Accelerated CPU-GPU Hybrid Combinatorial Scheduling | Mingju Liu, Jiaqi Yin, Alvaro Velasquez, Cunxi Yu | 2026-03-30 | Download | This paper presents a hybrid CPU-GPU framework for solving combinatorial scheduling problems formulated as Integer Linear Programming (ILP). While scheduling underpins many optimization tasks in compu... |
| Physical Design of UET-RVMCU: A Streamlined Open-Source RISC-V Microcontroller | Abdullah Azhar, Uneeb Kamal, Wajid Ali, Saad Gillani, Dr Suleman Sami Qazi | 2026-03-30 | Download | This paper presents the design and physical implementation of UET-RVMCU, a lightweight RISC-V microcontroller derived from the UETRV-PCore. Aimed at creating an accessible and flexible open-source RIS... |
| Loop Control Management in Tightly Coupled Processor Arrays (TCPAs) | Dominik Walter, Frank Hannig, Jürgen Teich | 2026-03-30 | Download | Multidimensional loop kernels often suffer from control overhead that can dominate execution time on parallel loop accelerators. Tightly Coupled Processor Arrays (TCPAs) offload loop control to a glob... |
| AceleradorSNN: A Neuromorphic Cognitive System Integrating Spiking Neural Networks and Dynamic Image Signal Processing on FPGA | Daniel Gutierrez, Ruben Martinez, Leyre Arnedo, Antonio Cuesta, Soukaina El Hamry | 2026-03-30 | Download | The demand for high-speed, low-latency, and energy-efficient object detection in autonomous systems -- such as advanced driver-assistance systems (ADAS), unmanned aerial vehicles (UAVs), and Industry ... |
| OptINC: Optical In-Network-Computing for Scalable Distributed Learning | Sijie Fei, Grace Li Zhang, Bing Li, Ulf Schlichtmann | 2026-03-30 | Download | Distributed learning is widely used for training large models on large datasets by distributing parts of the model or dataset across multiple devices and aggregating the computed results for subsequen... |
| A Switch-Centric In-Network Architecture for Accelerating LLM Inference in Shared-Memory Network | Aojie Jiang, Kang Zhu, Zhiheng Zhang, Zhengxu Su, Juntao Liu, Yuan Du, Li Du | 2026-03-30 | Download | In-network computing techniques, exemplified by NVLink Sharp (NVLS), offer a promising approach to addressing the communication bottlenecks in LLM inference by offloading collective operations, such a... |
| AXON: An Automated Netlist Optimization Framework for High-Speed Adders | Tiantian Yang, Xuanle Ren, Qingdian Wan, Qi Meng | 2026-03-30 | Download | Adders are fundamental building blocks in modern digital systems, and their performance, power, and area (PPA) directly impact system efficiency. |
| MCPT-Solver: An Monte Carlo Algorithm Solver Using MTJ Devices for Particle Transport Problems | Siqing Fu, Lizhou Wu, Tiejun Li, Xuchao Xie, Chunyuan Zhang, Sheng Ma, Jianmin Zhang, Yuhan Tang, Jixuan Tang | 2026-03-30 | Download | Monte Carlo particle transport problems play a vital role in scientific computing, but solving them on existing von Neumann architectures suffers from random branching and irregular memory access, caus... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| SteelDB: Diagnosing Kernel-Space Bottlenecks in Cloud OLTP Databases | Mitsumasa Kondo | 2026-03-30 | Download | Modern cloud OLTP databases have sought performance primarily through user-space optimization - separating storage and compute layers, or distributing transactions across multiple nodes using consensu... |
| Building the Palmetto API: Adding granular permissions and caching to the Slurm REST API without sacrificing compatibility | Ben Godfrey, Doug Dawson | 2026-03-30 | Download | The development of administrative and computational research tools requires reliable programmatic interfaces with the cluster scheduler. The Research Computing and Data (RCD) team at Clemson Universit... |
| Wherefore Art Thou? Provenance-Guided Automatic Online Debugging with Lumos | Jingyuan Chen, Lei Zhang, Leon Schuermann, Gongqi Huang, Ravi Netravali, Amit Levy | 2026-03-30 | Download | Debugging distributed systems in-production is inevitable and hard. Myriad interactions between concurrent components in modern, complex and large-scale systems cause non-deterministic bugs that offli... |
| Understand and Accelerate Memory Processing Pipeline for Disaggregated LLM Inference | Zifan He, Rui Ma, Yizhou Sun, Jason Cong | 2026-03-30 | Download | Modern large language models (LLMs) increasingly depend on efficient long-context processing and generation mechanisms, including sparse attention, retrieval-augmented generation (RAG), and compresse... |
| BitSov: A Composable Bitcoin-Native Architecture for Sovereign Internet Infrastructure | Oliver Aleksander Larsen, Rasmus Thorsen Larsen, Mahyar T. Moghaddam | 2026-03-30 | Download | Today's internet concentrates identity, payments, communication, and content hosting under a small number of corporate intermediaries, creating single points of failure, enabling censorship, and extra... |
| GPU-Accelerated Optimization of Transformer-Based Neural Networks for Real-Time Inference | Soutrik Mukherjee, Sangwhan Cha | 2026-03-30 | Download | This paper presents the design and evaluation of a GPU-accelerated inference pipeline for transformer models using NVIDIA TensorRT with mixed-precision optimization. |
| FL-PBM: Pre-Training Backdoor Mitigation for Federated Learning | Osama Wehbi, Sarhad Arisdakessian, Omar Abdel Wahab, Azzam Mourad, Hadi Otrok, Jamal Bentahar | 2026-03-30 | Download | Backdoor attacks pose a significant threat to the integrity and reliability of Artificial Intelligence (AI) models, enabling adversaries to manipulate model behavior by injecting poisoned data with hi... |
| Mitigating Backdoor Attacks in Federated Learning Using PPA and MiniMax Game Theory | Osama Wehbi, Sarhad Arisdakessian, Omar Abdel Wahab, Anderson Avila, Azzam Mourad, Hadi Otrok | 2026-03-30 | Download | Federated Learning (FL) is witnessing wider adoption due to its ability to benefit from large amounts of scattered data while preserving privacy. |
| Sublogarithmic Distributed Vertex Coloring with Optimal Number of Colors | Maxime Flin, Magnús M. Halldórsson, Manuel Jakob, Yannic Maus | 2026-03-30 | Download | For any Δ, let k_Δ be the maximum integer k such that (k+1)(k+2) ≤ Δ. We give a distributed LOCAL algorithm that, given an integer k < k_Δ, computes a valid (Δ-k)-coloring if one exists. |
| Trust-Aware Routing for Distributed Generative AI Inference at the Edge | Chanh Nguyen, Erik Elmroth | 2026-03-30 | Download | Emerging deployments of Generative AI increasingly execute inference across decentralized and heterogeneous edge devices rather than on a single trusted server. |
| FeDMRA: Federated Incremental Learning with Dynamic Memory Replay Allocation | Tiantian Wang, Xiang Xiang, Simon S. Du | 2026-03-30 | Download | In federated healthcare systems, Federated Class-Incremental Learning (FCIL) has emerged as a key paradigm, enabling continuous adaptive model learning among distributed clients while safeguarding dat... |
| Warp-STAR: High-performance, Differentiable GPU-Accelerated Static Timing Analysis through Warp-oriented Parallel Orchestration | En-Ming Huang, Shih-Hao Hung | 2026-03-30 | Download | Static timing analysis (STA) is crucial for Electronic Design Automation (EDA) flows but remains a computational bottleneck. While existing GPU-based STA engines are faster than CPU, they suffer from ... |
| Key-Embedded Privacy for Decentralized AI in Biomedical Omics | Rongyu Zhang, Hongyu Dong, Gaole Dai, Ziqi Qiao, Shenli Zheng, Yuan Zhang, Aosong Cheng, Xiaowei Chi, Jincai Luo, Pin Li, Li Du, Dan Wang, Yuan Du, Xudong Xing, Jianxu Chen, Shanghang Zhang | 2026-03-30 | Download | The rapid adoption of data-driven methods in biomedicine has intensified concerns over privacy, governance, and regulation, limiting raw data sharing and hindering the assembly of representative cohor... |
| Pre-Deployment Complexity Estimation for Federated Perception Systems | KMA Solaiman, Shafkat Islam, Ruy de Oliveira, Bharat Bhargava | 2026-03-30 | Download | Edge AI systems increasingly rely on federated learning to train perception models in distributed, privacy-preserving, and resource-constrained environments. |
| Efficient Counting and Simulation in Content-Oblivious Rings | Jérémie Chalopin, Yi-Jun Chang, Giuseppe Antonio Di Luna, Haoran Zhou | 2026-03-30 | Download | In the content-oblivious (CO) model (proposed by Censor-Hillel et al.), processes inhabit an asynchronous network and communicate only by exchanging pulses. |
| Varuna: Enabling Failure-Type Aware RDMA Failover | Xiaoyang Wang, Yongkun Li, Lulu Yao, Guoli Wei, Longcheng Yang, Yinlong Xu, Weiqing Kong, Weiguang Wang, Peng Dong, Bingyang Liu | 2026-03-30 | Download | RDMA link failures can render connections temporarily unavailable, causing both performance degradation and significant recovery overhead. To tolerate such failures, production datacenters assign each... |
| YUHENG-OS: A Cloud-Native Space Cluster Operating System | Jin Zhang, Jiachen Sun, Kai Liu, Linling Kuang, Jianhua Lu | 2026-03-30 | Download | As industry and academia continue to advance spaceborne computing and communication capabilities, the formation of cloud-native space clusters (CNSCs) has become an increasingly evident trend. |
| ITQ3_S: High-Fidelity 3-bit LLM Inference via Interleaved Ternary Quantization with Rotation-Domain Smoothing | Edward J. Yoon | 2026-03-30 | Download | We present ITQ3_S (Interleaved Ternary Quantization -- Specialized), a novel 3-bit weight quantization format for LLMs integrating TurboQuant (TQ), a rotation-domain strategy based on the Fast Walsh-H... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Embeddings of Nation-Level Social Networks | Tanzir Pial, Flavio Hafner, Dakota Handzlik, Enamul Hassan, Lucas Sage, Ana Macanovic, Tom Emery, Arnout van de Rijt, Steven Skiena | 2026-03-30 | Download | Full nation-scale social networks are now emerging from countries such as the Netherlands and Denmark, but these networks present challenging technical issues in working with large, multiplex, time-de... |
| Iran's January 2026 Internet Shutdown: Public Data, Censorship Methods, and Circumvention Techniques | Giuseppe Aceto, Valerio Persico, Antonio Pescapè | 2026-03-30 | Download | This paper analyzes the Internet shutdown that occurred in Iran in January 2026 in the context of protests, focusing on its impact on the country's digital communication infrastructure and on informat... |
| Study of Post Quantum status of Widely Used Protocols | Tushin Mallick, Ashish Kundu, Ramana Kompella | 2026-03-30 | Download | The advent of quantum computing poses significant threats to classical public-key cryptographic primitives such as RSA and elliptic-curve cryptography. |
| BitSov: A Composable Bitcoin-Native Architecture for Sovereign Internet Infrastructure | Oliver Aleksander Larsen, Rasmus Thorsen Larsen, Mahyar T. Moghaddam | 2026-03-30 | Download | Today's internet concentrates identity, payments, communication, and content hosting under a small number of corporate intermediaries, creating single points of failure, enabling censorship, and extra... |
| A Techno-Economic Framework for Cost Modeling and Revenue Opportunities in Open and Programmable AI-RAN | Gabriele Gemmi, Michele Polese, Tommaso Melodia | 2026-03-30 | Download | The large-scale deployment of 5G networks has not delivered the expected return on investment for mobile network operators, raising concerns about the economic viability of future 6G rollouts. |
| How Many Qubits Can Be Teleported? Scalability of Fidelity-Constrained Quantum Applications | Oscar Adamuz-Hinojosa, Jonathan Prados-Garzon, Sara Vaquero-Gil, Juan M. Lopez-Soler | 2026-03-30 | Download | Quantum networks (QNs) enable the transfer of qubits between distant nodes using quantum teleportation, which reproduces a qubit state at a remote location by consuming a shared Bell pair. |
| Performance Analysis of 5G RAN Slicing Deployment Options in Industry 4.0 Factories | Oscar Adamuz-Hinojosa, Abdelhilah Abdeselam, Pablo Muñoz, Pablo Ameigeiras, Juan M. Lopez-Soler | 2026-03-30 | Download | This paper studies Radio Access Network (RAN) slicing strategies for 5G Industry 4.0 networks with ultra-reliable low-latency communication (uRLLC) requirements. |
| Trust-Aware Routing for Distributed Generative AI Inference at the Edge | Chanh Nguyen, Erik Elmroth | 2026-03-30 | Download | Emerging deployments of Generative AI increasingly execute inference across decentralized and heterogeneous edge devices rather than on a single trusted server. |
| Shy Guys: A Light-Weight Approach to Detecting Robots on Websites | Rémi Van Boxem, Tom Barbette, Cristel Pelsser, Ramin Sadre | 2026-03-30 | Download | Automated bots now account for roughly half of all web requests, and an increasing number deliberately spoof their identity to either evade detection or to not respect robots.txt. |
| From Simulation to Deep Learning: Survey on Network Performance Modeling Approaches | Carlos Güemes-Palau, Miquel Ferriol-Galmés, Jordi Paillisse-Vilanova, Pere Barlet-Ros, Albert Cabellos-Aparicio | 2026-03-30 | Download | Network performance modeling is a field that predates early computer networks and the beginning of the Internet. It aims to predict the traffic performance of packet flows in a given network. |
| Age of Incorrect Information for Generic Discrete-Time Markov Sources | Konstantinos Bountrogiannis, Anthony Ephremides, Panagiotis Tsakalides, George Tzagkarakis | 2026-03-30 | Download | This work introduces a framework for analyzing the Age of Incorrect Information (AoII) in a real-time monitoring system with a generic discrete-time Markov source. |
| Leaf-centric Logical Topology Design for OCS-based GPU Clusters | Xinchi Han, Weihao Jiang, Yingming Mao, Yike Liu, Zhuoran Liu, Yongxi Lv, Peirui Cao, Zhuotao Liu, Ximeng Liu, Xinbing Wang, Changbo Wu, Zihan Zhu, Wu Dongchao, Yang Jian, Zhang Zhanbang, Yuansen Chen, Shizhen Zhao | 2026-03-30 | Download | Recent years have witnessed the growing deployment of optical circuit switches (OCS) in commercial GPU clusters (e.g., Google A3 GPU cluster) optimized for machine learning (ML) workloads. |
| A Survey on AI for 6G: Challenges and Opportunities | Constantina Chatzieleftheriou, Eirini Liotou | 2026-03-30 | Download | As wireless communication evolves, each generation of networks brings new technologies that change how we connect and interact. Artificial Intelligence (AI) is becoming crucial in shaping the future o... |
| Beyond Traffic Matrix: DELTA -- A DAG-Aware OCS Logical Topology Optimization for AIDCs | Niangen Ye, Jingya Liu, Weiqiang Sun, Weisheng Hu | 2026-03-30 | Download | The rapid scaling of large language models (LLMs) exacerbates communication bottlenecks in AI data centers (AIDCs). To overcome this, optical circuit switches (OCS) are increasingly adopted for their ... |
| Varuna: Enabling Failure-Type Aware RDMA Failover | Xiaoyang Wang, Yongkun Li, Lulu Yao, Guoli Wei, Longcheng Yang, Yinlong Xu, Weiqing Kong, Weiguang Wang, Peng Dong, Bingyang Liu | 2026-03-30 | Download | RDMA link failures can render connections temporarily unavailable, causing both performance degradation and significant recovery overhead. To tolerate such failures, production datacenters assign each... |
| YUHENG-OS: A Cloud-Native Space Cluster Operating System | Jin Zhang, Jiachen Sun, Kai Liu, Linling Kuang, Jianhua Lu | 2026-03-30 | Download | As industry and academia continue to advance spaceborne computing and communication capabilities, the formation of cloud-native space clusters (CNSCs) has become an increasingly evident trend. |
| Adaptive Multi-Dimensional Coordinated Comprehensive Routing Scheme for IoV | Ruixing Ren, Minqi Tao, Junhui Zhao, Qiuping Li, Xiaoke Sun | 2026-03-30 | Download | The characteristics of high-speed node movement and dynamic topology changes pose great challenges to the design of internet of vehicles (IoV) routing protocols. |
| Beyond Message Passing: A Semantic View of Agent Communication Protocols | Dun Yuan, Fuyuan Lyu, Ye Yuan, Weixu Zhang, Bowei He, Jiayi Geng, Linfeng Du, Zipeng Sun, Yankai Chen, Changjiang Han, Jikun Kang, Alex Chen, Haolun Wu, Xue Liu | 2026-03-30 | Download | Agent communication protocols are becoming critical infrastructure for large language model (LLM) systems that must use tools, coordinate with other agents, and operate across heterogeneous environmen... |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| SteelDB: Diagnosing Kernel-Space Bottlenecks in Cloud OLTP Databases | Mitsumasa Kondo | 2026-03-30 | Download | Modern cloud OLTP databases have sought performance primarily through user-space optimization - separating storage and compute layers, or distributing transactions across multiple nodes using consensu... |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| CirrusBench: Evaluating LLM-based Agents Beyond Correctness in Real-World Cloud Service Environments | Yi Yu, Guangquan Hu, Chenghuang Shen, Xingyan Liu, Jing Gu, Hangyi Sun, Junzhuo Ma, Weiting Liu, Jianfeng Liu, Mingyue Pu, Yu Wang, Zhengdong Xiao, Rui Xie, Longjiu Luo, Qianrong Wang, Gurong Cui, Honglin Qiao, Wenlian Lu | 2026-03-30 | Download | The increasing agentic capabilities of Large Language Models (LLMs) have enabled their deployment in real-world applications, such as cloud services, where customer-assistant interactions exhibit high... |