2026-03-11

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Synthesis-in-the-Loop Evaluation of LLMs for RTL Generation: Quality, Reliability, and Failure Modes	Weimin Fu, Zeng Wang, Minghao Shao, Ramesh Karri, Muhammad Shafique, Johann Knechtel, Ozgur Sinanoglu, Xiaolong Guo	2026-03-11	下载	RTL generation demands more than software code synthesis: designs must be syntactically valid, synthesizable, functionally correct, and hardware-efficient.
Reference Architecture of a Quantum-Centric Supercomputer	Seetharami Seelam, Jerry M. Chow, Antonio Córcoles, Sarah Sheldon, Tushar Mittal, Abhinav Kandala, Sean Dague, Ian Hincks, Hiroshi Horii, Blake Johnson, Michael Le, Hani Jamjoom, Jay M. Gambetta	2026-03-11	下载	Quantum computers have demonstrated utility in simulating quantum systems beyond brute-force classical approaches. As the community builds on these demonstrations to explore using quantum computing fo...
An FPGA Implementation of Displacement Vector Search for Intra Pattern Copy in JPEG XS	Qiyue Chen, Yao Li, Jie Tao, Song Chen, Li Li, Dong Liu	2026-03-11	下载	Recently, progress has been made on the Intra Pattern Copy (IPC) tool for JPEG XS, an image compression standard designed for low-latency and low-complexity coding.
In-Memory ADC-Based Nonlinear Activation Quantization for Efficient In-Memory Computing	Shuai Dong, Junyi Yang, Biyan Zhou, Hongyang Shang, Gourav Datta, Arindam Basu	2026-03-11	下载	In deep networks, operations such as ReLU and hardware-driven clamping often cause activations to accumulate near the edges of the distribution, leading to biased clustering and suboptimal quantizatio...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Reference Architecture of a Quantum-Centric Supercomputer	Seetharami Seelam, Jerry M. Chow, Antonio Córcoles, Sarah Sheldon, Tushar Mittal, Abhinav Kandala, Sean Dague, Ian Hincks, Hiroshi Horii, Blake Johnson, Michael Le, Hani Jamjoom, Jay M. Gambetta	2026-03-11	下载	Quantum computers have demonstrated utility in simulating quantum systems beyond brute-force classical approaches. As the community builds on these demonstrations to explore using quantum computing fo...
Data Augmentation and Convolutional Network Architecture Influence on Distributed Learning	Victor Forattini Jansen, Emanuel Teixeira Martins, Yasmin Souza Lima, Flavio de Oliveira Silva, Rodrigo Moreira, Larissa Ferreira Rodrigues Moreira	2026-03-11	下载	Convolutional Neural Networks (CNNs) have proven to be highly effective in solving a broad spectrum of computer vision tasks, such as classification, identification, and segmentation.
Topological Analysis for Identifying Anomalies in Serverless Platforms	Gianluca Reali, Mauro Femminella	2026-03-11	下载	The information flows in serverless platforms are complex and non-conservative. This is a direct result of how independently deployed functions interact under the platform coarse-grained control mecha...
Aceso: Carbon-Aware and Cost-Effective Microservice Placement for Small and Medium-sized Enterprises	Georgia Christofidi, Francisco Álvarez-Terribas, Ioannis Roumpos, Nicolas Kourtellis, Jesus Omaña Iglesias, Thaleia Dimitra Doudali	2026-03-11	下载	Microservices are a dominant architecture in cloud computing, offering scalability and modularity, but also posing complex deployment challenges.
CacheSolidarity: Preventing Prefix Caching Side Channels in Multi-tenant LLM Serving Systems	Panagiotis Georgios Pennas, Konstantinos Papaioannou, Marco Guarnieri, Thaleia Dimitra Doudali	2026-03-11	下载	Large Language Models (LLMs) rely on optimizations like Automatic Prefix Caching (APC) to accelerate inference. APC works by reusing previously computed states for the beginning part of a request (pre...
Double-Precision Matrix Multiplication Emulation via Ozaki-II Scheme with FP8 Quantization	Yuki Uchino, Katsuhisa Ozaki, Toshiyuki Imamura	2026-03-11	下载	In this paper, we propose a method for emulating double-precision general matrix--matrix multiplication (DGEMM), a fundamental and performance-critical kernel in many high-performance computing applic...
Thousand-GPU Large-Scale Training and Optimization Recipe for AI-Native Cloud Embodied Intelligence Infrastructure	Yongjian Guo, Yunxuan Ma, Haoran Sun, Zhong Guan, Shuai Di, Jing Long, Wanting Xu, Xiaodong Bai, Wen Huang, Yucheng Guo, Chen Zhou, Qiming Yang, Mingxi Luo, Tianyun Zhao, Hedan Yang, Song Wang, Xiaomeng Tian, Xiaolong Xiang, Zhen Sun, Yu Wei, Luqiao Wang, Yuzhen Li, Chenfeng Gu, Junwu Xiong, Yicheng Gong	2026-03-11	下载	Embodied intelligence is a key step towards Artificial General Intelligence (AGI), yet its development faces multiple challenges including data, frameworks, infrastructure, and evaluation systems.
CD-Raft: Reducing the Latency of Distributed Consensus in Cross-Domain Sites	Yangyang Wang, Ziqian Cheng, Yucong Dong, Zichen Xu	2026-03-11	下载	Today's massive AI computation loads push heavy data synchronization across sites, i.e., nodes in data centers. Any reduction in such consensus latency can significantly improve the overall performanc...
Estimating the condition number of Chebyshev filtered vectors with application to the ChASE library	Edoardo Di Napoli, Xinzhe Wu	2026-03-11	下载	Chebyshev filtered subspace iteration is a well-known algorithm for the solution of (symmetric/Hermitian) algebraic eigenproblems which has been implemented in several application codes~\cite{Kronik:2...
COHORT: Hybrid RL for Collaborative Large DNN Inference on Multi-Robot Systems Under Real-Time Constraints	Mohammad Saeid Anwar, Anuradha Ravi, Indrajeet Ghosh, Gaurav Shinde, Carl Busart, Nirmalya Roy	2026-03-11	下载	Large deep neural networks (DNNs), especially transformer-based and multimodal architectures, are computationally demanding and challenging to deploy on resource-constrained edge platforms like field ...
SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs	Yadi Cao, Sicheng Lai, Jiahe Huang, Yang Zhang, Zach Lawrence, Rohan Bhakta, Izzy F. Thomas, Mingyun Cao, Chung-Hao Tsai, Zihao Zhou, Yidong Zhao, Hao Liu, Alessandro Marinoni, Alexey Arefiev, Rose Yu	2026-03-11	下载	Evaluating LLM agents for scientific tasks has focused on token costs while ignoring tool-use costs like simulation time and experimental resources.
S-HPLB: Efficient LLM Attention Serving via Sparsity-Aware Head Parallelism Load Balance	Di Liu, Yifei Liu, Chen Chen, Zhibin Yu, Xiaoyi Fan, Quan Chen, Minyi Guo	2026-03-11	下载	With the increasing volumes of Large Language Models (LLMs) and the expanding context lengths, attention computation has become a key performance bottleneck in LLM serving.
AgentServe: Algorithm-System Co-Design for Efficient Agentic AI Serving on a Consumer-Grade GPU	Yuning Zhang, Yan Yan, Nan Yang, Dong Yuan	2026-03-11	下载	Large language models (LLMs) are increasingly deployed as AI agents that operate in short reasoning-action loops, interleaving model computation with external calls.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Measurement-Driven O-RAN Diagnostics with Tail Latency and Scheduler Indicators	Theofanis P. Raptis, Weronika Maria Bachan, Roberto Verdone	2026-03-11	下载	We investigate cross-layer performance diagnostics for an O-RAN instance by jointly analyzing application-level latency and radio-layer behavior from a real measurement campaign.
Expressive Boundedness of Authoritative DNS Response Selection	Chris Bertinato	2026-03-11	下载	Authoritative Domain Name System (DNS) response selection defines query-time response selection based on resolver-visible context and per-answer metadata, yielding different observable outcomes for th...
Topological Analysis for Identifying Anomalies in Serverless Platforms	Gianluca Reali, Mauro Femminella	2026-03-11	下载	The information flows in serverless platforms are complex and non-conservative. This is a direct result of how independently deployed functions interact under the platform coarse-grained control mecha...
Feasibility of satellite-augmented global quantum repeater networks	Manik Dawar, Clement Paillet, Nilesh Vyas, Andrew Thain, Rodrigo Henriques Guilherme, Ralf Riedinger	2026-03-11	下载	A large scale quantum network requires the distribution of high-fidelity end-to-end entanglement. To overcome the range limitations inherent to terrestrial fiber, a leading architecture has emerged: s...
Initialization and Rate-Quality Functions for Generative Network Layer Protocols	Mathias Thorsager, Israel Leyva-Mayorga, Petar Popovski	2026-03-11	下载	Generative AI (GenAI) creates full content based on compact prompts. While GenAI has been used for applications where the generated content is returned to the prompt sender, it can play a vital role i...
Towards Intelligent Spectrum Management: Spectrum Demand Estimation Using Graph Neural Networks	Mohamad Alkadamani, Amir Ghasemi, Halim Yanikomeroglu	2026-03-11	下载	The growing demand for wireless connectivity, combined with limited spectrum resources, calls for more efficient spectrum management. Spectrum sharing is a promising approach; however, regulators need...
Q-StaR: A Quasi-Static Routing Scheme for NoCs	Yang Zhang, Yiren Zhao, Xu Wang, Fengyuan Ren	2026-03-11	下载	In networks-on-chip, static routing schemes are favored for their simplicity and predictability, but they cannot effectively balance network load due to the unawareness of runtime load distribution.
Adaptive RAN Slicing Control via Reward-Free Self-Finetuning Agents	Yuanhao Li, Haozhe Wang, Geyong Min, Nektarios Georgalas, Wang Miao	2026-03-11	下载	The integration of Generative AI models into AI-native network systems offers a transformative path toward achieving autonomous and adaptive control.
A Secure Splitting and Acceleration Strategy for TCP/QUIC in Interplanetary Networks	Jianhao Yu, Ye Li, Qingfang Jiang, Shuai Liu, Wenfeng Li, Kanglian Zhao	2026-03-11	下载	Interplanetary networks (IPNs) present unique challenges such as extreme delay, high loss, and frequent disruptions that severely degrade the performance of conventional transport protocols like Trans...
Spyglass: Directional Spectrum Sensing with Single-shot AoA Estimation and Virtual Arrays	Raghav Subbaraman, Akshit Agarwal, Wenhao Chen, Dinesh Bharadia	2026-03-11	下载	In this paper, we introduce Spyglass, a spectrum sensor designed to address the challenges of effective spectrum usage in dense wireless environments.
Taming Vision Priors for Data Efficient mmWave Channel Modeling	Zhenlin An, Longfei Shangguan, John Kaewell, Philip Pietraski, Jelena Senic, Camillo Gentile, Nada Golmie, Kyle Jamieson	2026-03-11	下载	Accurately modeling millimeter-wave (mmWave) propagation is essential for real-time AR and autonomous systems. Differentiable ray tracing offers a physics-grounded solution but still facing deployment...
Utility Function is All You Need: LLM-based Congestion Control	Neta Rozen-Schiff, Liron Schiff, Stefan Schmid	2026-03-11	下载	Congestion is a critical and challenging problem in communication networks. Congestion control protocols allow network applications to tune their sending rate in a way that optimizes their performance...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Improving LLM Performance Through Black-Box Online Tuning: A Case for Adding System Specs to Factsheets for Trusted AI	Yonas Atinafu, Henry Lin, Robin Cohen	2026-03-11	下载	In this paper, we present a novel black-box online controller that uses only end-to-end measurements over short segments, without internal instrumentation, and hill climbing to maximize goodput, defin...
RAGPerf: An End-to-End Benchmarking Framework for Retrieval-Augmented Generation Systems	Shaobo Li, Yirui Zhou, Yuan Xu, Kevin Chen, Daniel Waddington, Swaminathan Sundararaman, Hubertus Franke, Jian Huang	2026-03-11	下载	We present the design and implementation of a RAG-based AI system benchmarking (RAGPerf) framework for characterizing the system behaviors of RAG pipelines.