2025-08-21

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
ASIC-Agent: An Autonomous Multi-Agent System for ASIC Design with Benchmark Evaluation	Ahmed Allam, Youssef Mansour, Mohamed Shalan	2025-08-21	下载	Large Language Models (LLMs) have demonstrated remarkable capabilities in Register Transfer Level (RTL) design, enabling high-quality code generation from natural language descriptions.
Putting the Context back into Memory	David A. Roberts	2025-08-21	下载	Requests arriving at main memory are often different from what programmers can observe or estimate by using CPU-based monitoring. Hardware cache prefetching, memory request scheduling and interleaving...
Row-Column Hybrid Grouping for Fault-Resilient Multi-Bit Weight Representation on IMC Arrays	Kang Eun Jeon, Sangheum Yeon, Jinhee Kim, Hyeonsu Bang, Johnny Rhe, Jong Hwan Ko	2025-08-21	下载	This paper addresses two critical challenges in analog In-Memory Computing (IMC) systems that limit their scalability and deployability: the computational unreliability caused by stuck-at faults (SAFs...
JEDI-linear: Fast and Efficient Graph Neural Networks for Jet Tagging on FPGAs	Zhiqiang Que, Chang Sun, Sudarshan Paramesvaran, Emyr Clement, Katerina Karakoulaki, Christopher Brown, Lauri Laatu, Arianna Cox, Alexander Tapper, Wayne Luk, Maria Spiropulu	2025-08-21	下载	Graph Neural Networks (GNNs), particularly Interaction Networks (INs), have shown exceptional performance for jet tagging at the CERN High-Luminosity Large Hadron Collider (HL-LHC).

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Multi-IaC-Eval: Benchmarking Cloud Infrastructure as Code Across Multiple Formats	Sam Davidson, Li Sun, Bhavana Bhasker, Laurent Callot, Anoop Deoras	2025-08-21	下载	Infrastructure as Code (IaC) is fundamental to modern cloud computing, enabling teams to define and manage infrastructure through machine-readable configuration files.
ASIC-Agent: An Autonomous Multi-Agent System for ASIC Design with Benchmark Evaluation	Ahmed Allam, Youssef Mansour, Mohamed Shalan	2025-08-21	下载	Large Language Models (LLMs) have demonstrated remarkable capabilities in Register Transfer Level (RTL) design, enabling high-quality code generation from natural language descriptions.
HyperFlexis: Joint Design of Algorithms and Systems for Multi-SLO Serving and Fast Scaling	Zahra Yousefijamarani, Xinglu Wang, Qian Wang, Morgan Lindsay Heisler, Taha Shabani, Niloofar Gholipour, Parham Yassini, Hong Chang, Kan Chen, Qiantao Zhang, Xiaolong Bai, Jiannan Wang, Ying Xiong, Yong Zhang, Zhenan Fan	2025-08-21	下载	Modern large language model (LLM) serving systems face challenges from highly variable requests with diverse lengths, priorities, and stage-specific service-level objectives (SLOs).
Mitigating context switching in densely packed Linux clusters with Latency-Aware Group Scheduling	Al Amjad Tawfiq Isstaif, Evangelia Kalyvianaki, Richard Mortier	2025-08-21	下载	Cluster orchestrators such as Kubernetes depend on accurate estimates of node capacity and job requirements. Inaccuracies in either lead to poor placement decisions and degraded cluster performance.
CausalMesh: A Formally Verified Causal Cache for Stateful Serverless Computing	Haoran Zhang, Zihao Zhang, Shuai Mu, Sebastian Angel, Vincent Liu	2025-08-21	下载	Stateful serverless workflows consist of multiple serverless functions that access state on a remote database. Developers sometimes add a cache layer between the serverless runtime and the database to...
Efficient Mixed-Precision Large Language Model Inference with TurboMind	Li Zhang, Youhe Jiang, Guoliang He, Xin Chen, Han Lv, Qian Yao, Fangcheng Fu, Kai Chen	2025-08-21	下载	Mixed-precision inference techniques reduce the memory and computational demands of Large Language Models (LLMs) by applying hybrid precision formats to model weights, activations, and KV caches.
Lower Bounds for $k$ -Set Agreement in Fault-Prone Networks	Pierre Fraigniaud, Minh Hang Nguyen, Ami Paz, Ulrich Schmid, Hugo Rincon Galeana	2025-08-21	下载	We develop a new lower bound for k-set agreement in synchronous message-passing systems connected by an arbitrary directed communication network, where up to t processes may crash.
Universal Dancing by Luminous Robots under Sequential Schedulers	Caterina Feletti, Paola Flocchini, Debasish Pattanayak, Giuseppe Prencipe, Nicola Santoro	2025-08-21	下载	The Dancing problem requires a swarm of $n$ autonomous mobile robots to form a sequence of patterns, aka perform a choreography. Existing work has proven that some crucial restrictions on choreographi...
On the Effectiveness of Graph Reordering for Accelerating Approximate Nearest Neighbor Search on GPU	Yutaro Oguri, Mai Nishimura, Yusuke Matsui	2025-08-21	下载	We present the first systematic investigation of graph reordering effects for graph-based Approximate Nearest Neighbor Search (ANNS) on a GPU.
Databelt: A Continuous Data Path for Serverless Workflows in the 3D Compute Continuum	Cynthia Marcelino, Leonard Guelmino, Thomas Pusztai, Stefan Nastic	2025-08-21	下载	Typically, serverless functions rely on remote storage services for managing state, which can result in increased latency and network communication overhead.
Optimizing Compilation for Distributed Quantum Computing via Clustering and Annealing	Ruilin Zhou, Jinglei Cheng, Yuhang Gan, Junyu Liu, Chen Qian	2025-08-21	下载	Efficiently mapping quantum programs onto Distributed quantum computing (DQC) are challenging, particularly when considering the heterogeneous quantum processing units (QPUs) with different structures...
Reliable Multi-view 3D Reconstruction for `Just-in-time' Edge Environments	Md. Nurul Absur, Abhinav Kumar, Swastik Brahma, Saptarshi Debroy	2025-08-21	下载	Multi-view 3D reconstruction applications are revolutionizing critical use cases that require rapid situational-awareness, such as emergency response, tactical scenarios, and public safety.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Interface on demand: Towards AI native Control interfaces for 6G	Abhishek Dandekar, Prashiddha D. Thapa, Ashrafur Rahman, Julius Schulz-Zander	2025-08-21	下载	Traditional standardized network interfaces face significant limitations, including vendor-specific incompatibilities, rigid design assumptions, and lack of adaptability for new functionalities.
Unlocking the Performance Potential of Mega-Constellation Networks: An Exploration of Structure-Building Paradigms	Xiangtong Wang, Wei Li, Menglong Yang, Songchen Han	2025-08-21	下载	Mega-constellation networks (MCNs) are transforming global internet access by providing ubiquitous connectivity to millions of users worldwide.
Toward Autonomous Digital Populations for Communication-Sensing-Computation Ecosystem	Gaosheng Zhao, Dong In Kim	2025-08-21	下载	Future communication networks are expected to achieve deep integration of communication, sensing, and computation, forming a tightly coupled and autonomously operating infrastructure system.

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
CXLAimPod: CXL Memory is all you need in AI era	Yiwei Yang, Yusheng Zheng, Yiqi Chen, Zheng Liang, Kexin Chu, Zhe Zhou, Andi Quinn, Wei Zhang	2025-08-21	下载	The proliferation of data-intensive applications, ranging from large language models to key-value stores, increasingly stresses memory systems with mixed read-write access patterns.
Iridescent: A Framework Enabling Online System Implementation Specialization	Vaastav Anand, Deepak Garg, Antoine Kaufmann	2025-08-21	下载	Specializing systems to specifics of the workload they serve and platform they are running on often significantly improves performance. However, specializing systems is difficult in practice because o...
Mitigating context switching in densely packed Linux clusters with Latency-Aware Group Scheduling	Al Amjad Tawfiq Isstaif, Evangelia Kalyvianaki, Richard Mortier	2025-08-21	下载	Cluster orchestrators such as Kubernetes depend on accurate estimates of node capacity and job requirements. Inaccuracies in either lead to poor placement decisions and degraded cluster performance.
Putting the Context back into Memory	David A. Roberts	2025-08-21	下载	Requests arriving at main memory are often different from what programmers can observe or estimate by using CPU-based monitoring. Hardware cache prefetching, memory request scheduling and interleaving...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Putting the Context back into Memory	David A. Roberts	2025-08-21	下载	Requests arriving at main memory are often different from what programmers can observe or estimate by using CPU-based monitoring. Hardware cache prefetching, memory request scheduling and interleaving...
Efficient Mixed-Precision Large Language Model Inference with TurboMind	Li Zhang, Youhe Jiang, Guoliang He, Xin Chen, Han Lv, Qian Yao, Fangcheng Fu, Kai Chen	2025-08-21	下载	Mixed-precision inference techniques reduce the memory and computational demands of Large Language Models (LLMs) by applying hybrid precision formats to model weights, activations, and KV caches.
SLM-Bench: A Comprehensive Benchmark of Small Language Models on Environmental Impacts--Extended Version	Nghiem Thanh Pham, Tung Kieu, Duc-Manh Nguyen, Son Ha Xuan, Nghia Duong-Trung, Danh Le-Phuoc	2025-08-21	下载	Small Language Models (SLMs) offer computational efficiency and accessibility, yet a systematic evaluation of their performance and environmental impact remains lacking.
KG-EDAS: A Meta-Metric Framework for Evaluating Knowledge Graph Completion Models	Haji Gul, Abul Ghani Naim, Ajaz Ahmad Bhat	2025-08-21	下载	Knowledge Graphs (KGs) enable applications in various domains such as semantic search, recommendation systems, and natural language processing.