
2025-08-21

cs.AR - Architecture

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| ASIC-Agent: An Autonomous Multi-Agent System for ASIC Design with Benchmark Evaluation | Ahmed Allam, Youssef Mansour, Mohamed Shalan | 2025-08-21 | Large Language Models (LLMs) have demonstrated remarkable capabilities in Register Transfer Level (RTL) design, enabling high-quality code generation from natural language descriptions. |
| Putting the Context back into Memory | David A. Roberts | 2025-08-21 | Requests arriving at main memory are often different from what programmers can observe or estimate by using CPU-based monitoring. Hardware cache prefetching, memory request scheduling and interleaving... |
| Row-Column Hybrid Grouping for Fault-Resilient Multi-Bit Weight Representation on IMC Arrays | Kang Eun Jeon, Sangheum Yeon, Jinhee Kim, Hyeonsu Bang, Johnny Rhe, Jong Hwan Ko | 2025-08-21 | This paper addresses two critical challenges in analog In-Memory Computing (IMC) systems that limit their scalability and deployability: the computational unreliability caused by stuck-at faults (SAFs... |
| JEDI-linear: Fast and Efficient Graph Neural Networks for Jet Tagging on FPGAs | Zhiqiang Que, Chang Sun, Sudarshan Paramesvaran, Emyr Clement, Katerina Karakoulaki, Christopher Brown, Lauri Laatu, Arianna Cox, Alexander Tapper, Wayne Luk, Maria Spiropulu | 2025-08-21 | Graph Neural Networks (GNNs), particularly Interaction Networks (INs), have shown exceptional performance for jet tagging at the CERN High-Luminosity Large Hadron Collider (HL-LHC). |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Multi-IaC-Eval: Benchmarking Cloud Infrastructure as Code Across Multiple Formats | Sam Davidson, Li Sun, Bhavana Bhasker, Laurent Callot, Anoop Deoras | 2025-08-21 | Infrastructure as Code (IaC) is fundamental to modern cloud computing, enabling teams to define and manage infrastructure through machine-readable configuration files. |
| ASIC-Agent: An Autonomous Multi-Agent System for ASIC Design with Benchmark Evaluation | Ahmed Allam, Youssef Mansour, Mohamed Shalan | 2025-08-21 | Large Language Models (LLMs) have demonstrated remarkable capabilities in Register Transfer Level (RTL) design, enabling high-quality code generation from natural language descriptions. |
| HyperFlexis: Joint Design of Algorithms and Systems for Multi-SLO Serving and Fast Scaling | Zahra Yousefijamarani, Xinglu Wang, Qian Wang, Morgan Lindsay Heisler, Taha Shabani, Niloofar Gholipour, Parham Yassini, Hong Chang, Kan Chen, Qiantao Zhang, Xiaolong Bai, Jiannan Wang, Ying Xiong, Yong Zhang, Zhenan Fan | 2025-08-21 | Modern large language model (LLM) serving systems face challenges from highly variable requests with diverse lengths, priorities, and stage-specific service-level objectives (SLOs). |
| Mitigating context switching in densely packed Linux clusters with Latency-Aware Group Scheduling | Al Amjad Tawfiq Isstaif, Evangelia Kalyvianaki, Richard Mortier | 2025-08-21 | Cluster orchestrators such as Kubernetes depend on accurate estimates of node capacity and job requirements. Inaccuracies in either lead to poor placement decisions and degraded cluster performance. |
| CausalMesh: A Formally Verified Causal Cache for Stateful Serverless Computing | Haoran Zhang, Zihao Zhang, Shuai Mu, Sebastian Angel, Vincent Liu | 2025-08-21 | Stateful serverless workflows consist of multiple serverless functions that access state on a remote database. Developers sometimes add a cache layer between the serverless runtime and the database to... |
| Efficient Mixed-Precision Large Language Model Inference with TurboMind | Li Zhang, Youhe Jiang, Guoliang He, Xin Chen, Han Lv, Qian Yao, Fangcheng Fu, Kai Chen | 2025-08-21 | Mixed-precision inference techniques reduce the memory and computational demands of Large Language Models (LLMs) by applying hybrid precision formats to model weights, activations, and KV caches. |
| Lower Bounds for k-Set Agreement in Fault-Prone Networks | Pierre Fraigniaud, Minh Hang Nguyen, Ami Paz, Ulrich Schmid, Hugo Rincon Galeana | 2025-08-21 | We develop a new lower bound for k-set agreement in synchronous message-passing systems connected by an arbitrary directed communication network, where up to t processes may crash. |
| Universal Dancing by Luminous Robots under Sequential Schedulers | Caterina Feletti, Paola Flocchini, Debasish Pattanayak, Giuseppe Prencipe, Nicola Santoro | 2025-08-21 | The Dancing problem requires a swarm of n autonomous mobile robots to form a sequence of patterns, aka perform a choreography. Existing work has proven that some crucial restrictions on choreographi... |
| On the Effectiveness of Graph Reordering for Accelerating Approximate Nearest Neighbor Search on GPU | Yutaro Oguri, Mai Nishimura, Yusuke Matsui | 2025-08-21 | We present the first systematic investigation of graph reordering effects for graph-based Approximate Nearest Neighbor Search (ANNS) on a GPU. |
| Databelt: A Continuous Data Path for Serverless Workflows in the 3D Compute Continuum | Cynthia Marcelino, Leonard Guelmino, Thomas Pusztai, Stefan Nastic | 2025-08-21 | Typically, serverless functions rely on remote storage services for managing state, which can result in increased latency and network communication overhead. |
| Optimizing Compilation for Distributed Quantum Computing via Clustering and Annealing | Ruilin Zhou, Jinglei Cheng, Yuhang Gan, Junyu Liu, Chen Qian | 2025-08-21 | Efficiently mapping quantum programs onto distributed quantum computing (DQC) is challenging, particularly when considering the heterogeneous quantum processing units (QPUs) with different structures... |
| Reliable Multi-view 3D Reconstruction for 'Just-in-time' Edge Environments | Md. Nurul Absur, Abhinav Kumar, Swastik Brahma, Saptarshi Debroy | 2025-08-21 | Multi-view 3D reconstruction applications are revolutionizing critical use cases that require rapid situational-awareness, such as emergency response, tactical scenarios, and public safety. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Interface on demand: Towards AI native Control interfaces for 6G | Abhishek Dandekar, Prashiddha D. Thapa, Ashrafur Rahman, Julius Schulz-Zander | 2025-08-21 | Traditional standardized network interfaces face significant limitations, including vendor-specific incompatibilities, rigid design assumptions, and lack of adaptability for new functionalities. |
| Unlocking the Performance Potential of Mega-Constellation Networks: An Exploration of Structure-Building Paradigms | Xiangtong Wang, Wei Li, Menglong Yang, Songchen Han | 2025-08-21 | Mega-constellation networks (MCNs) are transforming global internet access by providing ubiquitous connectivity to millions of users worldwide. |
| Toward Autonomous Digital Populations for Communication-Sensing-Computation Ecosystem | Gaosheng Zhao, Dong In Kim | 2025-08-21 | Future communication networks are expected to achieve deep integration of communication, sensing, and computation, forming a tightly coupled and autonomously operating infrastructure system. |

cs.OS - Operating Systems

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| CXLAimPod: CXL Memory is all you need in AI era | Yiwei Yang, Yusheng Zheng, Yiqi Chen, Zheng Liang, Kexin Chu, Zhe Zhou, Andi Quinn, Wei Zhang | 2025-08-21 | The proliferation of data-intensive applications, ranging from large language models to key-value stores, increasingly stresses memory systems with mixed read-write access patterns. |
| Iridescent: A Framework Enabling Online System Implementation Specialization | Vaastav Anand, Deepak Garg, Antoine Kaufmann | 2025-08-21 | Specializing systems to specifics of the workload they serve and platform they are running on often significantly improves performance. However, specializing systems is difficult in practice because o... |
| Mitigating context switching in densely packed Linux clusters with Latency-Aware Group Scheduling | Al Amjad Tawfiq Isstaif, Evangelia Kalyvianaki, Richard Mortier | 2025-08-21 | Cluster orchestrators such as Kubernetes depend on accurate estimates of node capacity and job requirements. Inaccuracies in either lead to poor placement decisions and degraded cluster performance. |
| Putting the Context back into Memory | David A. Roberts | 2025-08-21 | Requests arriving at main memory are often different from what programmers can observe or estimate by using CPU-based monitoring. Hardware cache prefetching, memory request scheduling and interleaving... |

cs.PF - Performance

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Putting the Context back into Memory | David A. Roberts | 2025-08-21 | Requests arriving at main memory are often different from what programmers can observe or estimate by using CPU-based monitoring. Hardware cache prefetching, memory request scheduling and interleaving... |
| Efficient Mixed-Precision Large Language Model Inference with TurboMind | Li Zhang, Youhe Jiang, Guoliang He, Xin Chen, Han Lv, Qian Yao, Fangcheng Fu, Kai Chen | 2025-08-21 | Mixed-precision inference techniques reduce the memory and computational demands of Large Language Models (LLMs) by applying hybrid precision formats to model weights, activations, and KV caches. |
| SLM-Bench: A Comprehensive Benchmark of Small Language Models on Environmental Impacts--Extended Version | Nghiem Thanh Pham, Tung Kieu, Duc-Manh Nguyen, Son Ha Xuan, Nghia Duong-Trung, Danh Le-Phuoc | 2025-08-21 | Small Language Models (SLMs) offer computational efficiency and accessibility, yet a systematic evaluation of their performance and environmental impact remains lacking. |
| KG-EDAS: A Meta-Metric Framework for Evaluating Knowledge Graph Completion Models | Haji Gul, Abul Ghani Naim, Ajaz Ahmad Bhat | 2025-08-21 | Knowledge Graphs (KGs) enable applications in various domains such as semantic search, recommendation systems, and natural language processing. |
