2025-08-08

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Nail: Not Another Fault-Injection Framework for Chisel-generated RTL	Robin Sehm, Christian Ewert, Rainer Buchty, Mladen Berekovic, Saleh Mulhem	2025-08-08	下载	Fault simulation and emulation are essential techniques for evaluating the dependability of integrated circuits, enabling early-stage vulnerability analysis and supporting the implementation of effect...
ArchXBench: A Complex Digital Systems Benchmark Suite for LLM Driven RTL Synthesis	Suresh Purini, Siddhant Garg, Mudit Gaur, Sankalp Bhat, Sohan Mupparapu, Arun Ravindran	2025-08-08	下载	Modern SoC datapaths include deeply pipelined, domain-specific accelerators, but their RTL implementation and verification are still mostly done by hand.
MAHL: Multi-Agent LLM-Guided Hierarchical Chiplet Design with Adaptive Debugging	Jinwei Tang, Jiayin Qin, Nuo Xu, Pragnya Sudershan Nalla, Yu Cao, Yang, Zhao, Caiwen Ding	2025-08-08	下载	As program workloads (e.g., AI) increase in size and algorithmic complexity, the primary challenge lies in their high dimensionality, encompassing computing cores, array sizes, and memory hierarchies.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
An Incentive-Compatible Semi-Parallel Proof-of-Work Protocol	Mustafa Doger, Sennur Ulukus	2025-08-08	下载	Parallel Proof-of-Work (PoW) protocols have been suggested in the literature to improve the safety guarantees, transaction throughput and confirmation latencies of Nakamoto consensus.
Blockchain-Enabled Federated Learning	Murtaza Rangwala, KR Venugopal, Rajkumar Buyya	2025-08-08	下载	Blockchain-enabled federated learning (BCFL) addresses fundamental challenges of trust, privacy, and coordination in collaborative AI systems.
Performant Unified GPU Kernels for Portable Singular Value Computation Across Hardware and Precision	Evelyne Ringoot, Rabab Alomairy, Valentin Churavy, Alan Edelman	2025-08-08	下载	This paper presents a portable, GPU-accelerated implementation of a QR-based singular value computation algorithm in Julia. The singular value ecomposition (SVD) is a fundamental numerical tool in sci...
FedMeNF: Privacy-Preserving Federated Meta-Learning for Neural Fields	Junhyeog Yun, Minui Hong, Gunhee Kim	2025-08-08	下载	Neural fields provide a memory-efficient representation of data, which can effectively handle diverse modalities and large-scale data. However, learning to map neural fields often requires large amoun...
KV Cache Compression for Inference Efficiency in LLMs: A Review	Yanyu Liu, Jingying Fu, Sixiang Liu, Yitian Zou, You Fu, Jiehan Zhou, Shouhua Zhang	2025-08-08	下载	Withtherapid advancement of large language models (LLMs), the context length for inference has been continuously increasing, leading to an exponential growth in the demand for Key-Value (KV) caching.
Performance measurements of modern Fortran MPI applications with Score-P	Gregor Corbin	2025-08-08	下载	Version 3.0 of the Message-Passing Interface (MPI) standard, released in 2012, introduced a new set of language bindings for Fortran 2008. By making use of modern language features and the enhanced in...
EC2MoE: Adaptive End-Cloud Pipeline Collaboration Enabling Scalable Mixture-of-Experts Inference	Zheming Yang, Yunqing Hu, Sheng Sun, Wen Ji	2025-08-08	下载	The Mixture-of-Experts (MoE) paradigm has emerged as a promising solution to scale up model capacity while maintaining inference efficiency. However, deploying MoE models across heterogeneous end-clou...
KnapFormer: An Online Load Balancer for Efficient Diffusion Transformers Training	Kai Zhang, Peng Wang, Sai Bi, Jianming Zhang, Yuanjun Xiong	2025-08-08	下载	We present KnapFormer, an efficient and versatile framework to combine workload balancing and sequence parallelism in distributed training of Diffusion Transformers (DiT).

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Generative AI for Intent-Driven Network Management in 6G RAN: A Case Study on the Mamba Model	Md Arafat Habib, Medhat Elsayed, Yigit Ozcan, Pedro Enrique Iturria-Rivera, Majid Bavand, Melike Erol-Kantarci	2025-08-08	下载	With the emergence of 6G, mobile networks are becoming increasingly heterogeneous and dynamic, necessitating advanced automation for efficient management.
Iris RESTful Server and IrisTileSource: An Iris implementation for existing OpenSeaDragon viewers	Ryan Erik Landvater MD, Navin Kathawa, Mustafa Yousif MD, Ulysses Balis MD	2025-08-08	下载	The Iris File Extension (IFE) is a low overhead performance-oriented whole slide image (WSI) file format designed to improve the image rendering experience for pathologists and simplify image manageme...
An Online Multi-dimensional Knapsack Approach for Slice Admission Control	Jesutofunmi Ajayi, Antonio Di Maio, Torsten Braun, Dimitrios Xenakis	2025-08-08	下载	Network Slicing has emerged as a powerful technique to enable cost-effective, multi-tenant communications and services over a shared physical mobile network infrastructure.
Hierarchical Placement Learning for Network Slice Provisioning	Jesutofunmi Ajayi, Antonio Di Maio, Torsten Braun	2025-08-08	下载	In this work, we aim to address the challenge of slice provisioning in edge-based mobile networks. We propose a solution that learns a service function chain placement policy for Network Slice Request...
MX-AI: Agentic Observability and Control Platform for Open and AI-RAN	Ilias Chatzistefanidis, Andrea Leone, Ali Yaghoubian, Mikel Irazabal, Sehad Nassim, Lina Bariah, Merouane Debbah, Navid Nikaein	2025-08-08	下载	Future 6G radio access networks (RANs) will be artificial intelligence (AI)-native: observed, reasoned about, and re-configured by autonomous agents cooperating across the cloud-edge continuum.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Pushing the Envelope of LLM Inference on AI-PC and Intel GPUs	Evangelos Georganas, Dhiraj Kalamkar, Alexander Heinecke	2025-08-08	下载	The advent of ultra-low-bit LLM models (1/1.58/2-bit), which match the perplexity and end-task performance of their full-precision counterparts using the same model size, is ushering in a new era of L...
Generalizing Scaling Laws for Dense and Sparse Large Language Models	Md Arafat Hossain, Xingfu Wu, Valerie Taylor, Ali Jannesari	2025-08-08	下载	Despite recent advancements of large language models (LLMs), optimally predicting the model size for LLM pretraining or allocating optimal resources still remains a challenge.
Performance measurements of modern Fortran MPI applications with Score-P	Gregor Corbin	2025-08-08	下载	Version 3.0 of the Message-Passing Interface (MPI) standard, released in 2012, introduced a new set of language bindings for Fortran 2008. By making use of modern language features and the enhanced in...