Skip to content

2025-08-08

cs.AR - Architecture

标题作者发布日期PDF摘要
Nail: Not Another Fault-Injection Framework for Chisel-generated RTLRobin Sehm, Christian Ewert, Rainer Buchty, Mladen Berekovic, Saleh Mulhem2025-08-08下载Fault simulation and emulation are essential techniques for evaluating the dependability of integrated circuits, enabling early-stage vulnerability analysis and supporting the implementation of effect...
ArchXBench: A Complex Digital Systems Benchmark Suite for LLM Driven RTL SynthesisSuresh Purini, Siddhant Garg, Mudit Gaur, Sankalp Bhat, Sohan Mupparapu, Arun Ravindran2025-08-08下载Modern SoC datapaths include deeply pipelined, domain-specific accelerators, but their RTL implementation and verification are still mostly done by hand.
MAHL: Multi-Agent LLM-Guided Hierarchical Chiplet Design with Adaptive DebuggingJinwei Tang, Jiayin Qin, Nuo Xu, Pragnya Sudershan Nalla, Yu Cao, Yang, Zhao, Caiwen Ding2025-08-08下载As program workloads (e.g., AI) increase in size and algorithmic complexity, the primary challenge lies in their high dimensionality, encompassing computing cores, array sizes, and memory hierarchies.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
An Incentive-Compatible Semi-Parallel Proof-of-Work ProtocolMustafa Doger, Sennur Ulukus2025-08-08下载Parallel Proof-of-Work (PoW) protocols have been suggested in the literature to improve the safety guarantees, transaction throughput and confirmation latencies of Nakamoto consensus.
Blockchain-Enabled Federated LearningMurtaza Rangwala, KR Venugopal, Rajkumar Buyya2025-08-08下载Blockchain-enabled federated learning (BCFL) addresses fundamental challenges of trust, privacy, and coordination in collaborative AI systems.
Performant Unified GPU Kernels for Portable Singular Value Computation Across Hardware and PrecisionEvelyne Ringoot, Rabab Alomairy, Valentin Churavy, Alan Edelman2025-08-08下载This paper presents a portable, GPU-accelerated implementation of a QR-based singular value computation algorithm in Julia. The singular value ecomposition (SVD) is a fundamental numerical tool in sci...
FedMeNF: Privacy-Preserving Federated Meta-Learning for Neural FieldsJunhyeog Yun, Minui Hong, Gunhee Kim2025-08-08下载Neural fields provide a memory-efficient representation of data, which can effectively handle diverse modalities and large-scale data. However, learning to map neural fields often requires large amoun...
KV Cache Compression for Inference Efficiency in LLMs: A ReviewYanyu Liu, Jingying Fu, Sixiang Liu, Yitian Zou, You Fu, Jiehan Zhou, Shouhua Zhang2025-08-08下载Withtherapid advancement of large language models (LLMs), the context length for inference has been continuously increasing, leading to an exponential growth in the demand for Key-Value (KV) caching.
Performance measurements of modern Fortran MPI applications with Score-PGregor Corbin2025-08-08下载Version 3.0 of the Message-Passing Interface (MPI) standard, released in 2012, introduced a new set of language bindings for Fortran 2008. By making use of modern language features and the enhanced in...
EC2MoE: Adaptive End-Cloud Pipeline Collaboration Enabling Scalable Mixture-of-Experts InferenceZheming Yang, Yunqing Hu, Sheng Sun, Wen Ji2025-08-08下载The Mixture-of-Experts (MoE) paradigm has emerged as a promising solution to scale up model capacity while maintaining inference efficiency. However, deploying MoE models across heterogeneous end-clou...
KnapFormer: An Online Load Balancer for Efficient Diffusion Transformers TrainingKai Zhang, Peng Wang, Sai Bi, Jianming Zhang, Yuanjun Xiong2025-08-08下载We present KnapFormer, an efficient and versatile framework to combine workload balancing and sequence parallelism in distributed training of Diffusion Transformers (DiT).

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Generative AI for Intent-Driven Network Management in 6G RAN: A Case Study on the Mamba ModelMd Arafat Habib, Medhat Elsayed, Yigit Ozcan, Pedro Enrique Iturria-Rivera, Majid Bavand, Melike Erol-Kantarci2025-08-08下载With the emergence of 6G, mobile networks are becoming increasingly heterogeneous and dynamic, necessitating advanced automation for efficient management.
Iris RESTful Server and IrisTileSource: An Iris implementation for existing OpenSeaDragon viewersRyan Erik Landvater MD, Navin Kathawa, Mustafa Yousif MD, Ulysses Balis MD2025-08-08下载The Iris File Extension (IFE) is a low overhead performance-oriented whole slide image (WSI) file format designed to improve the image rendering experience for pathologists and simplify image manageme...
An Online Multi-dimensional Knapsack Approach for Slice Admission ControlJesutofunmi Ajayi, Antonio Di Maio, Torsten Braun, Dimitrios Xenakis2025-08-08下载Network Slicing has emerged as a powerful technique to enable cost-effective, multi-tenant communications and services over a shared physical mobile network infrastructure.
Hierarchical Placement Learning for Network Slice ProvisioningJesutofunmi Ajayi, Antonio Di Maio, Torsten Braun2025-08-08下载In this work, we aim to address the challenge of slice provisioning in edge-based mobile networks. We propose a solution that learns a service function chain placement policy for Network Slice Request...
MX-AI: Agentic Observability and Control Platform for Open and AI-RANIlias Chatzistefanidis, Andrea Leone, Ali Yaghoubian, Mikel Irazabal, Sehad Nassim, Lina Bariah, Merouane Debbah, Navid Nikaein2025-08-08下载Future 6G radio access networks (RANs) will be artificial intelligence (AI)-native: observed, reasoned about, and re-configured by autonomous agents cooperating across the cloud-edge continuum.

cs.PF - Performance

标题作者发布日期PDF摘要
Pushing the Envelope of LLM Inference on AI-PC and Intel GPUsEvangelos Georganas, Dhiraj Kalamkar, Alexander Heinecke2025-08-08下载The advent of ultra-low-bit LLM models (1/1.58/2-bit), which match the perplexity and end-task performance of their full-precision counterparts using the same model size, is ushering in a new era of L...
Generalizing Scaling Laws for Dense and Sparse Large Language ModelsMd Arafat Hossain, Xingfu Wu, Valerie Taylor, Ali Jannesari2025-08-08下载Despite recent advancements of large language models (LLMs), optimally predicting the model size for LLM pretraining or allocating optimal resources still remains a challenge.
Performance measurements of modern Fortran MPI applications with Score-PGregor Corbin2025-08-08下载Version 3.0 of the Message-Passing Interface (MPI) standard, released in 2012, introduced a new set of language bindings for Fortran 2008. By making use of modern language features and the enhanced in...

基于 VitePress 构建