2025-08-08
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Nail: Not Another Fault-Injection Framework for Chisel-generated RTL | Robin Sehm, Christian Ewert, Rainer Buchty, Mladen Berekovic, Saleh Mulhem | 2025-08-08 | Download | Fault simulation and emulation are essential techniques for evaluating the dependability of integrated circuits, enabling early-stage vulnerability analysis and supporting the implementation of effect... |
| ArchXBench: A Complex Digital Systems Benchmark Suite for LLM Driven RTL Synthesis | Suresh Purini, Siddhant Garg, Mudit Gaur, Sankalp Bhat, Sohan Mupparapu, Arun Ravindran | 2025-08-08 | Download | Modern SoC datapaths include deeply pipelined, domain-specific accelerators, but their RTL implementation and verification are still mostly done by hand. |
| MAHL: Multi-Agent LLM-Guided Hierarchical Chiplet Design with Adaptive Debugging | Jinwei Tang, Jiayin Qin, Nuo Xu, Pragnya Sudershan Nalla, Yu Cao, Yang Zhao, Caiwen Ding | 2025-08-08 | Download | As program workloads (e.g., AI) increase in size and algorithmic complexity, the primary challenge lies in their high dimensionality, encompassing computing cores, array sizes, and memory hierarchies. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| An Incentive-Compatible Semi-Parallel Proof-of-Work Protocol | Mustafa Doger, Sennur Ulukus | 2025-08-08 | Download | Parallel Proof-of-Work (PoW) protocols have been suggested in the literature to improve the safety guarantees, transaction throughput and confirmation latencies of Nakamoto consensus. |
| Blockchain-Enabled Federated Learning | Murtaza Rangwala, KR Venugopal, Rajkumar Buyya | 2025-08-08 | Download | Blockchain-enabled federated learning (BCFL) addresses fundamental challenges of trust, privacy, and coordination in collaborative AI systems. |
| Performant Unified GPU Kernels for Portable Singular Value Computation Across Hardware and Precision | Evelyne Ringoot, Rabab Alomairy, Valentin Churavy, Alan Edelman | 2025-08-08 | Download | This paper presents a portable, GPU-accelerated implementation of a QR-based singular value computation algorithm in Julia. The singular value decomposition (SVD) is a fundamental numerical tool in sci... |
| FedMeNF: Privacy-Preserving Federated Meta-Learning for Neural Fields | Junhyeog Yun, Minui Hong, Gunhee Kim | 2025-08-08 | Download | Neural fields provide a memory-efficient representation of data, which can effectively handle diverse modalities and large-scale data. However, learning to map neural fields often requires large amoun... |
| KV Cache Compression for Inference Efficiency in LLMs: A Review | Yanyu Liu, Jingying Fu, Sixiang Liu, Yitian Zou, You Fu, Jiehan Zhou, Shouhua Zhang | 2025-08-08 | Download | With the rapid advancement of large language models (LLMs), the context length for inference has been continuously increasing, leading to an exponential growth in the demand for Key-Value (KV) caching. |
| Performance measurements of modern Fortran MPI applications with Score-P | Gregor Corbin | 2025-08-08 | Download | Version 3.0 of the Message-Passing Interface (MPI) standard, released in 2012, introduced a new set of language bindings for Fortran 2008. By making use of modern language features and the enhanced in... |
| EC2MoE: Adaptive End-Cloud Pipeline Collaboration Enabling Scalable Mixture-of-Experts Inference | Zheming Yang, Yunqing Hu, Sheng Sun, Wen Ji | 2025-08-08 | Download | The Mixture-of-Experts (MoE) paradigm has emerged as a promising solution to scale up model capacity while maintaining inference efficiency. However, deploying MoE models across heterogeneous end-clou... |
| KnapFormer: An Online Load Balancer for Efficient Diffusion Transformers Training | Kai Zhang, Peng Wang, Sai Bi, Jianming Zhang, Yuanjun Xiong | 2025-08-08 | Download | We present KnapFormer, an efficient and versatile framework to combine workload balancing and sequence parallelism in distributed training of Diffusion Transformers (DiT). |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Generative AI for Intent-Driven Network Management in 6G RAN: A Case Study on the Mamba Model | Md Arafat Habib, Medhat Elsayed, Yigit Ozcan, Pedro Enrique Iturria-Rivera, Majid Bavand, Melike Erol-Kantarci | 2025-08-08 | Download | With the emergence of 6G, mobile networks are becoming increasingly heterogeneous and dynamic, necessitating advanced automation for efficient management. |
| Iris RESTful Server and IrisTileSource: An Iris implementation for existing OpenSeaDragon viewers | Ryan Erik Landvater MD, Navin Kathawa, Mustafa Yousif MD, Ulysses Balis MD | 2025-08-08 | Download | The Iris File Extension (IFE) is a low overhead performance-oriented whole slide image (WSI) file format designed to improve the image rendering experience for pathologists and simplify image manageme... |
| An Online Multi-dimensional Knapsack Approach for Slice Admission Control | Jesutofunmi Ajayi, Antonio Di Maio, Torsten Braun, Dimitrios Xenakis | 2025-08-08 | Download | Network Slicing has emerged as a powerful technique to enable cost-effective, multi-tenant communications and services over a shared physical mobile network infrastructure. |
| Hierarchical Placement Learning for Network Slice Provisioning | Jesutofunmi Ajayi, Antonio Di Maio, Torsten Braun | 2025-08-08 | Download | In this work, we aim to address the challenge of slice provisioning in edge-based mobile networks. We propose a solution that learns a service function chain placement policy for Network Slice Request... |
| MX-AI: Agentic Observability and Control Platform for Open and AI-RAN | Ilias Chatzistefanidis, Andrea Leone, Ali Yaghoubian, Mikel Irazabal, Sehad Nassim, Lina Bariah, Merouane Debbah, Navid Nikaein | 2025-08-08 | Download | Future 6G radio access networks (RANs) will be artificial intelligence (AI)-native: observed, reasoned about, and re-configured by autonomous agents cooperating across the cloud-edge continuum. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Pushing the Envelope of LLM Inference on AI-PC and Intel GPUs | Evangelos Georganas, Dhiraj Kalamkar, Alexander Heinecke | 2025-08-08 | Download | The advent of ultra-low-bit LLM models (1/1.58/2-bit), which match the perplexity and end-task performance of their full-precision counterparts using the same model size, is ushering in a new era of L... |
| Generalizing Scaling Laws for Dense and Sparse Large Language Models | Md Arafat Hossain, Xingfu Wu, Valerie Taylor, Ali Jannesari | 2025-08-08 | Download | Despite recent advancements of large language models (LLMs), optimally predicting the model size for LLM pretraining or allocating optimal resources still remains a challenge. |
| Performance measurements of modern Fortran MPI applications with Score-P | Gregor Corbin | 2025-08-08 | Download | Version 3.0 of the Message-Passing Interface (MPI) standard, released in 2012, introduced a new set of language bindings for Fortran 2008. By making use of modern language features and the enhanced in... |