2025-11-18

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
A Bit Level Weight Reordering Strategy Based on Column Similarity to Explore Weight Sparsity in RRAM-based NN Accelerator	Weiping Yang, Shilin Zhou, Hui Xu, Yujiao Nie, Qimin Zhou, Zhiwei Li, Changlin Chen	2025-11-18	下载	Compute-in-Memory (CIM) and weight sparsity are two effective techniques to reduce data movement during Neural Network (NN) inference. However, they can hardly be employed in the same accelerator simu...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
A Graph-Based, Distributed Memory, Modeling Abstraction for Optimization	David L. Cole, Jordan Jalving, Jonah Langlieb, Jesse D. Jenkins	2025-11-18	下载	We present a general, flexible modeling abstraction for building and working with distributed optimization problems called a RemoteOptiGraph. This abstraction extends the OptiGraph model in Plasmo$.
AI-driven Predictive Shard Allocation for Scalable Next Generation Blockchains	M. Zeeshan Haider, Tayyaba Noreen, M. D. Assuncao, Kaiwen Zhang	2025-11-18	下载	Sharding has emerged as a key technique to address blockchain scalability by partitioning the ledger into multiple shards that process transactions in parallel.
PolyKAN: Efficient Fused GPU Operators for Polynomial Kolmogorov-Arnold Network Variants	Mingkun Yu, Heming Zhong, Dan Huang, Yutong Lu, Jiazhi Jiang	2025-11-18	下载	Kolmogorov-Arnold Networks (KANs) promise higher expressive capability and stronger interpretability than Multi-Layer Perceptron, particularly in the domain of AI for Science.
FLARE: Adaptive Multi-Dimensional Reputation for Robust Client Reliability in Federated Learning	Abolfazl Younesi, Leon Kiss, Zahra Najafabadi Samani, Juan Aznar Poveda, Thomas Fahringer	2025-11-18	下载	Federated learning (FL) enables collaborative model training while preserving data privacy. However, it remains vulnerable to malicious clients who compromise model integrity through Byzantine attacks...
Energy-Efficient Resource Management in Microservices-based Fog and Edge Computing: State-of-the-Art and Future Directions	Ali Akbar Vali, Sadoon Azizi, Mohammad Shojafar, Rajkumar Buyya	2025-11-18	下载	The exponential growth of Internet of Things (IoT) devices has intensified the demand for efficient and responsive services. To address this demand, fog and edge computing have emerged as distributed ...
Multi-GPU Quantum Circuit Simulation and the Impact of Network Performance	W. Michael Brown, Anurag Ramesh, Thomas Lubinski, Thien Nguyen, David E. Bernal Neira	2025-11-18	下载	As is intrinsic to the fundamental goal of quantum computing, classical simulation of quantum algorithms is notoriously demanding in resource requirements.
Seer: Online Context Learning for Fast Synchronous LLM Reinforcement Learning	Ruoyu Qin, Weiran He, Weixiao Huang, Yangkun Zhang, Yikai Zhao, Bo Pang, Xinran Xu, Yingdi Shan, Yongwei Wu, Mingxing Zhang	2025-11-18	下载	Reinforcement Learning (RL) has emerged as a critical technique for advancing modern Large Language Models (LLMs), yet existing synchronous RL systems face severe performance bottlenecks.
Hapax Locks : Value-Based Mutual Exclusion	Dave Dice, Alex Kogan	2025-11-18	下载	We present Hapax Locks, a novel locking algorithm that is simple, enjoys constant-time arrival and unlock paths, provides FIFO admission order, and which is also space efficient and generates relative...
Overview and Prospects of Using Integer Surrogate Keys for Data Warehouse Performance Optimization	Sviatoslav Stumpf, Vladislav Povyshev	2025-11-18	下载	The aim of this paper is to examine and demonstrate how integer-based datetime labels (integer surrogate keys for time) can optimize data-warehouse and time-series performance, proposing practical for...
Analyzing the Impact of Participant Failures in Cross-Silo Federated Learning	Fabian Stricker, David Bermbach, Christian Zirpins	2025-11-18	下载	Federated learning (FL) is a new paradigm for training machine learning (ML) models without sharing data. While applying FL in cross-silo scenarios, where organizations collaborate, it is necessary th...
Hyperion: Hierarchical Scheduling for Parallel LLM Acceleration in Multi-tier Networks	Mulei Ma, Xinyi Xu, Minrui Xu, Zihan Chen, Yang Yang, Tony Q. S. Quek	2025-11-18	下载	LLMs are increasingly executed in edge where limited GPU memory and heterogeneous computation jointly constrain deployment which motivates model partitioning and request scheduling.
10Cache: Heterogeneous Resource-Aware Tensor Caching and Migration for LLM Training	Sabiha Afroz, Redwan Ibne Seraj Khan, Hadeel Albahar, Jingoo Han, Ali R. Butt	2025-11-18	下载	Training large language models (LLMs) in the cloud faces growing memory bottlenecks due to the limited capacity and high cost of GPUs. While GPU memory offloading to CPU and NVMe has made large-scale ...
FailSafe: High-performance Resilient Serving	Ziyi Xu, Zhiqiang Xie, Swapnil Gandhi, Christos Kozyrakis	2025-11-18	下载	Tensor parallelism (TP) enables large language models (LLMs) to scale inference efficiently across multiple GPUs, but its tight coupling makes systems fragile: a single GPU failure can halt execution,...
MoE-SpeQ: Speculative Quantized Decoding with Proactive Expert Prefetching and Offloading for Mixture-of-Experts	Wenfeng Wang, Jiacheng Liu, Xiaofeng Hou, Xinfeng Xia, Peng Tang, Mingxuan Zhang, Chao Li, Minyi Guo	2025-11-18	下载	The immense memory requirements of state-of-the-art Mixture-of-Experts (MoE) models present a significant challenge for inference, often exceeding the capacity of a single accelerator.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
RAID: In-Network RA Signaling Storm Detection for 5G Open RAN	Mohamed Rouili, Yang Xiao, Sihang Liu, Raouf Boutaba	2025-11-18	下载	The disaggregation and virtualization of 5G Open RAN (O-RAN) introduces new vulnerabilities in the control plane that can greatly impact the quality of service (QoS) of latency-sensitive 5G applicatio...
Evaluating the Impact of Packet Scheduling and Congestion Control Algorithms on MPTCP Performance over Heterogeneous Networks	Dimitrios Dimopoulos, Apostolis K. Salkintzis, Dimitris Tsolkas, Nikos Passas, Lazaros Merakos	2025-11-18	下载	Modern mobile and stationary devices are equipped with multiple network interfaces aiming to provide wireless and wireline connectivity either in a local LAN or the Internet.
From Topology to Behavioral Semantics: Enhancing BGP Security by Understanding BGP's Language with LLMs	Heng Zhao, Ruoyu Wang, Tianhang Zheng, Qi Li, Bo Lv, Yuyi Wang, Wenliang Du	2025-11-18	下载	The trust-based nature of Border Gateway Protocol (BGP) makes it vulnerable to disruptions like prefix hijacking and misconfigurations, threatening routing stability.
Cracking the Microsecond: An Efficient and Precise Time Synchronization Scheme for Hybrid 5G-TSN Networks	Michael Gundall, Hans D. Schotten	2025-11-18	下载	Achieving precise time synchronization in wireless systems is essential for both industrial applications and 5G, where sub-microsecond accuracy is required.
Benchmarking OpenWiFiSync on ESP32: Towards Cost-Effective Wireless Time Synchronization	Michael Gundall, Jan Herbst, Robin Müller, Hans D. Schotten	2025-11-18	下载	Wireless time synchronization of mobile devices is a key enabler for numerous Industry 4.0 applications, such as coordinated and synchronized tasks or the generation of high-precision timestamps for m...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Evaluating the Impact of Packet Scheduling and Congestion Control Algorithms on MPTCP Performance over Heterogeneous Networks	Dimitrios Dimopoulos, Apostolis K. Salkintzis, Dimitris Tsolkas, Nikos Passas, Lazaros Merakos	2025-11-18	下载	Modern mobile and stationary devices are equipped with multiple network interfaces aiming to provide wireless and wireline connectivity either in a local LAN or the Internet.
PIM or CXL-PIM? Understanding Architectural Trade-offs Through Large-Scale Benchmarking	I-Ting Lee, Bao-Kai Wang, Liang-Chi Chen, Wen Sheng Lim, Da-Wei Chang, Yu-Ming Chang, Chieng-Chung Ho	2025-11-18	下载	Processing-in-memory (PIM) reduces data movement by executing near memory, but our large-scale characterization on real PIM hardware shows that end-to-end performance is often limited by disjoint host...