Skip to content

2025-11-18

cs.AR - Architecture

标题作者发布日期PDF摘要
A Bit Level Weight Reordering Strategy Based on Column Similarity to Explore Weight Sparsity in RRAM-based NN AcceleratorWeiping Yang, Shilin Zhou, Hui Xu, Yujiao Nie, Qimin Zhou, Zhiwei Li, Changlin Chen2025-11-18下载Compute-in-Memory (CIM) and weight sparsity are two effective techniques to reduce data movement during Neural Network (NN) inference. However, they can hardly be employed in the same accelerator simu...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
A Graph-Based, Distributed Memory, Modeling Abstraction for OptimizationDavid L. Cole, Jordan Jalving, Jonah Langlieb, Jesse D. Jenkins2025-11-18下载We present a general, flexible modeling abstraction for building and working with distributed optimization problems called a RemoteOptiGraph. This abstraction extends the OptiGraph model in Plasmo$.
AI-driven Predictive Shard Allocation for Scalable Next Generation BlockchainsM. Zeeshan Haider, Tayyaba Noreen, M. D. Assuncao, Kaiwen Zhang2025-11-18下载Sharding has emerged as a key technique to address blockchain scalability by partitioning the ledger into multiple shards that process transactions in parallel.
PolyKAN: Efficient Fused GPU Operators for Polynomial Kolmogorov-Arnold Network VariantsMingkun Yu, Heming Zhong, Dan Huang, Yutong Lu, Jiazhi Jiang2025-11-18下载Kolmogorov-Arnold Networks (KANs) promise higher expressive capability and stronger interpretability than Multi-Layer Perceptron, particularly in the domain of AI for Science.
FLARE: Adaptive Multi-Dimensional Reputation for Robust Client Reliability in Federated LearningAbolfazl Younesi, Leon Kiss, Zahra Najafabadi Samani, Juan Aznar Poveda, Thomas Fahringer2025-11-18下载Federated learning (FL) enables collaborative model training while preserving data privacy. However, it remains vulnerable to malicious clients who compromise model integrity through Byzantine attacks...
Energy-Efficient Resource Management in Microservices-based Fog and Edge Computing: State-of-the-Art and Future DirectionsAli Akbar Vali, Sadoon Azizi, Mohammad Shojafar, Rajkumar Buyya2025-11-18下载The exponential growth of Internet of Things (IoT) devices has intensified the demand for efficient and responsive services. To address this demand, fog and edge computing have emerged as distributed ...
Multi-GPU Quantum Circuit Simulation and the Impact of Network PerformanceW. Michael Brown, Anurag Ramesh, Thomas Lubinski, Thien Nguyen, David E. Bernal Neira2025-11-18下载As is intrinsic to the fundamental goal of quantum computing, classical simulation of quantum algorithms is notoriously demanding in resource requirements.
Seer: Online Context Learning for Fast Synchronous LLM Reinforcement LearningRuoyu Qin, Weiran He, Weixiao Huang, Yangkun Zhang, Yikai Zhao, Bo Pang, Xinran Xu, Yingdi Shan, Yongwei Wu, Mingxing Zhang2025-11-18下载Reinforcement Learning (RL) has emerged as a critical technique for advancing modern Large Language Models (LLMs), yet existing synchronous RL systems face severe performance bottlenecks.
Hapax Locks : Value-Based Mutual ExclusionDave Dice, Alex Kogan2025-11-18下载We present Hapax Locks, a novel locking algorithm that is simple, enjoys constant-time arrival and unlock paths, provides FIFO admission order, and which is also space efficient and generates relative...
Overview and Prospects of Using Integer Surrogate Keys for Data Warehouse Performance OptimizationSviatoslav Stumpf, Vladislav Povyshev2025-11-18下载The aim of this paper is to examine and demonstrate how integer-based datetime labels (integer surrogate keys for time) can optimize data-warehouse and time-series performance, proposing practical for...
Analyzing the Impact of Participant Failures in Cross-Silo Federated LearningFabian Stricker, David Bermbach, Christian Zirpins2025-11-18下载Federated learning (FL) is a new paradigm for training machine learning (ML) models without sharing data. While applying FL in cross-silo scenarios, where organizations collaborate, it is necessary th...
Hyperion: Hierarchical Scheduling for Parallel LLM Acceleration in Multi-tier NetworksMulei Ma, Xinyi Xu, Minrui Xu, Zihan Chen, Yang Yang, Tony Q. S. Quek2025-11-18下载LLMs are increasingly executed in edge where limited GPU memory and heterogeneous computation jointly constrain deployment which motivates model partitioning and request scheduling.
10Cache: Heterogeneous Resource-Aware Tensor Caching and Migration for LLM TrainingSabiha Afroz, Redwan Ibne Seraj Khan, Hadeel Albahar, Jingoo Han, Ali R. Butt2025-11-18下载Training large language models (LLMs) in the cloud faces growing memory bottlenecks due to the limited capacity and high cost of GPUs. While GPU memory offloading to CPU and NVMe has made large-scale ...
FailSafe: High-performance Resilient ServingZiyi Xu, Zhiqiang Xie, Swapnil Gandhi, Christos Kozyrakis2025-11-18下载Tensor parallelism (TP) enables large language models (LLMs) to scale inference efficiently across multiple GPUs, but its tight coupling makes systems fragile: a single GPU failure can halt execution,...
MoE-SpeQ: Speculative Quantized Decoding with Proactive Expert Prefetching and Offloading for Mixture-of-ExpertsWenfeng Wang, Jiacheng Liu, Xiaofeng Hou, Xinfeng Xia, Peng Tang, Mingxuan Zhang, Chao Li, Minyi Guo2025-11-18下载The immense memory requirements of state-of-the-art Mixture-of-Experts (MoE) models present a significant challenge for inference, often exceeding the capacity of a single accelerator.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
RAID: In-Network RA Signaling Storm Detection for 5G Open RANMohamed Rouili, Yang Xiao, Sihang Liu, Raouf Boutaba2025-11-18下载The disaggregation and virtualization of 5G Open RAN (O-RAN) introduces new vulnerabilities in the control plane that can greatly impact the quality of service (QoS) of latency-sensitive 5G applicatio...
Evaluating the Impact of Packet Scheduling and Congestion Control Algorithms on MPTCP Performance over Heterogeneous NetworksDimitrios Dimopoulos, Apostolis K. Salkintzis, Dimitris Tsolkas, Nikos Passas, Lazaros Merakos2025-11-18下载Modern mobile and stationary devices are equipped with multiple network interfaces aiming to provide wireless and wireline connectivity either in a local LAN or the Internet.
From Topology to Behavioral Semantics: Enhancing BGP Security by Understanding BGP's Language with LLMsHeng Zhao, Ruoyu Wang, Tianhang Zheng, Qi Li, Bo Lv, Yuyi Wang, Wenliang Du2025-11-18下载The trust-based nature of Border Gateway Protocol (BGP) makes it vulnerable to disruptions like prefix hijacking and misconfigurations, threatening routing stability.
Cracking the Microsecond: An Efficient and Precise Time Synchronization Scheme for Hybrid 5G-TSN NetworksMichael Gundall, Hans D. Schotten2025-11-18下载Achieving precise time synchronization in wireless systems is essential for both industrial applications and 5G, where sub-microsecond accuracy is required.
Benchmarking OpenWiFiSync on ESP32: Towards Cost-Effective Wireless Time SynchronizationMichael Gundall, Jan Herbst, Robin Müller, Hans D. Schotten2025-11-18下载Wireless time synchronization of mobile devices is a key enabler for numerous Industry 4.0 applications, such as coordinated and synchronized tasks or the generation of high-precision timestamps for m...

cs.PF - Performance

标题作者发布日期PDF摘要
Evaluating the Impact of Packet Scheduling and Congestion Control Algorithms on MPTCP Performance over Heterogeneous NetworksDimitrios Dimopoulos, Apostolis K. Salkintzis, Dimitris Tsolkas, Nikos Passas, Lazaros Merakos2025-11-18下载Modern mobile and stationary devices are equipped with multiple network interfaces aiming to provide wireless and wireline connectivity either in a local LAN or the Internet.
PIM or CXL-PIM? Understanding Architectural Trade-offs Through Large-Scale BenchmarkingI-Ting Lee, Bao-Kai Wang, Liang-Chi Chen, Wen Sheng Lim, Da-Wei Chang, Yu-Ming Chang, Chieng-Chung Ho2025-11-18下载Processing-in-memory (PIM) reduces data movement by executing near memory, but our large-scale characterization on real PIM hardware shows that end-to-end performance is often limited by disjoint host...

基于 VitePress 构建