Appearance
2025-11-18
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Bit Level Weight Reordering Strategy Based on Column Similarity to Explore Weight Sparsity in RRAM-based NN Accelerator | Weiping Yang, Shilin Zhou, Hui Xu, Yujiao Nie, Qimin Zhou, Zhiwei Li, Changlin Chen | 2025-11-18 | 下载 | Compute-in-Memory (CIM) and weight sparsity are two effective techniques to reduce data movement during Neural Network (NN) inference. However, they can hardly be employed in the same accelerator simu... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Graph-Based, Distributed Memory, Modeling Abstraction for Optimization | David L. Cole, Jordan Jalving, Jonah Langlieb, Jesse D. Jenkins | 2025-11-18 | 下载 | We present a general, flexible modeling abstraction for building and working with distributed optimization problems called a RemoteOptiGraph. This abstraction extends the OptiGraph model in Plasmo$. |
| AI-driven Predictive Shard Allocation for Scalable Next Generation Blockchains | M. Zeeshan Haider, Tayyaba Noreen, M. D. Assuncao, Kaiwen Zhang | 2025-11-18 | 下载 | Sharding has emerged as a key technique to address blockchain scalability by partitioning the ledger into multiple shards that process transactions in parallel. |
| PolyKAN: Efficient Fused GPU Operators for Polynomial Kolmogorov-Arnold Network Variants | Mingkun Yu, Heming Zhong, Dan Huang, Yutong Lu, Jiazhi Jiang | 2025-11-18 | 下载 | Kolmogorov-Arnold Networks (KANs) promise higher expressive capability and stronger interpretability than Multi-Layer Perceptron, particularly in the domain of AI for Science. |
| FLARE: Adaptive Multi-Dimensional Reputation for Robust Client Reliability in Federated Learning | Abolfazl Younesi, Leon Kiss, Zahra Najafabadi Samani, Juan Aznar Poveda, Thomas Fahringer | 2025-11-18 | 下载 | Federated learning (FL) enables collaborative model training while preserving data privacy. However, it remains vulnerable to malicious clients who compromise model integrity through Byzantine attacks... |
| Energy-Efficient Resource Management in Microservices-based Fog and Edge Computing: State-of-the-Art and Future Directions | Ali Akbar Vali, Sadoon Azizi, Mohammad Shojafar, Rajkumar Buyya | 2025-11-18 | 下载 | The exponential growth of Internet of Things (IoT) devices has intensified the demand for efficient and responsive services. To address this demand, fog and edge computing have emerged as distributed ... |
| Multi-GPU Quantum Circuit Simulation and the Impact of Network Performance | W. Michael Brown, Anurag Ramesh, Thomas Lubinski, Thien Nguyen, David E. Bernal Neira | 2025-11-18 | 下载 | As is intrinsic to the fundamental goal of quantum computing, classical simulation of quantum algorithms is notoriously demanding in resource requirements. |
| Seer: Online Context Learning for Fast Synchronous LLM Reinforcement Learning | Ruoyu Qin, Weiran He, Weixiao Huang, Yangkun Zhang, Yikai Zhao, Bo Pang, Xinran Xu, Yingdi Shan, Yongwei Wu, Mingxing Zhang | 2025-11-18 | 下载 | Reinforcement Learning (RL) has emerged as a critical technique for advancing modern Large Language Models (LLMs), yet existing synchronous RL systems face severe performance bottlenecks. |
| Hapax Locks : Value-Based Mutual Exclusion | Dave Dice, Alex Kogan | 2025-11-18 | 下载 | We present Hapax Locks, a novel locking algorithm that is simple, enjoys constant-time arrival and unlock paths, provides FIFO admission order, and which is also space efficient and generates relative... |
| Overview and Prospects of Using Integer Surrogate Keys for Data Warehouse Performance Optimization | Sviatoslav Stumpf, Vladislav Povyshev | 2025-11-18 | 下载 | The aim of this paper is to examine and demonstrate how integer-based datetime labels (integer surrogate keys for time) can optimize data-warehouse and time-series performance, proposing practical for... |
| Analyzing the Impact of Participant Failures in Cross-Silo Federated Learning | Fabian Stricker, David Bermbach, Christian Zirpins | 2025-11-18 | 下载 | Federated learning (FL) is a new paradigm for training machine learning (ML) models without sharing data. While applying FL in cross-silo scenarios, where organizations collaborate, it is necessary th... |
| Hyperion: Hierarchical Scheduling for Parallel LLM Acceleration in Multi-tier Networks | Mulei Ma, Xinyi Xu, Minrui Xu, Zihan Chen, Yang Yang, Tony Q. S. Quek | 2025-11-18 | 下载 | LLMs are increasingly executed in edge where limited GPU memory and heterogeneous computation jointly constrain deployment which motivates model partitioning and request scheduling. |
| 10Cache: Heterogeneous Resource-Aware Tensor Caching and Migration for LLM Training | Sabiha Afroz, Redwan Ibne Seraj Khan, Hadeel Albahar, Jingoo Han, Ali R. Butt | 2025-11-18 | 下载 | Training large language models (LLMs) in the cloud faces growing memory bottlenecks due to the limited capacity and high cost of GPUs. While GPU memory offloading to CPU and NVMe has made large-scale ... |
| FailSafe: High-performance Resilient Serving | Ziyi Xu, Zhiqiang Xie, Swapnil Gandhi, Christos Kozyrakis | 2025-11-18 | 下载 | Tensor parallelism (TP) enables large language models (LLMs) to scale inference efficiently across multiple GPUs, but its tight coupling makes systems fragile: a single GPU failure can halt execution,... |
| MoE-SpeQ: Speculative Quantized Decoding with Proactive Expert Prefetching and Offloading for Mixture-of-Experts | Wenfeng Wang, Jiacheng Liu, Xiaofeng Hou, Xinfeng Xia, Peng Tang, Mingxuan Zhang, Chao Li, Minyi Guo | 2025-11-18 | 下载 | The immense memory requirements of state-of-the-art Mixture-of-Experts (MoE) models present a significant challenge for inference, often exceeding the capacity of a single accelerator. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| RAID: In-Network RA Signaling Storm Detection for 5G Open RAN | Mohamed Rouili, Yang Xiao, Sihang Liu, Raouf Boutaba | 2025-11-18 | 下载 | The disaggregation and virtualization of 5G Open RAN (O-RAN) introduces new vulnerabilities in the control plane that can greatly impact the quality of service (QoS) of latency-sensitive 5G applicatio... |
| Evaluating the Impact of Packet Scheduling and Congestion Control Algorithms on MPTCP Performance over Heterogeneous Networks | Dimitrios Dimopoulos, Apostolis K. Salkintzis, Dimitris Tsolkas, Nikos Passas, Lazaros Merakos | 2025-11-18 | 下载 | Modern mobile and stationary devices are equipped with multiple network interfaces aiming to provide wireless and wireline connectivity either in a local LAN or the Internet. |
| From Topology to Behavioral Semantics: Enhancing BGP Security by Understanding BGP's Language with LLMs | Heng Zhao, Ruoyu Wang, Tianhang Zheng, Qi Li, Bo Lv, Yuyi Wang, Wenliang Du | 2025-11-18 | 下载 | The trust-based nature of Border Gateway Protocol (BGP) makes it vulnerable to disruptions like prefix hijacking and misconfigurations, threatening routing stability. |
| Cracking the Microsecond: An Efficient and Precise Time Synchronization Scheme for Hybrid 5G-TSN Networks | Michael Gundall, Hans D. Schotten | 2025-11-18 | 下载 | Achieving precise time synchronization in wireless systems is essential for both industrial applications and 5G, where sub-microsecond accuracy is required. |
| Benchmarking OpenWiFiSync on ESP32: Towards Cost-Effective Wireless Time Synchronization | Michael Gundall, Jan Herbst, Robin Müller, Hans D. Schotten | 2025-11-18 | 下载 | Wireless time synchronization of mobile devices is a key enabler for numerous Industry 4.0 applications, such as coordinated and synchronized tasks or the generation of high-precision timestamps for m... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Evaluating the Impact of Packet Scheduling and Congestion Control Algorithms on MPTCP Performance over Heterogeneous Networks | Dimitrios Dimopoulos, Apostolis K. Salkintzis, Dimitris Tsolkas, Nikos Passas, Lazaros Merakos | 2025-11-18 | 下载 | Modern mobile and stationary devices are equipped with multiple network interfaces aiming to provide wireless and wireline connectivity either in a local LAN or the Internet. |
| PIM or CXL-PIM? Understanding Architectural Trade-offs Through Large-Scale Benchmarking | I-Ting Lee, Bao-Kai Wang, Liang-Chi Chen, Wen Sheng Lim, Da-Wei Chang, Yu-Ming Chang, Chieng-Chung Ho | 2025-11-18 | 下载 | Processing-in-memory (PIM) reduces data movement by executing near memory, but our large-scale characterization on real PIM hardware shows that end-to-end performance is often limited by disjoint host... |