2025-04-03

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
Unlocking the AMD Neural Processing Unit for ML Training on the Client Using Bare-Metal-Programming Tools	André Rösti, Michael Franz	2025-04-03	Download	There has been a growing interest in executing machine learning (ML) workloads on the client side for reasons of customizability, privacy, performance, and availability.
ARCANE: Adaptive RISC-V Cache Architecture for Near-memory Extensions	Vincenzo Petrolo, Flavia Guella, Michele Caon, Pasquale Davide Schiavone, Guido Masera, Maurizio Martina	2025-04-03	Download	Modern data-driven applications expose limitations of von Neumann architectures - extensive data movement, low throughput, and poor energy efficiency.
Exploring energy consumption of AI frameworks on a 64-core RV64 Server CPU	Giulio Malenza, Francesco Targa, Adriano Marques Garcia, Marco Aldinucci, Robert Birke	2025-04-03	Download	In today's era of rapid technological advancement, artificial intelligence (AI) applications require large-scale, high-performance, and data-intensive computations, leading to significant energy deman...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Towards Optimal Distributed Delta Coloring	Manuel Jakob, Yannic Maus	2025-04-03	Download	The Δ-vertex coloring problem has become one of the prototypical problems for understanding the complexity of local distributed graph problems on constant-degree graphs.
Distributed Locking: Performance Analysis and Optimization Strategies	Andre Rodriguez, William Osborn	2025-04-03	Download	Distributed locking mechanisms are fundamental to ensuring data consistency and integrity in distributed systems. This paper presents a comprehensive analysis of distributed locking algorithms, focusi...
FAST: Federated Active Learning with Foundation Models for Communication-efficient Sampling and Training	Haoyuan Li, Mathias Funk, Jindong Wang, Aaqib Saeed	2025-04-03	Download	Federated Active Learning (FAL) has emerged as a promising framework to leverage large quantities of unlabeled data across distributed clients while preserving data privacy.
Web3DB: Web 3.0 RDBMS for Individual Data Ownership	Shankha Shubhra Mukherjee, Wenyi Tang, Gustavo Prado Fenzi Aniceto, Jake Chandler, WenZhan Song, Taeho Jung	2025-04-03	Download	This paper introduces Web3DB, a decentralized relational database management system (RDBMS) designed to align with the principles of Web 3.0, addressing critical shortcomings of traditional centralize...
Snow: Self-organizing Broadcast Protocol for Cloud	Chengkai Tong	2025-04-03	Download	In large-scale distributed applications, efficient and reliable broadcast protocols are essential for node communication. Tree-based broadcast lacks flexibility and may suffer performance degradation ...
Blockchain and Distributed Ledger Technologies for Cyberthreat Intelligence Sharing	Asadullah Tariq, Tariq Qayyum, Saed Alrabaee, Mohamed Adel Serhani	2025-04-03	Download	Cyberthreat intelligence sharing is a critical aspect of cybersecurity, and it is essential to understand its definition, objectives, benefits, and impact on society.
Ethics of Blockchain Technologies	Georgy Ishmaev	2025-04-03	Download	This chapter explores three key questions in blockchain ethics. First, it situates blockchain ethics within the broader field of technology ethics, outlining its goals and guiding principles.
Towards Learning-Augmented Peer-to-Peer Networks: Self-Stabilizing Graph Linearization with Untrusted Advice	Vijeth Aradhya, Christian Scheideler	2025-04-03	Download	Distributed peer-to-peer systems are widely popular due to their decentralized nature, which ensures that no peer is critical for the functionality of the system.
FlowKV: A Disaggregated Inference Framework with Low-Latency KV Cache Transfer and Load-Aware Scheduling	Weiqing Li, Guochao Jiang, Xiangyong Ding, Zhangcheng Tao, Chuzhan Hao, Chenfeng Xu, Yuewei Zhang, Hao Wang	2025-04-03	Download	Disaggregated inference has become an essential framework that separates the prefill (P) and decode (D) stages in large language model inference to improve throughput.
Exploring energy consumption of AI frameworks on a 64-core RV64 Server CPU	Giulio Malenza, Francesco Targa, Adriano Marques Garcia, Marco Aldinucci, Robert Birke	2025-04-03	Download	In today's era of rapid technological advancement, artificial intelligence (AI) applications require large-scale, high-performance, and data-intensive computations, leading to significant energy deman...
SProBench: Stream Processing Benchmark for High Performance Computing Infrastructure	Apurv Deepak Kulkarni, Siavash Ghiasvand	2025-04-03	Download	Recent advancements in data stream processing frameworks have improved real-time data handling, however, scalability remains a significant challenge affecting throughput and latency.
Distributed Log-driven Anomaly Detection System based on Evolving Decision Making	Zhuoran Tan, Qiyuan Wang, Christos Anagnostopoulos, Shameem P. Parambath, Jeremy Singer, Sam Temple	2025-04-03	Download	Effective anomaly detection from logs is crucial for enhancing cybersecurity defenses by enabling the early identification of threats. Despite advances in anomaly detection, existing systems often fal...
Distributed Temporal Graph Learning with Provenance for APT Detection in Supply Chains	Zhuoran Tan, Christos Anagnostopoulos, Jeremy Singer	2025-04-03	Download	Cyber supply chain, encompassing digital assets, software, hardware, has become an essential component of modern Information and Communications Technology (ICT) provisioning.
MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism	Ruidong Zhu, Ziheng Jiang, Chao Jin, Peng Wu, Cesar A. Stuardo, Dongyang Wang, Xinlei Zhang, Huaping Zhou, Haoran Wei, Yang Cheng, Jianzhe Xiao, Xinyi Zhang, Lingjun Liu, Haibin Lin, Li-Wen Chang, Jianxi Ye, Xiao Yu, Xuanzhe Liu, Xin Jin, Xin Liu	2025-04-03	Download	Mixture-of-Experts (MoE) showcases tremendous potential to scale large language models (LLMs) with enhanced performance and reduced computational complexity.
Comparative Analysis of Distributed Caching Algorithms: Performance Metrics and Implementation Considerations	Helen Mayer, James Richards	2025-04-03	Download	This paper presents a comprehensive comparison of distributed caching algorithms employed in modern distributed systems. We evaluate various caching strategies including Least Recently Used (LRU), Lea...
FT-Transformer: Resilient and Reliable Transformer with End-to-End Fault Tolerant Attention	Huangliang Dai, Shixun Wu, Jiajun Huang, Zizhe Jian, Yue Zhu, Haiyang Hu, Zizhong Chen	2025-04-03	Download	Transformer models rely on High-Performance Computing (HPC) resources for inference, where soft errors are inevitable in large-scale systems, making the reliability of the model particularly critical.

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Dynamic Directional Routing of Freight in the Physical Internet	Sahrish Jaleel Shaikh, Praveen Muthukrishnan, Yijun Lai, Benoit Montreuil	2025-04-03	Download	The Physical Internet (PI) envisions an interconnected, modular, and dynamically managed logistics system inspired by the Digital Internet. It enables open-access networks where shipments traverse a h...
Handover and SINR-Aware Path Optimization in 5G-UAV mmWave Communication using DRL	Achilles Kiwanuka Machumilane, Alberto Gotta, Pietro Cassarà	2025-04-03	Download	Path planning and optimization for unmanned aerial vehicles (UAVs)-assisted next-generation wireless networks is critical for mobility management and ensuring UAV safety and ubiquitous connectivity, e...
Medium Access for Push-Pull Data Transmission in 6G Wireless Systems	Shashi Raj Pandey, Fabio Saggese, Junya Shiraishi, Federico Chiariotti, Petar Popovski	2025-04-03	Download	Medium access in 5G systems was tailored to accommodate diverse traffic classes through network resource slicing. 6G wireless systems are expected to be significantly reliant on Artificial Intelligenc...
Data-Driven Design of 3GPP Handover Parameters with Bayesian Optimization and Transfer Learning	Mohamed Benzaghta, Sahar Ammar, David López-Pérez, Basem Shihada, Giovanni Geraci	2025-04-03	Download	Mobility management in dense cellular networks is challenging due to varying user speeds and deployment conditions. Traditional 3GPP handover (HO) schemes, relying on fixed A3-offset and time-to-trigg...
Digital Twins for Internet of Battlespace Things (IoBT) Coalitions	Athanasios Gkelias, Patrick J. Baker, Kin K. Leung, Olwen Worthington, Christopher R. Melville	2025-04-03	Download	This paper presents a new framework for integrating Digital Twins (DTs) within Internet of battlespace Things (IoBT) coalitions. We introduce a novel three-tier architecture that enables efficient coo...
Lifecycle Management of Trustworthy AI Models in 6G Networks: The REASON Approach	Juan Parra-Ullauri, Xueqing Zhou, Shadi Moazzeni, Rasheed Hussain, Xenofon Vasilakos, Yulei Wu, Renjith Baby, M M Hassan Mahmud, Gabriele Incorvaia, Darryl Hond, Hamid Asgari, Andrea Tassi, Daniel Warren, Dimitra Simeonidou	2025-04-03	Download	Artificial Intelligence (AI) is expected to play a key role in 6G networks including optimising system management, operation, and evolution. This requires systematic lifecycle management of AI models,...

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
A Scalable Synthesis Algorithm for Reversible Functions	Moein Sarvaghad-Moghaddam, Morteza Saheb Zamani, Mehdi Sedighi	2025-04-03	Download	Reversible computation is an emerging technology that has gained significant attention due to its critical role in quantum circuit synthesis and low-power design.
SProBench: Stream Processing Benchmark for High Performance Computing Infrastructure	Apurv Deepak Kulkarni, Siavash Ghiasvand	2025-04-03	Download	Recent advancements in data stream processing frameworks have improved real-time data handling, however, scalability remains a significant challenge affecting throughput and latency.
Finite-Time Behavior of Erlang-C Model: Mixing Time, Mean Queue Length and Tail Bounds	Hoang Huy Nguyen, Sushil Mahavir Varma, Siva Theja Maguluri	2025-04-03	Download	Service systems like data centers and ride-hailing are popularly modeled as queueing systems in the literature. Such systems are primarily studied in the steady state due to their analytical tractabil...