Skip to content

2025-04-03

cs.AR - Architecture

标题作者发布日期PDF摘要
Unlocking the AMD Neural Processing Unit for ML Training on the Client Using Bare-Metal-Programming ToolsAndré Rösti, Michael Franz2025-04-03下载There has been a growing interest in executing machine learning (ML) workloads on the client side for reasons of customizability, privacy, performance, and availability.
ARCANE: Adaptive RISC-V Cache Architecture for Near-memory ExtensionsVincenzo Petrolo, Flavia Guella, Michele Caon, Pasquale Davide Schiavone, Guido Masera, Maurizio Martina2025-04-03下载Modern data-driven applications expose limitations of von Neumann architectures - extensive data movement, low throughput, and poor energy efficiency.
Exploring energy consumption of AI frameworks on a 64-core RV64 Server CPUGiulio Malenza, Francesco Targa, Adriano Marques Garcia, Marco Aldinucci, Robert Birke2025-04-03下载In today's era of rapid technological advancement, artificial intelligence (AI) applications require large-scale, high-performance, and data-intensive computations, leading to significant energy deman...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Towards Optimal Distributed Delta ColoringManuel Jakob, Yannic Maus2025-04-03下载The Δ-vertex coloring problem has become one of the prototypical problems for understanding the complexity of local distributed graph problems on constant-degree graphs.
Distributed Locking: Performance Analysis and Optimization StrategiesAndre Rodriguez, William Osborn2025-04-03下载Distributed locking mechanisms are fundamental to ensuring data consistency and integrity in distributed systems. This paper presents a comprehensive analysis of distributed locking algorithms, focusi...
FAST: Federated Active Learning with Foundation Models for Communication-efficient Sampling and TrainingHaoyuan Li, Mathias Funk, Jindong Wang, Aaqib Saeed2025-04-03下载Federated Active Learning (FAL) has emerged as a promising framework to leverage large quantities of unlabeled data across distributed clients while preserving data privacy.
Web3DB: Web 3.0 RDBMS for Individual Data OwnershipShankha Shubhra Mukherjee, Wenyi Tang, Gustavo Prado Fenzi Aniceto, Jake Chandler, WenZhan Song, Taeho Jung2025-04-03下载This paper introduces Web3DB, a decentralized relational database management system (RDBMS) designed to align with the principles of Web 3.0, addressing critical shortcomings of traditional centralize...
Snow: Self-organizing Broadcast Protocol for CloudChengkai Tong2025-04-03下载In large-scale distributed applications, efficient and reliable broadcast protocols are essential for node communication. Tree-based broadcast lacks flexibility and may suffer performance degradation ...
Blockchain and Distributed Ledger Technologies for Cyberthreat Intelligence SharingAsadullah Tariq, Tariq Qayyum, Saed Alrabaee, Mohamed Adel Serhani2025-04-03下载Cyberthreat intelligence sharing is a critical aspect of cybersecurity, and it is essential to understand its definition, objectives, benefits, and impact on society.
Ethics of Blockchain TechnologiesGeorgy Ishmaev2025-04-03下载This chapter explores three key questions in blockchain ethics. First, it situates blockchain ethics within the broader field of technology ethics, outlining its goals and guiding principles.
Towards Learning-Augmented Peer-to-Peer Networks: Self-Stabilizing Graph Linearization with Untrusted AdviceVijeth Aradhya, Christian Scheideler2025-04-03下载Distributed peer-to-peer systems are widely popular due to their decentralized nature, which ensures that no peer is critical for the functionality of the system.
FlowKV: A Disaggregated Inference Framework with Low-Latency KV Cache Transfer and Load-Aware SchedulingWeiqing Li, Guochao Jiang, Xiangyong Ding, Zhangcheng Tao, Chuzhan Hao, Chenfeng Xu, Yuewei Zhang, Hao Wang2025-04-03下载Disaggregated inference has become an essential framework that separates the prefill (P) and decode (D) stages in large language model inference to improve throughput.
Exploring energy consumption of AI frameworks on a 64-core RV64 Server CPUGiulio Malenza, Francesco Targa, Adriano Marques Garcia, Marco Aldinucci, Robert Birke2025-04-03下载In today's era of rapid technological advancement, artificial intelligence (AI) applications require large-scale, high-performance, and data-intensive computations, leading to significant energy deman...
SProBench: Stream Processing Benchmark for High Performance Computing InfrastructureApurv Deepak Kulkarni, Siavash Ghiasvand2025-04-03下载Recent advancements in data stream processing frameworks have improved real-time data handling, however, scalability remains a significant challenge affecting throughput and latency.
Distributed Log-driven Anomaly Detection System based on Evolving Decision MakingZhuoran Tan, Qiyuan Wang, Christos Anagnostopoulos, Shameem P. Parambath, Jeremy Singer, Sam Temple2025-04-03下载Effective anomaly detection from logs is crucial for enhancing cybersecurity defenses by enabling the early identification of threats. Despite advances in anomaly detection, existing systems often fal...
Distributed Temporal Graph Learning with Provenance for APT Detection in Supply ChainsZhuoran Tan, Christos Anagnostopoulos, Jeremy Singer2025-04-03下载Cyber supply chain, encompassing digital asserts, software, hardware, has become an essential component of modern Information and Communications Technology (ICT) provisioning.
MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert ParallelismRuidong Zhu, Ziheng Jiang, Chao Jin, Peng Wu, Cesar A. Stuardo, Dongyang Wang, Xinlei Zhang, Huaping Zhou, Haoran Wei, Yang Cheng, Jianzhe Xiao, Xinyi Zhang, Lingjun Liu, Haibin Lin, Li-Wen Chang, Jianxi Ye, Xiao Yu, Xuanzhe Liu, Xin Jin, Xin Liu2025-04-03下载Mixture-of-Experts (MoE) showcases tremendous potential to scale large language models (LLMs) with enhanced performance and reduced computational complexity.
Comparative Analysis of Distributed Caching Algorithms: Performance Metrics and Implementation ConsiderationsHelen Mayer, James Richards2025-04-03下载This paper presents a comprehensive comparison of distributed caching algorithms employed in modern distributed systems. We evaluate various caching strategies including Least Recently Used (LRU), Lea...
FT-Transformer: Resilient and Reliable Transformer with End-to-End Fault Tolerant AttentionHuangliang Dai, Shixun Wu, Jiajun Huang, Zizhe Jian, Yue Zhu, Haiyang Hu, Zizhong Chen2025-04-03下载Transformer models rely on High-Performance Computing (HPC) resources for inference, where soft errors are inevitable in large-scale systems, making the reliability of the model particularly critical.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Dynamic Directional Routing of Freight in the Physical InternetSahrish Jaleel Shaikh, Praveen Muthukrishnan, Yijun Lai, Benoit Montreuil2025-04-03下载The Physical Internet (PI) envisions an interconnected, modular, and dynamically managed logistics system inspired by the Digital Internet. It enables open-access networks where shipments traverse a h...
Handover and SINR-Aware Path Optimization in 5G-UAV mmWave Communication using DRLAchilles Kiwanuka Machumilane, Alberto Gotta, Pietro Cassarà2025-04-03下载Path planning and optimization for unmanned aerial vehicles (UAVs)-assisted next-generation wireless networks is critical for mobility management and ensuring UAV safety and ubiquitous connectivity, e...
Medium Access for Push-Pull Data Transmission in 6G Wireless SystemsShashi Raj Pandey, Fabio Saggese, Junya Shiraishi, Federico Chiariotti, Petar Popovski2025-04-03下载Medium access in 5G systems was tailored to accommodate diverse traffic classes through network resource slicing. 6G wireless systems are expected to be significantly reliant on Artificial Intelligenc...
Data-Driven Design of 3GPP Handover Parameters with Bayesian Optimization and Transfer LearningMohamed Benzaghta, Sahar Ammar, David López-Pérez, Basem Shihada, Giovanni Geraci2025-04-03下载Mobility management in dense cellular networks is challenging due to varying user speeds and deployment conditions. Traditional 3GPP handover (HO) schemes, relying on fixed A3-offset and time-to-trigg...
Digital Twins for Internet of Battlespace Things (IoBT) CoalitionsAthanasios Gkelias, Patrick J. Baker, Kin K. Leung, Olwen Worthington, Christopher R. Melville2025-04-03下载This paper presents a new framework for integrating Digital Twins (DTs) within Internet of battlespace Things (IoBT) coalitions. We introduce a novel three-tier architecture that enables efficient coo...
Lifecycle Management of Trustworthy AI Models in 6G Networks: The REASON ApproachJuan Parra-Ullauri, Xueqing Zhou, Shadi Moazzeni, Rasheed Hussain, Xenofon Vasilakos, Yulei Wu, Renjith Baby, M M Hassan Mahmud, Gabriele Incorvaia, Darryl Hond, Hamid Asgari, Andrea Tassi, Daniel Warren, Dimitra Simeonidou2025-04-03下载Artificial Intelligence (AI) is expected to play a key role in 6G networks including optimising system management, operation, and evolution. This requires systematic lifecycle management of AI models,...

cs.PF - Performance

标题作者发布日期PDF摘要
A Scalable Synthesis Algorithm for Reversible FunctionsMoein Sarvaghad-Moghaddam, Morteza Saheb Zamani, Mehdi Sedighi2025-04-03下载Reversible computation is an emerging technology that has gained significant attention due to its critical role in quantum circuit synthesis and low-power design.
SProBench: Stream Processing Benchmark for High Performance Computing InfrastructureApurv Deepak Kulkarni, Siavash Ghiasvand2025-04-03下载Recent advancements in data stream processing frameworks have improved real-time data handling, however, scalability remains a significant challenge affecting throughput and latency.
Finite-Time Behavior of Erlang-C Model: Mixing Time, Mean Queue Length and Tail BoundsHoang Huy Nguyen, Sushil Mahavir Varma, Siva Theja Maguluri2025-04-03下载Service systems like data centers and ride-hailing are popularly modeled as queueing systems in the literature. Such systems are primarily studied in the steady state due to their analytical tractabil...

基于 VitePress 构建