2025-06-03

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Large Processor Chip Model	Kaiyan Chang, Mingzhi Chen, Yunji Chen, Zhirong Chen, Dongrui Fan, Junfeng Gong, Nan Guo, Yinhe Han, Qinfen Hao, Shuo Hou, Xuan Huang, Pengwei Jin, Changxin Ke, Cangyuan Li, Guangli Li, Huawei Li, Kuan Li, Naipeng Li, Shengwen Liang, Cheng Liu, Hongwei Liu, Jiahua Liu, Junliang Lv, Jianan Mu, Jin Qin, Bin Sun, Chenxi Wang, Duo Wang, Mingjun Wang, Ying Wang, Chenggang Wu, Peiyang Wu, Teng Wu, Xiao Xiao, Mengyao Xie, Chenwei Xiong, Ruiyuan Xu, Mingyu Yan, Xiaochun Ye, Kuai Yu, Rui Zhang, Shuoming Zhang, Jiacheng Zhao	2025-06-03	下载	Computer System Architecture serves as a crucial bridge between software applications and the underlying hardware, encompassing components like compilers, CPUs, coprocessors, and RTL designs.
CLONE: Customizing LLMs for Efficient Latency-Aware Inference at the Edge	Chunlin Tian, Xinpeng Qin, Kahou Tam, Li Li, Zijian Wang, Yuanzhe Zhao, Minglei Zhang, Chengzhong Xu	2025-06-03	下载	Deploying large language models (LLMs) on edge devices is crucial for delivering fast responses and ensuring data privacy. However, the limited storage, weight, and power of edge devices make it diffi...
CPU-Based Layout Design for Picker-to-Parts Pallet Warehouses	Timo Looms, Lin Xie	2025-06-03	下载	Picker-to-parts pallet warehouses often face inefficiencies due to conventional layouts causing excessive travel distances and high labor requirements.
Hardware-Centric Analysis of DeepSeek's Multi-Head Latent Attention	Robin Geens, Marian Verhelst	2025-06-03	下载	Multi-Head Latent Attention (MLA), introduced in DeepSeek-V2, improves the efficiency of large language models by projecting query, key, and value tensors into a compact latent space.
Memory Access Vectors: Improving Sampling Fidelity for CPU Performance Simulations	Sriyash Caculo, Mahesh Madhav, Jeff Baxter	2025-06-03	下载	Accurate performance projection of large-scale benchmarks is essential for CPU architects to evaluate and optimize future processor designs. SimPoint sampling, which uses Basic Block Vectors (BBVs), i...
Minimal Neuron Circuits -- Part I: Resonators	Amr Nabil, T. Nandha Kumar, Haider Abbas F. Almurib	2025-06-03	下载	Spiking Neural Networks have earned increased recognition in recent years owing to their biological plausibility and event-driven computation.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
The Cloud Next Door: Investigating the Environmental and Socioeconomic Strain of Datacenters on Local Communities	Wacuka Ngata, Noman Bashir, Michelle Westerlaken, Laurent Liote, Yasra Chandio, Elsa Olivetti	2025-06-03	下载	Datacenters have become the backbone of modern digital infrastructure, powering the rapid rise of artificial intelligence and promising economic growth and technological progress.
Relay Selection and User Equipment Admission in Resource-Efficient NextG Sidelink Communications	Yalin E. Sagduyu, Tugba Erpek, Sastry Kompella, Kemal Davaslioglu	2025-06-03	下载	5G/6G sidelink communications addresses the challenge of connecting outer UEs, which are unable to directly access a base station (gNodeB), through inner UEs that act as relays to connect to the gNode...
APEX: Asynchronous Parallel CPU-GPU Execution for Online LLM Inference on Constrained GPUs	Jiakun Fan, Yanglin Zhang, Xiangchen Li, Dimitrios S. Nikolopoulos	2025-06-03	下载	Deploying large language models (LLMs) for online inference is often constrained by limited GPU memory, particularly due to the growing KV cache during auto-regressive decoding.
GPU-Parallelizable Randomized Sketch-and-Precondition for Linear Regression using Sparse Sign Sketches	Tyler Chen, Pradeep Niroula, Archan Ray, Pragna Subrahmanya, Marco Pistoia, Niraj Kumar	2025-06-03	下载	A litany of theoretical and numerical results have established the sketch-and-precondition paradigm as a powerful approach to solving large linear regression problems in standard computing environment...
Dynamic Fee for Reducing Impermanent Loss in Decentralized Exchanges	Irina Lebedeva, Dmitrii Umnov, Yury Yanovich, Ignat Melnikov, George Ovchinnikov	2025-06-03	下载	Decentralized exchanges (DEXs) are crucial to decentralized finance (DeFi) as they enable trading without intermediaries. However, they face challenges like impermanent loss (IL), where liquidity prov...
Memory-Efficient Split Federated Learning for LLM Fine-Tuning on Heterogeneous Mobile Devices	Xiaopei Chen, Liang Li, Fei Ji, Wen Wu	2025-06-03	下载	In this paper, we propose an edge-assisted split federated learning framework to facilitate large language model (LLM) fine-tuning on heterogeneous mobile devices while alleviating memory pressures on...
Overcoming Challenges of Partial Client Participation in Federated Learning : A Comprehensive Review	Mrinmay Sen, Shruti Aparna, Rohit Agarwal, Chalavadi Krishna Mohan	2025-06-03	下载	Federated Learning (FL) is a learning mechanism that falls under the distributed training umbrella, which collaboratively trains a shared global model without disclosing the raw data from different cl...
Process Mining on Distributed Data Sources	Maximilian Weisenseel, Julia Andersen, Samira Akili, Christian Imenkamp, Hendrik Reiter, Christoffer Rubensson, Wilhelm Hasselbring, Olaf Landsiedel, Xixi Lu, Jan Mendling, Florian Tschorsch, Matthias Weidlich, Agnes Koschmider	2025-06-03	下载	Major domains such as logistics, healthcare, and smart cities increasingly rely on sensor technologies and distributed infrastructures to monitor complex processes in real time.
From Local Updates to Global Balance: A Framework for Distributed Matrix Scaling	Giacomo Aletti, Giovanni Naldi	2025-06-03	下载	This paper investigates matrix scaling processes in the context of local normalization algorithms and their convergence behavior. Starting from the classical Sinkhorn algorithm, the authors introduce ...
Adaptive Configuration Selection for Multi-Model Inference Pipelines in Edge Computing	Jinhao Sheng, Zhiqing Tang, Jianxiong Guo, Tian Wang	2025-06-03	下载	The growing demand for real-time processing tasks is driving the need for multi-model inference pipelines on edge devices. However, cost-effectively deploying these pipelines while optimizing Quality ...
Exploring metrics for analyzing dynamic behavior in MPI programs via a coupled-oscillator model	Ayesha Afzal, Georg Hager, Gerhard Wellen	2025-06-03	下载	We propose a novel, lightweight, and physically inspired approach to modeling the dynamics of parallel distributed-memory programs. Inspired by the Kuramoto model, we represent MPI processes as couple...
Rethinking Dynamic Networks and Heterogeneous Computing with Automatic Parallelization	Ruilong Wu, Xinjiao Li, Yisu Wang, Xinyu Chen, Dirk Kutscher	2025-06-03	下载	Hybrid parallelism techniques are essential for efficiently training large language models (LLMs). Nevertheless, current automatic parallel planning frameworks often overlook the simultaneous consider...
Usability Evaluation of Cloud for HPC Applications	Vanessa Sochat, Daniel Milroy, Abhik Sarkar, Aniruddha Marathe, Tapasya Patki	2025-06-03	下载	The rise of AI and the economic dominance of cloud computing have created a new nexus of innovation for high performance computing (HPC), which has a long history of driving scientific discovery.
KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider	Jiahao Wang, Jinbo Han, Xingda Wei, Sijie Shen, Dingyan Zhang, Chenguang Fang, Rong Chen, Wenyuan Yu, Haibo Chen	2025-06-03	下载	Serving large language models (LLMs) is important for cloud providers, and caching intermediate results (KV$) after processing each request substantially improves serving throughput and latency.
Distributedness based scheduling	Paritosh Ranjan, Surajit Majumder, Prodip Roy, Bhuban Padhan	2025-06-03	下载	Efficient utilization of computing resources in a Kubernetes cluster is often constrained by the uneven distribution of pods with similar usage patterns.
Simplifying Root Cause Analysis in Kubernetes with StateGraph and LLM	Yong Xiang, Charley Peter Chen, Liyi Zeng, Wei Yin, Xin Liu, Hu Li, Wei Xu	2025-06-03	下载	Kubernetes, a notably complex and distributed system, utilizes an array of controllers to uphold cluster management logic through state reconciliation.
DiOMP-Offloading: Toward Portable Distributed Heterogeneous OpenMP	Baodi Shan, Mauricio Araya-Polo, Barbara Chapman	2025-06-03	下载	As core counts and heterogeneity rise in HPC, traditional hybrid programming models face challenges in managing distributed GPU memory and ensuring portability.
Enhancing Convergence, Privacy and Fairness for Wireless Personalized Federated Learning: Quantization-Assisted Min-Max Fair Scheduling	Xiyu Zhao, Qimei Cui, Ziqiang Du, Weicai Li, Xi Yu, Wei Ni, Ji Zhang, Xiaofeng Tao, Ping Zhang	2025-06-03	下载	Personalized federated learning (PFL) offers a solution to balancing personalization and generalization by conducting federated learning (FL) to guide personalized learning (PL).
Converge Faster, Talk Less: Hessian-Informed Federated Zeroth-Order Optimization	Zhe Li, Bicheng Ying, Zidong Liu, Chaosheng Dong, Haibo Yang	2025-06-03	下载	Zeroth-order (ZO) optimization enables dimension-free communication in federated learning (FL), making it attractive for fine-tuning of large language models (LLMs) due to significant communication sa...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Relay Selection and User Equipment Admission in Resource-Efficient NextG Sidelink Communications	Yalin E. Sagduyu, Tugba Erpek, Sastry Kompella, Kemal Davaslioglu	2025-06-03	下载	5G/6G sidelink communications addresses the challenge of connecting outer UEs, which are unable to directly access a base station (gNodeB), through inner UEs that act as relays to connect to the gNode...
AI-Augmented OTDR Fault Localization Framework for Resilient Rural Fiber Networks in the United States	Sabab Al Farabi	2025-06-03	下载	This research presents a novel framework that combines traditional Optical Time-Domain Reflectometer (OTDR) signal analysis with machine learning to localize and classify fiber optic faults in rural b...
Computation- and Communication-Efficient Online FL for Resource-Constrained Aerial Vehicles	Ferdous Pervej, Richeng Jin, Md Moin Uddin Chowdhury, Simran Singh, İsmail Güvenç, Huaiyu Dai	2025-06-03	下载	Privacy-preserving distributed machine learning (ML) and aerial connected vehicle (ACV)-assisted edge computing have drawn significant attention lately.
Quantum Data Centres: Why Entanglement Changes Everything	Angela Sara Cacciapuoti, Claudio Pellitteri, Jessica Illiano, Laura d'Avossa, Francesco Mazza, Siyi Chen, Marcello Caleffi	2025-06-03	下载	The Quantum Internet is key for distributed quantum computing, by interconnecting multiple quantum processors into a virtual quantum computation system.
NetArena: Dynamic Benchmarks for AI Agents in Network Automation	Yajie Zhou, Jiajun Ruan, Eric S. Wang, Sadjad Fouladi, Francis Y. Yan, Kevin Hsieh, Zaoxing Liu	2025-06-03	下载	As AI agents expand into high-stakes domains like network system operations, evaluating their real-world reliability becomes increasingly critical.
AI-Driven Vehicle Condition Monitoring with Cell-Aware Edge Service Migration	Charalampos Kalalas, Pavol Mulinka, Guillermo Candela Belmonte, Miguel Fornell, Michail Dalgitsis, Francisco Paredes Vera, Javier Santaella Sánchez, Carmen Vicente Villares, Roshan Sedar, Eftychia Datsika, Angelos Antonopoulos, Antonio Fernández Ojea, Miquel Payaro	2025-06-03	下载	Artificial intelligence (AI) has been increasingly applied to the condition monitoring of vehicular equipment, aiming to enhance maintenance strategies, reduce costs, and improve safety.
HARNode: A Time-Synchronised, Open-Source, Multi-Device, Wearable System for Ad Hoc Field Studies	Philipp Lepold, Tobias Röddiger, Michael Beigl	2025-06-03	下载	Human activity recognition (HAR) research often lacks accessible, comprehensive field data. Commercial systems are rarely open source, hard to expand, and limited by issues like node synchronisation, ...
Zero-Energy RIS-Assisted Communications With Noise Modulation and Interference-Based Energy Harvesting	Ahmad Massud Tota Khel, Aissa Ikhlef, Zhiguo Ding, Hongjian Sun	2025-06-03	下载	To advance towards carbon-neutrality and improve the limited {performance} of conventional passive wireless communications, in this paper, we investigate the integration of noise modulation with zero-...
Channel-adaptive Cross-modal Generative Semantic Communication for Point Cloud Transmission	Wanting Yang, Zehui Xiong, Qianqian Yang, Ping Zhang, Merouane Debbah, Rahim Tafazolli	2025-06-03	下载	With the rapid development of autonomous driving and extended reality, efficient transmission of point clouds (PCs) has become increasingly important.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Energy Efficiency Analysis of Active RIS-enhanced Wireless Network under Power-Sum Constraint	Jingdie Xin, Yan Wang, Feng Shu, Feng Zhao, Yifan Zhao, Hao Jiang	2025-06-03	下载	Recently, as a green wireless technology, active reconfigurable intelligent surface (RIS) attracts numerous research activities due to its amplifying ability to combat the double-fading effect compare...
Usability Evaluation of Cloud for HPC Applications	Vanessa Sochat, Daniel Milroy, Abhik Sarkar, Aniruddha Marathe, Tapasya Patki	2025-06-03	下载	The rise of AI and the economic dominance of cloud computing have created a new nexus of innovation for high performance computing (HPC), which has a long history of driving scientific discovery.
Spatially Correlated multi-RIS Communication: The Effect of Inter-Operator Interference	Nikolaos I. Miridakis, Panagiotis A. Karkazis	2025-06-03	下载	A multi-operator wireless communication system is studied where each operator is equipped with a reconfigurable intelligent surface (RIS) to enhance its communication quality.