2025-08-01

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
E2ATST: A Temporal-Spatial Optimized Energy-Efficient Architecture for Training Spiking Transformer	Yunhao Ma, Yanyu Lin, Mingjing Li, Puli Quan, Chenlin Zhou, Wenyue Zhang, Zhiwei Zhong, Wanyi Jia, Xueke Zhu, Qingyan Meng, Huihui Zhou, Fengwei An	2025-08-01	下载	(1) Pengcheng Laboratory, (2) Southern University of Science and Technology, (3) Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, (4) University of Chinese Academy of Sciences
DGEMM without FP64 Arithmetic - Using FP64 Emulation and FP8 Tensor Cores with Ozaki Scheme	Daichi Mukunoki	2025-08-01	下载	As the demand for AI computation rapidly increases, more hardware is being developed to efficiently perform the low-precision matrix multiplications required by such workloads.
Reimagining Voltage-Controlled Cryogenic Boolean Logic Paradigm with Quantum-Enhanced Josephson Junction FETs	Md Mazharul Islam, Diego Ferrer, Shamiul Alam, Juan P. Mendez, Denis Mamaluy, Wei Pan, Ahmedullah Aziz	2025-08-01	下载	The growing demand for ultra low power computing and the emergence of quantum technologies have intensified interest in cryogenic electronics, particularly superconducting devices.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
AdVAR-DNN: Adversarial Misclassification Attack on Collaborative DNN Inference	Shima Yousefi, Motahare Mounesan, Saptarshi Debroy	2025-08-01	下载	In recent years, Deep Neural Networks (DNNs) have become increasingly integral to IoT-based environments, enabling realtime visual computing. However, the limited computational capacity of these devic...
Optimal Scheduling Algorithms for LLM Inference: Theory and Practice	Agrim Bari, Parikshit Hegde, Gustavo de Veciana	2025-08-01	下载	With the growing use of Large Language Model (LLM)-based tools like ChatGPT, Perplexity, and Gemini across industries, there is a rising need for efficient LLM inference systems.
Adacc: An Adaptive Framework Unifying Compression and Activation Recomputation for LLM Training	Ping Chen, Zhuohong Deng, Ping Li, Shuibing He, Hongzi Zhu, Yi Zheng, Zhefeng Wang, Baoxing Huai, Minyi Guo	2025-08-01	下载	Training large language models (LLMs) is often constrained by GPU memory limitations. To alleviate memory pressure, activation recomputation and data compression have been proposed as two major strate...
FedGuard: A Diverse-Byzantine-Robust Mechanism for Federated Learning with Major Malicious Clients	Haocheng Jiang, Hua Shen, Jixin Zhang, Willy Susilo, Mingwu Zhang	2025-08-01	下载	Federated learning is a distributed training framework vulnerable to Byzantine attacks, particularly when over 50% of clients are malicious or when datasets are highly non-independent and identically ...
SwarmRaft: Leveraging Consensus for Robust Drone Swarm Coordination in GNSS-Degraded Environments	Kapel Dev, Yash Madhwal, Sofia Shevelo, Pavel Osinenko, Yury Yanovich	2025-08-01	下载	Unmanned aerial vehicle (UAV) swarms are increasingly used in critical applications such as aerial mapping, environmental monitoring, and autonomous delivery.
A Parallel Alternative for Energy-Efficient Neural Network Training and Inferencing	Sudip K. Seal, Maksudul Alam, Jorge Ramirez, Sajal Dash, Hao Lu	2025-08-01	下载	Energy efficiency of training and inferencing with large neural network models is a critical challenge facing the future of sustainable large-scale machine learning workloads.
Information-Theoretic Decentralized Secure Aggregation with Passive Collusion Resilience	Xiang Zhang, Zhou Li, Shuangyang Li, Kai Wan, Derrick Wing Kwan Ng, Giuseppe Caire	2025-08-01	下载	In decentralized federated learning (FL), multiple clients collaboratively learn a shared machine learning (ML) model by leveraging their privately held datasets distributed across the network, throug...
Tetris: Efficient Intra-Datacenter Calls Packing for Large Conferencing Services	Rohan Gandhi, Ankur Mallick, Ken Sueda, Rui Liang	2025-08-01	下载	Conference services like Zoom, Microsoft Teams, and Google Meet facilitate millions of daily calls, yet ensuring high performance at low costs remains a significant challenge.
Integrated user scheduling and beam steering in over-the-air federated learning for mobile IoT	Shengheng Liu, Ningning Fu, Zhonghao Zhang, Yongming Huang, Tony Q. S. Quek	2025-08-01	下载	The rising popularity of Internet of things (IoT) has spurred technological advancements in mobile internet and interconnected systems. While offering flexible connectivity and intelligent application...
Quality-of-Service Aware LLM Routing for Edge Computing with Multiple Experts	Jin Yang, Qiong Wu, Zhiying Feng, Zhi Zhou, Deke Guo, Xu Chen	2025-08-01	下载	Large Language Models (LLMs) have demonstrated remarkable capabilities, leading to a significant increase in user demand for LLM services. However, cloud-based LLM services often suffer from high late...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Connectivity Management in Satellite-Aided Vehicular Networks with Multi-Head Attention-Based State Estimation	Ibrahim Althamary, Chen-Fu Chou, Chih-Wei Huang	2025-08-01	下载	Managing connectivity in integrated satellite-terrestrial vehicular networks is critical for 6G, yet is challenged by dynamic conditions and partial observability.
A Deep Reinforcement Learning-Based TCP Congestion Control Algorithm: Design, Simulation, and Evaluation	Efe Ağlamazlar, Emirhan Eken, Harun Batur Geçici	2025-08-01	下载	This paper introduces a Deep Reinforcement Learning (DRL) based TCP congestion-control algorithm that uses a Deep Q-Network (DQN) to adapt the congestion window (cWnd) dynamically based on observed ne...
Data Movement Manager (DMM) for the SENSE-Rucio Interoperation Prototype	Aashay Arora, Diego Davila, Jonathan Guiang, Frank Würthwein, Harvey Newman, Justas Balcas, Tom Lehman, Xi Yang	2025-08-01	下载	The Data Movement Manager (DMM) is a prototype interface that connects CERN's data management software, Rucio, with the Sofware-Defined Networking (SDN) service SENSE by ESNet.
Overlapping IPv4, IPv6, and TCP data: exploring errors, test case context and multiple overlaps inside network stacks and NIDSes with PYROLYSE	Lucas Aubard, Johan Mazel, Gilles Guette, Pierre Chifflier	2025-08-01	下载	IP fragmentation and TCP segmentation allow for splitting large data packets into smaller ones, e.g., for transmission across network links of limited capacity.
Deep Joint Source-Channel Coding for Small Satellite Applications	Olga Kondrateva, Grace Li Zhang, Julian Zobel, Björn Scheuermann, Stefan Dietzel	2025-08-01	下载	Small satellites used for Earth observation generate vast amounts of high-dimensional data, but their operation in low Earth orbit creates a significant communication bottleneck due to limited contact...
Criticality-Based Dynamic Topology Optimization for Enhancing Aerial-Marine Swarm Resilience	Ruiyang Huang, Haocheng Wang, Yixuan Shen, Ning Gao, Qiang Ni, Shi Jin, Yifan Wu	2025-08-01	下载	Heterogeneous marine-aerial swarm networks encounter substantial difficulties due to targeted communication disruptions and structural weaknesses in adversarial environments.
Energy-Aware CPU Orchestration in O-RAN: A dApp-Driven Lightweight Approach	Francisco Crespo, Javier Villegas, Carlos Baena, Eduardo Baena, Sergio Fortes, Raquel Barco	2025-08-01	下载	The transition toward softwarized Radio Access Networks (RANs), driven by the Open RAN (O-RAN) paradigm, enables flexible, vendor-neutral deployments through disaggregation and virtualization of base ...
Joint Association and Phase Shifts Design for UAV-mounted Stacked Intelligent Metasurfaces-assisted Communications	Mingzhe Fan, Geng Sun, Hongyang Pan, Jiacheng Wang, Jiancheng An, Hongyang Du, Chau Yuen	2025-08-01	下载	Stacked intelligent metasurfaces (SIMs) have emerged as a promising technology for realizing wave-domain signal processing, while the fixed SIMs will limit the communication performance of the system ...
Enhancing Wireless Networks for IoT with Large Vision Models: Foundations and Applications	Yunting Xu, Jiacheng Wang, Ruichen Zhang, Dusit Niyato, Deepu Rajan, Liang Yu, Haibo Zhou, Abbas Jamalipour, Xianbin Wang	2025-08-01	下载	Large vision models (LVMs) have emerged as a foundational paradigm in visual intelligence, achieving state-of-the-art performance across diverse visual tasks.
Mamba for Wireless Communications and Networking: Principles and Opportunities	Rongsheng Zhang, Ruichen Zhang, Yang Lu, Wei Chen, Bo Ai, Dusit Niyato	2025-08-01	下载	Mamba has emerged as a powerful model for efficiently addressing tasks involving temporal and spatial data. Regarding the escalating heterogeneity and dynamics in wireless networks, Mamba holds the po...
Multi-grained spatial-temporal feature complementarity for accurate online cellular traffic prediction	Ningning Fu, Shengheng Liu, Weiliang Xie, Yongming Huang	2025-08-01	下载	Knowledge discovered from telecom data can facilitate proactive understanding of network dynamics and user behaviors, which in turn empowers service providers to optimize cellular traffic scheduling a...
Energy Efficient Trajectory Control and Resource Allocation in Multi-UAV-assisted MEC via Deep Reinforcement Learning	Saichao Liu, Geng Sun, Chuang Zhang, Xuejie Liu, Jiacheng Wang, Changyuan Zhao, Dusit Niyato	2025-08-01	下载	Mobile edge computing (MEC) is a promising technique to improve the computational capacity of smart devices (SDs) in Internet of Things (IoT).
Large AI Model-Enabled Secure Communications in Low-Altitude Wireless Networks: Concepts, Perspectives and Case Study	Chuang Zhang, Geng Sun, Yijing Lin, Weijie Yuan, Sinem Coleri, Dusit Niyato	2025-08-01	下载	Low-altitude wireless networks (LAWNs) have the potential to revolutionize communications by supporting a range of applications, including urban parcel delivery, aerial inspections and air taxis.
Quality-of-Service Aware LLM Routing for Edge Computing with Multiple Experts	Jin Yang, Qiong Wu, Zhiying Feng, Zhi Zhou, Deke Guo, Xu Chen	2025-08-01	下载	Large Language Models (LLMs) have demonstrated remarkable capabilities, leading to a significant increase in user demand for LLM services. However, cloud-based LLM services often suffer from high late...
Benchmarking XRootD-HTTPS on 400Gbps Links with Variable Latencies	Aashay Arora, Diego Davila, Frank Würthwein, John Graham, Dima Mishin, Justas Balcas, Tom Lehman, Xi Yang, Chin Guok, Harvey Newman	2025-08-01	下载	In anticipation of the High Luminosity-LHC era, there is a critical need to oversee software readiness for upcoming growth in network traffic for production and user data analysis access.

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Energy-Aware CPU Orchestration in O-RAN: A dApp-Driven Lightweight Approach	Francisco Crespo, Javier Villegas, Carlos Baena, Eduardo Baena, Sergio Fortes, Raquel Barco	2025-08-01	下载	The transition toward softwarized Radio Access Networks (RANs), driven by the Open RAN (O-RAN) paradigm, enables flexible, vendor-neutral deployments through disaggregation and virtualization of base ...
Composable OS Kernel Architectures for Autonomous Intelligence	Rajpreet Singh, Vidhi Kothari	2025-08-01	下载	As intelligent systems permeate edge devices, cloud infrastructure, and embedded real-time environments, this research proposes a new OS kernel architecture for intelligent systems, transforming kerne...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Efficient Solving of Large Single Input Superstate Decomposable Markovian Decision Process	Youssef Ait El Mahjoub, Jean-Michel Fourneau, Salma Alouah	2025-08-01	下载	Solving Markov Decision Processes (MDPs) remains a central challenge in sequential decision-making, especially when dealing with large state spaces and long-term optimization criteria.
Interpreting Performance Profiles with Deep Learning	Zhuoran Liu	2025-08-01	下载	Profiling tools (also known as profilers) play an important role in understanding program performance at runtime, such as hotspots, bottlenecks, and inefficiencies.
Energy-Aware CPU Orchestration in O-RAN: A dApp-Driven Lightweight Approach	Francisco Crespo, Javier Villegas, Carlos Baena, Eduardo Baena, Sergio Fortes, Raquel Barco	2025-08-01	下载	The transition toward softwarized Radio Access Networks (RANs), driven by the Open RAN (O-RAN) paradigm, enables flexible, vendor-neutral deployments through disaggregation and virtualization of base ...
DGEMM without FP64 Arithmetic - Using FP64 Emulation and FP8 Tensor Cores with Ozaki Scheme	Daichi Mukunoki	2025-08-01	下载	As the demand for AI computation rapidly increases, more hardware is being developed to efficiently perform the low-precision matrix multiplications required by such workloads.
Systematic Evaluation of Optimization Techniques for Long-Context Language Models	Ammar Ahmed, Sheng Di, Franck Cappello, Zirui Liu, Jingoo Han, Ali Anwar	2025-08-01	下载	Large language models (LLMs) excel across diverse natural language processing tasks but face resource demands and limited context windows. Although techniques like pruning, quantization, and token dro...