Skip to content

2025-08-01

cs.AR - Architecture

标题作者发布日期PDF摘要
E2ATST: A Temporal-Spatial Optimized Energy-Efficient Architecture for Training Spiking TransformerYunhao Ma, Yanyu Lin, Mingjing Li, Puli Quan, Chenlin Zhou, Wenyue Zhang, Zhiwei Zhong, Wanyi Jia, Xueke Zhu, Qingyan Meng, Huihui Zhou, Fengwei An2025-08-01下载(1) Pengcheng Laboratory, (2) Southern University of Science and Technology, (3) Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, (4) University of Chinese Academy of Sciences
DGEMM without FP64 Arithmetic - Using FP64 Emulation and FP8 Tensor Cores with Ozaki SchemeDaichi Mukunoki2025-08-01下载As the demand for AI computation rapidly increases, more hardware is being developed to efficiently perform the low-precision matrix multiplications required by such workloads.
Reimagining Voltage-Controlled Cryogenic Boolean Logic Paradigm with Quantum-Enhanced Josephson Junction FETsMd Mazharul Islam, Diego Ferrer, Shamiul Alam, Juan P. Mendez, Denis Mamaluy, Wei Pan, Ahmedullah Aziz2025-08-01下载The growing demand for ultra low power computing and the emergence of quantum technologies have intensified interest in cryogenic electronics, particularly superconducting devices.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
AdVAR-DNN: Adversarial Misclassification Attack on Collaborative DNN InferenceShima Yousefi, Motahare Mounesan, Saptarshi Debroy2025-08-01下载In recent years, Deep Neural Networks (DNNs) have become increasingly integral to IoT-based environments, enabling realtime visual computing. However, the limited computational capacity of these devic...
Optimal Scheduling Algorithms for LLM Inference: Theory and PracticeAgrim Bari, Parikshit Hegde, Gustavo de Veciana2025-08-01下载With the growing use of Large Language Model (LLM)-based tools like ChatGPT, Perplexity, and Gemini across industries, there is a rising need for efficient LLM inference systems.
Adacc: An Adaptive Framework Unifying Compression and Activation Recomputation for LLM TrainingPing Chen, Zhuohong Deng, Ping Li, Shuibing He, Hongzi Zhu, Yi Zheng, Zhefeng Wang, Baoxing Huai, Minyi Guo2025-08-01下载Training large language models (LLMs) is often constrained by GPU memory limitations. To alleviate memory pressure, activation recomputation and data compression have been proposed as two major strate...
FedGuard: A Diverse-Byzantine-Robust Mechanism for Federated Learning with Major Malicious ClientsHaocheng Jiang, Hua Shen, Jixin Zhang, Willy Susilo, Mingwu Zhang2025-08-01下载Federated learning is a distributed training framework vulnerable to Byzantine attacks, particularly when over 50% of clients are malicious or when datasets are highly non-independent and identically ...
SwarmRaft: Leveraging Consensus for Robust Drone Swarm Coordination in GNSS-Degraded EnvironmentsKapel Dev, Yash Madhwal, Sofia Shevelo, Pavel Osinenko, Yury Yanovich2025-08-01下载Unmanned aerial vehicle (UAV) swarms are increasingly used in critical applications such as aerial mapping, environmental monitoring, and autonomous delivery.
A Parallel Alternative for Energy-Efficient Neural Network Training and InferencingSudip K. Seal, Maksudul Alam, Jorge Ramirez, Sajal Dash, Hao Lu2025-08-01下载Energy efficiency of training and inferencing with large neural network models is a critical challenge facing the future of sustainable large-scale machine learning workloads.
Information-Theoretic Decentralized Secure Aggregation with Passive Collusion ResilienceXiang Zhang, Zhou Li, Shuangyang Li, Kai Wan, Derrick Wing Kwan Ng, Giuseppe Caire2025-08-01下载In decentralized federated learning (FL), multiple clients collaboratively learn a shared machine learning (ML) model by leveraging their privately held datasets distributed across the network, throug...
Tetris: Efficient Intra-Datacenter Calls Packing for Large Conferencing ServicesRohan Gandhi, Ankur Mallick, Ken Sueda, Rui Liang2025-08-01下载Conference services like Zoom, Microsoft Teams, and Google Meet facilitate millions of daily calls, yet ensuring high performance at low costs remains a significant challenge.
Integrated user scheduling and beam steering in over-the-air federated learning for mobile IoTShengheng Liu, Ningning Fu, Zhonghao Zhang, Yongming Huang, Tony Q. S. Quek2025-08-01下载The rising popularity of Internet of things (IoT) has spurred technological advancements in mobile internet and interconnected systems. While offering flexible connectivity and intelligent application...
Quality-of-Service Aware LLM Routing for Edge Computing with Multiple ExpertsJin Yang, Qiong Wu, Zhiying Feng, Zhi Zhou, Deke Guo, Xu Chen2025-08-01下载Large Language Models (LLMs) have demonstrated remarkable capabilities, leading to a significant increase in user demand for LLM services. However, cloud-based LLM services often suffer from high late...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Connectivity Management in Satellite-Aided Vehicular Networks with Multi-Head Attention-Based State EstimationIbrahim Althamary, Chen-Fu Chou, Chih-Wei Huang2025-08-01下载Managing connectivity in integrated satellite-terrestrial vehicular networks is critical for 6G, yet is challenged by dynamic conditions and partial observability.
A Deep Reinforcement Learning-Based TCP Congestion Control Algorithm: Design, Simulation, and EvaluationEfe Ağlamazlar, Emirhan Eken, Harun Batur Geçici2025-08-01下载This paper introduces a Deep Reinforcement Learning (DRL) based TCP congestion-control algorithm that uses a Deep Q-Network (DQN) to adapt the congestion window (cWnd) dynamically based on observed ne...
Data Movement Manager (DMM) for the SENSE-Rucio Interoperation PrototypeAashay Arora, Diego Davila, Jonathan Guiang, Frank Würthwein, Harvey Newman, Justas Balcas, Tom Lehman, Xi Yang2025-08-01下载The Data Movement Manager (DMM) is a prototype interface that connects CERN's data management software, Rucio, with the Sofware-Defined Networking (SDN) service SENSE by ESNet.
Overlapping IPv4, IPv6, and TCP data: exploring errors, test case context and multiple overlaps inside network stacks and NIDSes with PYROLYSELucas Aubard, Johan Mazel, Gilles Guette, Pierre Chifflier2025-08-01下载IP fragmentation and TCP segmentation allow for splitting large data packets into smaller ones, e.g., for transmission across network links of limited capacity.
Deep Joint Source-Channel Coding for Small Satellite ApplicationsOlga Kondrateva, Grace Li Zhang, Julian Zobel, Björn Scheuermann, Stefan Dietzel2025-08-01下载Small satellites used for Earth observation generate vast amounts of high-dimensional data, but their operation in low Earth orbit creates a significant communication bottleneck due to limited contact...
Criticality-Based Dynamic Topology Optimization for Enhancing Aerial-Marine Swarm ResilienceRuiyang Huang, Haocheng Wang, Yixuan Shen, Ning Gao, Qiang Ni, Shi Jin, Yifan Wu2025-08-01下载Heterogeneous marine-aerial swarm networks encounter substantial difficulties due to targeted communication disruptions and structural weaknesses in adversarial environments.
Energy-Aware CPU Orchestration in O-RAN: A dApp-Driven Lightweight ApproachFrancisco Crespo, Javier Villegas, Carlos Baena, Eduardo Baena, Sergio Fortes, Raquel Barco2025-08-01下载The transition toward softwarized Radio Access Networks (RANs), driven by the Open RAN (O-RAN) paradigm, enables flexible, vendor-neutral deployments through disaggregation and virtualization of base ...
Joint Association and Phase Shifts Design for UAV-mounted Stacked Intelligent Metasurfaces-assisted CommunicationsMingzhe Fan, Geng Sun, Hongyang Pan, Jiacheng Wang, Jiancheng An, Hongyang Du, Chau Yuen2025-08-01下载Stacked intelligent metasurfaces (SIMs) have emerged as a promising technology for realizing wave-domain signal processing, while the fixed SIMs will limit the communication performance of the system ...
Enhancing Wireless Networks for IoT with Large Vision Models: Foundations and ApplicationsYunting Xu, Jiacheng Wang, Ruichen Zhang, Dusit Niyato, Deepu Rajan, Liang Yu, Haibo Zhou, Abbas Jamalipour, Xianbin Wang2025-08-01下载Large vision models (LVMs) have emerged as a foundational paradigm in visual intelligence, achieving state-of-the-art performance across diverse visual tasks.
Mamba for Wireless Communications and Networking: Principles and OpportunitiesRongsheng Zhang, Ruichen Zhang, Yang Lu, Wei Chen, Bo Ai, Dusit Niyato2025-08-01下载Mamba has emerged as a powerful model for efficiently addressing tasks involving temporal and spatial data. Regarding the escalating heterogeneity and dynamics in wireless networks, Mamba holds the po...
Multi-grained spatial-temporal feature complementarity for accurate online cellular traffic predictionNingning Fu, Shengheng Liu, Weiliang Xie, Yongming Huang2025-08-01下载Knowledge discovered from telecom data can facilitate proactive understanding of network dynamics and user behaviors, which in turn empowers service providers to optimize cellular traffic scheduling a...
Energy Efficient Trajectory Control and Resource Allocation in Multi-UAV-assisted MEC via Deep Reinforcement LearningSaichao Liu, Geng Sun, Chuang Zhang, Xuejie Liu, Jiacheng Wang, Changyuan Zhao, Dusit Niyato2025-08-01下载Mobile edge computing (MEC) is a promising technique to improve the computational capacity of smart devices (SDs) in Internet of Things (IoT).
Large AI Model-Enabled Secure Communications in Low-Altitude Wireless Networks: Concepts, Perspectives and Case StudyChuang Zhang, Geng Sun, Yijing Lin, Weijie Yuan, Sinem Coleri, Dusit Niyato2025-08-01下载Low-altitude wireless networks (LAWNs) have the potential to revolutionize communications by supporting a range of applications, including urban parcel delivery, aerial inspections and air taxis.
Quality-of-Service Aware LLM Routing for Edge Computing with Multiple ExpertsJin Yang, Qiong Wu, Zhiying Feng, Zhi Zhou, Deke Guo, Xu Chen2025-08-01下载Large Language Models (LLMs) have demonstrated remarkable capabilities, leading to a significant increase in user demand for LLM services. However, cloud-based LLM services often suffer from high late...
Benchmarking XRootD-HTTPS on 400Gbps Links with Variable LatenciesAashay Arora, Diego Davila, Frank Würthwein, John Graham, Dima Mishin, Justas Balcas, Tom Lehman, Xi Yang, Chin Guok, Harvey Newman2025-08-01下载In anticipation of the High Luminosity-LHC era, there is a critical need to oversee software readiness for upcoming growth in network traffic for production and user data analysis access.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Energy-Aware CPU Orchestration in O-RAN: A dApp-Driven Lightweight ApproachFrancisco Crespo, Javier Villegas, Carlos Baena, Eduardo Baena, Sergio Fortes, Raquel Barco2025-08-01下载The transition toward softwarized Radio Access Networks (RANs), driven by the Open RAN (O-RAN) paradigm, enables flexible, vendor-neutral deployments through disaggregation and virtualization of base ...
Composable OS Kernel Architectures for Autonomous IntelligenceRajpreet Singh, Vidhi Kothari2025-08-01下载As intelligent systems permeate edge devices, cloud infrastructure, and embedded real-time environments, this research proposes a new OS kernel architecture for intelligent systems, transforming kerne...

cs.PF - Performance

标题作者发布日期PDF摘要
Efficient Solving of Large Single Input Superstate Decomposable Markovian Decision ProcessYoussef Ait El Mahjoub, Jean-Michel Fourneau, Salma Alouah2025-08-01下载Solving Markov Decision Processes (MDPs) remains a central challenge in sequential decision-making, especially when dealing with large state spaces and long-term optimization criteria.
Interpreting Performance Profiles with Deep LearningZhuoran Liu2025-08-01下载Profiling tools (also known as profilers) play an important role in understanding program performance at runtime, such as hotspots, bottlenecks, and inefficiencies.
Energy-Aware CPU Orchestration in O-RAN: A dApp-Driven Lightweight ApproachFrancisco Crespo, Javier Villegas, Carlos Baena, Eduardo Baena, Sergio Fortes, Raquel Barco2025-08-01下载The transition toward softwarized Radio Access Networks (RANs), driven by the Open RAN (O-RAN) paradigm, enables flexible, vendor-neutral deployments through disaggregation and virtualization of base ...
DGEMM without FP64 Arithmetic - Using FP64 Emulation and FP8 Tensor Cores with Ozaki SchemeDaichi Mukunoki2025-08-01下载As the demand for AI computation rapidly increases, more hardware is being developed to efficiently perform the low-precision matrix multiplications required by such workloads.
Systematic Evaluation of Optimization Techniques for Long-Context Language ModelsAmmar Ahmed, Sheng Di, Franck Cappello, Zirui Liu, Jingoo Han, Ali Anwar2025-08-01下载Large language models (LLMs) excel across diverse natural language processing tasks but face resource demands and limited context windows. Although techniques like pruning, quantization, and token dro...

基于 VitePress 构建