2025-05-23

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Titanus: Enabling KV Cache Pruning and Quantization On-the-Fly for LLM Acceleration	Peilin Chen, Xiaoxuan Yang	2025-05-23	下载	Large language models (LLMs) have gained great success in various domains. Existing systems cache Key and Value within the attention block to avoid redundant computations.
Leveraging Stochastic Depth Training for Adaptive Inference	Guilherme Korol, Antonio Carlos Schneider Beck, Jeronimo Castrillon	2025-05-23	下载	Dynamic DNN optimization techniques such as layer-skipping offer increased adaptability and efficiency gains but can lead to i) a larger memory footprint as in decision gates, ii) increased training c...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
CarbonFlex: Enabling Carbon-aware Provisioning and Scheduling for Cloud Clusters	Walid A. Hanafy, Li Wu, David Irwin, Prashant Shenoy	2025-05-23	下载	Accelerating computing demand, largely from AI applications, has led to concerns about its carbon footprint. Fortunately, a significant fraction of computing demand comes from batch jobs that are ofte...
A Comparative Review of Parallel Exact, Heuristic, Metaheuristic, and Hybrid Optimization Techniques for the Traveling Salesman Problem	Rabab Alkhalifa, Fatima Alkhomayes, Boushra Almazroua, Dana Alhaidan, Maryam Alothman, Jumana Almuhaidib	2025-05-23	下载	The Traveling Salesman Problem (TSP) is a well-known NP-hard combinatorial optimization problem with wide-ranging applications in logistics, routing, and intelligent systems.
DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence	Hanze Zhang, Kaiming Wang, Rong Chen, Xingda Wei, Haibo Chen	2025-05-23	下载	The disaggregated memory (DM) architecture offers high resource elasticity at the cost of data access performance. While caching frequently accessed data in compute nodes (CNs) reduces access overhead...
DAG-based Consensus with Asymmetric Trust [Extended Version]	Ignacio Amores-Sesar, Christian Cachin, Juan Villacis, Luca Zanolini	2025-05-23	下载	In protocols with asymmetric trust, each participant is free to make its own individual trust assumptions about others, captured by an asymmetric quorum system.
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models	Xuchen Pan, Yanxi Chen, Yushuo Chen, Yuchang Sun, Daoyuan Chen, Wenhao Zhang, Yuexiang Xie, Yilun Huang, Yilei Zhang, Dawei Gao, Weijie Shi, Yaliang Li, Bolin Ding, Jingren Zhou	2025-05-23	下载	Trinity-RFT is a general-purpose, unified and easy-to-use framework designed for reinforcement fine-tuning (RFT) of large language models. It is built with a modular and decoupled design, consisting o...
DecLock: A Case of Decoupled Locking for Disaggregated Memory	Hanze Zhang, Ke Cheng, Rong Chen, Xingda Wei, Haibo Chen	2025-05-23	下载	This paper reveals that locking can significantly degrade the performance of applications on disaggregated memory (DM), sometimes by several orders of magnitude, due to contention on the NICs of memor...
H2:Towards Efficient Large-Scale LLM Training on Hyper-Heterogeneous Cluster over 1,000 Chips	Ding Tang, Jiecheng Zhou, Jiakai Hu, Shengwei Li, Huihuang Zheng, Zhilin Pei, Hui Wang, Xingcheng Zhang	2025-05-23	下载	Recent advancements in large language models (LLMs) necessitate extensive computational resources, prompting the use of diverse hardware accelerators from multiple vendors.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
ALLSTaR: Automated LLM-Driven Scheduler Generation and Testing for Intent-Based RAN	Maxime Elkael, Michele Polese, Reshma Prasad, Stefano Maxenti, Tommaso Melodia	2025-05-23	下载	The evolution toward open, programmable O-RAN and AI-RAN 6G networks creates unprecedented opportunities for Intent-Based Networking (IBN) to dynamically optimize RAN[...].
Neutral-Hosts In The Shared Mid-Bands: Addressing Indoor Cellular Performance	Muhammad Iqbal Rochman, Joshua Roy Palathinkal, Vanlin Sathya, Mehmet Yavuz, Monisha Ghosh	2025-05-23	下载	The 3.55 - 3.7 GHz Citizens Broadband Radio Service (CBRS) band in the U.S., shared with incumbent Navy radars, is witnessing increasing deployments both indoors and outdoors using a shared, licensed ...
Evaluation of Indoor/Outdoor Sharing in the Unlicensed 6 GHz Band	Seda Dogan-Tusha, Armed Tusha, Muhammad Iqbal Rochman, Hossein Nasiri, Joshua Roy Palathinkal, Mike Atkins, Monisha Ghosh	2025-05-23	下载	Standard Power (SP) Wi-Fi 6E in the U.S. is just beginning to be deployed outdoors in the shared but unlicensed 6 GHz band under the control of an Automated Frequency Coordination (AFC) system to prot...
EtherBee: A Global Dataset of Ethereum Node Performance Measurements Coupled with Honeypot Interactions and Full Network Sessions	Scott Seidenberger, Anindya Maiti	2025-05-23	下载	We introduce EtherBee, a global dataset integrating detailed Ethereum node metrics, network traffic metadata, and honeypot interaction logs collected from ten geographically diverse vantage points ove...
Towards a Quantum-classical Augmented Network	Nitin Jha, Abhishek Parakh, Mahadevan Subramaniam	2025-05-23	下载	In the past decade, several small-scale quantum key distribution networks have been established. However, the deployment of large-scale quantum networks depends on the development of quantum repeaters...
Joint Encryption and Error Correction for Secure Quantum Communication	Nitin Jha, Abhishek Parakh, Mahadevan Subramaniam	2025-05-23	下载	Secure quantum networks are a bedrock requirement for developing a future quantum internet. However, quantum channels are susceptible to channel noise that introduce errors in the transmitted data.
WakeMod: A 6.9uW Wake-Up Radio Module with -72.6dBm Sensitivity for On-Demand IoT	Lukas Schulthess, Silvano Cortesi, Michele Magno	2025-05-23	下载	Large-scale Internet of Things (IoT) applications, such as asset tracking and remote sensing, demand multi-year battery lifetimes to minimize maintenance and operational costs.
User-UAV Association for Dynamic User in mmWave Communication for eMBB and URLLC	Siddhanta Parial, Sasthi C. Ghosh, Anil K. Ghosh	2025-05-23	下载	In unmanned aerial vehicle (UAV) assisted millimeter wave (mmWave) communication, appropriate user-UAV association is crucial for improving system performance.
Prospects and challenges of Bluetooth backscatters system	Jingyun Du	2025-05-23	下载	Bluetooth backscatter systems, as a crucial technology for low-power communication in the Internet of Things (IoT), have witnessed remarkable development in recent years.
Topology Partitioning-based Self-Organized Localization in Indoor WSNs with Unknown Obstacles	Ze Zhang, Qian Dong	2025-05-23	下载	Accurate indoor node localization is critical for practical Wireless Sensor Network (WSN) applications, as Global Positioning System (GPS) fails to provide reliable Line-of-Sight (LoS) conditions in m...
Direct Feature Access -- Scaling Network Traffic Feature Collection to Terabit Speed	Lukas Froschauer, Jonatan Langlet, Andreas Kassler	2025-05-23	下载	Real-time traffic monitoring is critical for network operators to ensure performance, security, and visibility, especially as encryption becomes the norm.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Evaluating the impact of the L3 cache size of AMD EPYC CPUs on the performance of CFD applications	Marcin Lawenda, Łukasz Szustak, László Környei, Flavio Cesar Cunha Galeazzo, Paweł Bratek	2025-05-23	下载	In this work, the authors focus on assessing the impact of the AMD EPYC processor architecture on the performance of CFD applications. Several generations of architectures were analyzed, such as Rome,...
\texttt{Range-Arithmetic}: Verifiable Deep Learning Inference on an Untrusted Party	Ali Rahimi, Babak H. Khalaj, Mohammad Ali Maddah-Ali	2025-05-23	下载	Verifiable computing (VC) has gained prominence in decentralized machine learning systems, where resource-intensive tasks like deep neural network (DNN) inference are offloaded to external participant...