2025-05-23

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Titanus: Enabling KV Cache Pruning and Quantization On-the-Fly for LLM Acceleration | Peilin Chen, Xiaoxuan Yang | 2025-05-23 | Download | Large language models (LLMs) have gained great success in various domains. Existing systems cache Key and Value within the attention block to avoid redundant computations. |
| Leveraging Stochastic Depth Training for Adaptive Inference | Guilherme Korol, Antonio Carlos Schneider Beck, Jeronimo Castrillon | 2025-05-23 | Download | Dynamic DNN optimization techniques such as layer-skipping offer increased adaptability and efficiency gains but can lead to i) a larger memory footprint as in decision gates, ii) increased training c... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| CarbonFlex: Enabling Carbon-aware Provisioning and Scheduling for Cloud Clusters | Walid A. Hanafy, Li Wu, David Irwin, Prashant Shenoy | 2025-05-23 | Download | Accelerating computing demand, largely from AI applications, has led to concerns about its carbon footprint. Fortunately, a significant fraction of computing demand comes from batch jobs that are ofte... |
| A Comparative Review of Parallel Exact, Heuristic, Metaheuristic, and Hybrid Optimization Techniques for the Traveling Salesman Problem | Rabab Alkhalifa, Fatima Alkhomayes, Boushra Almazroua, Dana Alhaidan, Maryam Alothman, Jumana Almuhaidib | 2025-05-23 | Download | The Traveling Salesman Problem (TSP) is a well-known NP-hard combinatorial optimization problem with wide-ranging applications in logistics, routing, and intelligent systems. |
| DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence | Hanze Zhang, Kaiming Wang, Rong Chen, Xingda Wei, Haibo Chen | 2025-05-23 | Download | The disaggregated memory (DM) architecture offers high resource elasticity at the cost of data access performance. While caching frequently accessed data in compute nodes (CNs) reduces access overhead... |
| DAG-based Consensus with Asymmetric Trust [Extended Version] | Ignacio Amores-Sesar, Christian Cachin, Juan Villacis, Luca Zanolini | 2025-05-23 | Download | In protocols with asymmetric trust, each participant is free to make its own individual trust assumptions about others, captured by an asymmetric quorum system. |
| Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models | Xuchen Pan, Yanxi Chen, Yushuo Chen, Yuchang Sun, Daoyuan Chen, Wenhao Zhang, Yuexiang Xie, Yilun Huang, Yilei Zhang, Dawei Gao, Weijie Shi, Yaliang Li, Bolin Ding, Jingren Zhou | 2025-05-23 | Download | Trinity-RFT is a general-purpose, unified and easy-to-use framework designed for reinforcement fine-tuning (RFT) of large language models. It is built with a modular and decoupled design, consisting o... |
| DecLock: A Case of Decoupled Locking for Disaggregated Memory | Hanze Zhang, Ke Cheng, Rong Chen, Xingda Wei, Haibo Chen | 2025-05-23 | Download | This paper reveals that locking can significantly degrade the performance of applications on disaggregated memory (DM), sometimes by several orders of magnitude, due to contention on the NICs of memor... |
| H2: Towards Efficient Large-Scale LLM Training on Hyper-Heterogeneous Cluster over 1,000 Chips | Ding Tang, Jiecheng Zhou, Jiakai Hu, Shengwei Li, Huihuang Zheng, Zhilin Pei, Hui Wang, Xingcheng Zhang | 2025-05-23 | Download | Recent advancements in large language models (LLMs) necessitate extensive computational resources, prompting the use of diverse hardware accelerators from multiple vendors. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| ALLSTaR: Automated LLM-Driven Scheduler Generation and Testing for Intent-Based RAN | Maxime Elkael, Michele Polese, Reshma Prasad, Stefano Maxenti, Tommaso Melodia | 2025-05-23 | Download | The evolution toward open, programmable O-RAN and AI-RAN 6G networks creates unprecedented opportunities for Intent-Based Networking (IBN) to dynamically optimize RAN[...]. |
| Neutral-Hosts In The Shared Mid-Bands: Addressing Indoor Cellular Performance | Muhammad Iqbal Rochman, Joshua Roy Palathinkal, Vanlin Sathya, Mehmet Yavuz, Monisha Ghosh | 2025-05-23 | Download | The 3.55 - 3.7 GHz Citizens Broadband Radio Service (CBRS) band in the U.S., shared with incumbent Navy radars, is witnessing increasing deployments both indoors and outdoors using a shared, licensed ... |
| Evaluation of Indoor/Outdoor Sharing in the Unlicensed 6 GHz Band | Seda Dogan-Tusha, Armed Tusha, Muhammad Iqbal Rochman, Hossein Nasiri, Joshua Roy Palathinkal, Mike Atkins, Monisha Ghosh | 2025-05-23 | Download | Standard Power (SP) Wi-Fi 6E in the U.S. is just beginning to be deployed outdoors in the shared but unlicensed 6 GHz band under the control of an Automated Frequency Coordination (AFC) system to prot... |
| EtherBee: A Global Dataset of Ethereum Node Performance Measurements Coupled with Honeypot Interactions and Full Network Sessions | Scott Seidenberger, Anindya Maiti | 2025-05-23 | Download | We introduce EtherBee, a global dataset integrating detailed Ethereum node metrics, network traffic metadata, and honeypot interaction logs collected from ten geographically diverse vantage points ove... |
| Towards a Quantum-classical Augmented Network | Nitin Jha, Abhishek Parakh, Mahadevan Subramaniam | 2025-05-23 | Download | In the past decade, several small-scale quantum key distribution networks have been established. However, the deployment of large-scale quantum networks depends on the development of quantum repeaters... |
| Joint Encryption and Error Correction for Secure Quantum Communication | Nitin Jha, Abhishek Parakh, Mahadevan Subramaniam | 2025-05-23 | Download | Secure quantum networks are a bedrock requirement for developing a future quantum internet. However, quantum channels are susceptible to channel noise that introduce errors in the transmitted data. |
| WakeMod: A 6.9uW Wake-Up Radio Module with -72.6dBm Sensitivity for On-Demand IoT | Lukas Schulthess, Silvano Cortesi, Michele Magno | 2025-05-23 | Download | Large-scale Internet of Things (IoT) applications, such as asset tracking and remote sensing, demand multi-year battery lifetimes to minimize maintenance and operational costs. |
| User-UAV Association for Dynamic User in mmWave Communication for eMBB and URLLC | Siddhanta Parial, Sasthi C. Ghosh, Anil K. Ghosh | 2025-05-23 | Download | In unmanned aerial vehicle (UAV) assisted millimeter wave (mmWave) communication, appropriate user-UAV association is crucial for improving system performance. |
| Prospects and challenges of Bluetooth backscatters system | Jingyun Du | 2025-05-23 | Download | Bluetooth backscatter systems, as a crucial technology for low-power communication in the Internet of Things (IoT), have witnessed remarkable development in recent years. |
| Topology Partitioning-based Self-Organized Localization in Indoor WSNs with Unknown Obstacles | Ze Zhang, Qian Dong | 2025-05-23 | Download | Accurate indoor node localization is critical for practical Wireless Sensor Network (WSN) applications, as Global Positioning System (GPS) fails to provide reliable Line-of-Sight (LoS) conditions in m... |
| Direct Feature Access -- Scaling Network Traffic Feature Collection to Terabit Speed | Lukas Froschauer, Jonatan Langlet, Andreas Kassler | 2025-05-23 | Download | Real-time traffic monitoring is critical for network operators to ensure performance, security, and visibility, especially as encryption becomes the norm. |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Evaluating the impact of the L3 cache size of AMD EPYC CPUs on the performance of CFD applications | Marcin Lawenda, Łukasz Szustak, László Környei, Flavio Cesar Cunha Galeazzo, Paweł Bratek | 2025-05-23 | Download | In this work, the authors focus on assessing the impact of the AMD EPYC processor architecture on the performance of CFD applications. Several generations of architectures were analyzed, such as Rome,... |
| Range-Arithmetic: Verifiable Deep Learning Inference on an Untrusted Party | Ali Rahimi, Babak H. Khalaj, Mohammad Ali Maddah-Ali | 2025-05-23 | Download | Verifiable computing (VC) has gained prominence in decentralized machine learning systems, where resource-intensive tasks like deep neural network (DNN) inference are offloaded to external participant... |
