Appearance
2025-05-23
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Titanus: Enabling KV Cache Pruning and Quantization On-the-Fly for LLM Acceleration | Peilin Chen, Xiaoxuan Yang | 2025-05-23 | 下载 | Large language models (LLMs) have gained great success in various domains. Existing systems cache Key and Value within the attention block to avoid redundant computations. |
| Leveraging Stochastic Depth Training for Adaptive Inference | Guilherme Korol, Antonio Carlos Schneider Beck, Jeronimo Castrillon | 2025-05-23 | 下载 | Dynamic DNN optimization techniques such as layer-skipping offer increased adaptability and efficiency gains but can lead to i) a larger memory footprint as in decision gates, ii) increased training c... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| CarbonFlex: Enabling Carbon-aware Provisioning and Scheduling for Cloud Clusters | Walid A. Hanafy, Li Wu, David Irwin, Prashant Shenoy | 2025-05-23 | 下载 | Accelerating computing demand, largely from AI applications, has led to concerns about its carbon footprint. Fortunately, a significant fraction of computing demand comes from batch jobs that are ofte... |
| A Comparative Review of Parallel Exact, Heuristic, Metaheuristic, and Hybrid Optimization Techniques for the Traveling Salesman Problem | Rabab Alkhalifa, Fatima Alkhomayes, Boushra Almazroua, Dana Alhaidan, Maryam Alothman, Jumana Almuhaidib | 2025-05-23 | 下载 | The Traveling Salesman Problem (TSP) is a well-known NP-hard combinatorial optimization problem with wide-ranging applications in logistics, routing, and intelligent systems. |
| DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence | Hanze Zhang, Kaiming Wang, Rong Chen, Xingda Wei, Haibo Chen | 2025-05-23 | 下载 | The disaggregated memory (DM) architecture offers high resource elasticity at the cost of data access performance. While caching frequently accessed data in compute nodes (CNs) reduces access overhead... |
| DAG-based Consensus with Asymmetric Trust [Extended Version] | Ignacio Amores-Sesar, Christian Cachin, Juan Villacis, Luca Zanolini | 2025-05-23 | 下载 | In protocols with asymmetric trust, each participant is free to make its own individual trust assumptions about others, captured by an asymmetric quorum system. |
| Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models | Xuchen Pan, Yanxi Chen, Yushuo Chen, Yuchang Sun, Daoyuan Chen, Wenhao Zhang, Yuexiang Xie, Yilun Huang, Yilei Zhang, Dawei Gao, Weijie Shi, Yaliang Li, Bolin Ding, Jingren Zhou | 2025-05-23 | 下载 | Trinity-RFT is a general-purpose, unified and easy-to-use framework designed for reinforcement fine-tuning (RFT) of large language models. It is built with a modular and decoupled design, consisting o... |
| DecLock: A Case of Decoupled Locking for Disaggregated Memory | Hanze Zhang, Ke Cheng, Rong Chen, Xingda Wei, Haibo Chen | 2025-05-23 | 下载 | This paper reveals that locking can significantly degrade the performance of applications on disaggregated memory (DM), sometimes by several orders of magnitude, due to contention on the NICs of memor... |
| H2:Towards Efficient Large-Scale LLM Training on Hyper-Heterogeneous Cluster over 1,000 Chips | Ding Tang, Jiecheng Zhou, Jiakai Hu, Shengwei Li, Huihuang Zheng, Zhilin Pei, Hui Wang, Xingcheng Zhang | 2025-05-23 | 下载 | Recent advancements in large language models (LLMs) necessitate extensive computational resources, prompting the use of diverse hardware accelerators from multiple vendors. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| ALLSTaR: Automated LLM-Driven Scheduler Generation and Testing for Intent-Based RAN | Maxime Elkael, Michele Polese, Reshma Prasad, Stefano Maxenti, Tommaso Melodia | 2025-05-23 | 下载 | The evolution toward open, programmable O-RAN and AI-RAN 6G networks creates unprecedented opportunities for Intent-Based Networking (IBN) to dynamically optimize RAN[...]. |
| Neutral-Hosts In The Shared Mid-Bands: Addressing Indoor Cellular Performance | Muhammad Iqbal Rochman, Joshua Roy Palathinkal, Vanlin Sathya, Mehmet Yavuz, Monisha Ghosh | 2025-05-23 | 下载 | The 3.55 - 3.7 GHz Citizens Broadband Radio Service (CBRS) band in the U.S., shared with incumbent Navy radars, is witnessing increasing deployments both indoors and outdoors using a shared, licensed ... |
| Evaluation of Indoor/Outdoor Sharing in the Unlicensed 6 GHz Band | Seda Dogan-Tusha, Armed Tusha, Muhammad Iqbal Rochman, Hossein Nasiri, Joshua Roy Palathinkal, Mike Atkins, Monisha Ghosh | 2025-05-23 | 下载 | Standard Power (SP) Wi-Fi 6E in the U.S. is just beginning to be deployed outdoors in the shared but unlicensed 6 GHz band under the control of an Automated Frequency Coordination (AFC) system to prot... |
| EtherBee: A Global Dataset of Ethereum Node Performance Measurements Coupled with Honeypot Interactions and Full Network Sessions | Scott Seidenberger, Anindya Maiti | 2025-05-23 | 下载 | We introduce EtherBee, a global dataset integrating detailed Ethereum node metrics, network traffic metadata, and honeypot interaction logs collected from ten geographically diverse vantage points ove... |
| Towards a Quantum-classical Augmented Network | Nitin Jha, Abhishek Parakh, Mahadevan Subramaniam | 2025-05-23 | 下载 | In the past decade, several small-scale quantum key distribution networks have been established. However, the deployment of large-scale quantum networks depends on the development of quantum repeaters... |
| Joint Encryption and Error Correction for Secure Quantum Communication | Nitin Jha, Abhishek Parakh, Mahadevan Subramaniam | 2025-05-23 | 下载 | Secure quantum networks are a bedrock requirement for developing a future quantum internet. However, quantum channels are susceptible to channel noise that introduce errors in the transmitted data. |
| WakeMod: A 6.9uW Wake-Up Radio Module with -72.6dBm Sensitivity for On-Demand IoT | Lukas Schulthess, Silvano Cortesi, Michele Magno | 2025-05-23 | 下载 | Large-scale Internet of Things (IoT) applications, such as asset tracking and remote sensing, demand multi-year battery lifetimes to minimize maintenance and operational costs. |
| User-UAV Association for Dynamic User in mmWave Communication for eMBB and URLLC | Siddhanta Parial, Sasthi C. Ghosh, Anil K. Ghosh | 2025-05-23 | 下载 | In unmanned aerial vehicle (UAV) assisted millimeter wave (mmWave) communication, appropriate user-UAV association is crucial for improving system performance. |
| Prospects and challenges of Bluetooth backscatters system | Jingyun Du | 2025-05-23 | 下载 | Bluetooth backscatter systems, as a crucial technology for low-power communication in the Internet of Things (IoT), have witnessed remarkable development in recent years. |
| Topology Partitioning-based Self-Organized Localization in Indoor WSNs with Unknown Obstacles | Ze Zhang, Qian Dong | 2025-05-23 | 下载 | Accurate indoor node localization is critical for practical Wireless Sensor Network (WSN) applications, as Global Positioning System (GPS) fails to provide reliable Line-of-Sight (LoS) conditions in m... |
| Direct Feature Access -- Scaling Network Traffic Feature Collection to Terabit Speed | Lukas Froschauer, Jonatan Langlet, Andreas Kassler | 2025-05-23 | 下载 | Real-time traffic monitoring is critical for network operators to ensure performance, security, and visibility, especially as encryption becomes the norm. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Evaluating the impact of the L3 cache size of AMD EPYC CPUs on the performance of CFD applications | Marcin Lawenda, Łukasz Szustak, László Környei, Flavio Cesar Cunha Galeazzo, Paweł Bratek | 2025-05-23 | 下载 | In this work, the authors focus on assessing the impact of the AMD EPYC processor architecture on the performance of CFD applications. Several generations of architectures were analyzed, such as Rome,... |
| \texttt{Range-Arithmetic}: Verifiable Deep Learning Inference on an Untrusted Party | Ali Rahimi, Babak H. Khalaj, Mohammad Ali Maddah-Ali | 2025-05-23 | 下载 | Verifiable computing (VC) has gained prominence in decentralized machine learning systems, where resource-intensive tasks like deep neural network (DNN) inference are offloaded to external participant... |