2024-10-20

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Automated Formal Verification of a Highly-Configurable Register Generator	Shuhang Zhang, Bryan Olmos, Basavaraj Naik	2024-10-20	下载	Registers in IP blocks of an SoC perform a variety of functions, most of which are essential to the SoC operation. The complexity of register implementation is relatively low when compared with other ...
LLC Intra-set Write Balancing	Keshav Krishna, Ayush Verma	2024-10-20	下载	The increasing use of Non-Volatile Memory (NVM) in computer architecture has brought about new challenges, one of which is the write endurance problem.
Fastrack: Fast IO for Secure ML using GPU TEEs	Yongqin Wang, Rachit Rajat, Jonghyun Lee, Tingting Tang, Murali Annavaram	2024-10-20	下载	As cloud-based ML expands, ensuring data security during training and inference is critical. GPU-based Trusted Execution Environments (TEEs) offer secure, high-performance solutions, with CPU TEEs man...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training	Jinda Jia, Cong Xie, Hanlin Lu, Daoce Wang, Hao Feng, Chengming Zhang, Baixi Sun, Haibin Lin, Zhi Zhang, Xin Liu, Dingwen Tao	2024-10-20	下载	Recent years have witnessed a clear trend towards language models with an ever-increasing number of parameters, as well as the growing training overhead and memory usage.
MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models	Ahmed Elbakary, Chaouki Ben Issaid, Tamer ElBatt, Karim Seddik, Mehdi Bennis	2024-10-20	下载	In this paper, we introduce a method for fine-tuning Large Language Models (LLMs), inspired by Multi-Task learning in a federated manner. Our approach leverages the structure of each client's model an...
A Bayesian Framework for Clustered Federated Learning	Peng Wu, Tales Imbiriba, Pau Closas	2024-10-20	下载	One of the main challenges of federated learning (FL) is handling non-independent and identically distributed (non-IID) client data, which may occur in practice due to unbalanced datasets and use of d...
Heuristic-based Dynamic Leiden Algorithm for Efficient Tracking of Communities on Evolving Graphs	Subhajit Sahu	2024-10-20	下载	Community detection, or clustering, identifies groups of nodes in a graph that are more densely connected to each other than to the rest of the network.
EPIC: Efficient Position-Independent Caching for Serving Large Language Models	Junhao Hu, Wenrui Huang, Weidong Wang, Haoyi Wang, Tiancheng Hu, Qin Zhang, Hao Feng, Xusheng Chen, Yizhou Shan, Tao Xie	2024-10-20	下载	Large Language Models (LLMs) show great capabilities in a wide range of applications, but serving them efficiently becomes increasingly challenging as requests (prompts) become more complex.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Slicing for AI: An Online Learning Framework for Network Slicing Supporting AI Services	Menna Helmy, Alaa Awad Abdellatif, Naram Mhaisen, Amr Mohamed, Aiman Erbad	2024-10-20	下载	The forthcoming 6G networks will embrace a new realm of AI-driven services that requires innovative network slicing strategies, namely slicing for AI, which involves the creation of customized network...
Wireless Link Quality Estimation Using LSTM Model	Yuki Kanto, Kohei Watabe	2024-10-20	下载	In recent years, various services have been provided through high-speed and high-capacity wireless networks on mobile communication devices, necessitating stable communication regardless of indoor or ...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Reinforcement Learning for Dynamic Memory Allocation	Arisrei Lim, Abhiram Maddukuri	2024-10-20	下载	In recent years, reinforcement learning (RL) has gained popularity and has been applied to a wide range of tasks. One such popular domain where RL has been effective is resource management problems in...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Real-time Event Joining in Practice With Kafka and Flink	Srijan Saket, Vivek Chandela, Md. Danish Kalim	2024-10-20	下载	Historically, machine learning training pipelines have predominantly relied on batch training models, retraining models every few hours. However, industrial practitioners have proved that real-time tr...
EPIC: Efficient Position-Independent Caching for Serving Large Language Models	Junhao Hu, Wenrui Huang, Weidong Wang, Haoyi Wang, Tiancheng Hu, Qin Zhang, Hao Feng, Xusheng Chen, Yizhou Shan, Tao Xie	2024-10-20	下载	Large Language Models (LLMs) show great capabilities in a wide range of applications, but serving them efficiently becomes increasingly challenging as requests (prompts) become more complex.