Skip to content

2024-10-20

cs.AR - Architecture

标题作者发布日期PDF摘要
Automated Formal Verification of a Highly-Configurable Register GeneratorShuhang Zhang, Bryan Olmos, Basavaraj Naik2024-10-20下载Registers in IP blocks of an SoC perform a variety of functions, most of which are essential to the SoC operation. The complexity of register implementation is relatively low when compared with other ...
LLC Intra-set Write BalancingKeshav Krishna, Ayush Verma2024-10-20下载The increasing use of Non-Volatile Memory (NVM) in computer architecture has brought about new challenges, one of which is the write endurance problem.
Fastrack: Fast IO for Secure ML using GPU TEEsYongqin Wang, Rachit Rajat, Jonghyun Lee, Tingting Tang, Murali Annavaram2024-10-20下载As cloud-based ML expands, ensuring data security during training and inference is critical. GPU-based Trusted Execution Environments (TEEs) offer secure, high-performance solutions, with CPU TEEs man...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM TrainingJinda Jia, Cong Xie, Hanlin Lu, Daoce Wang, Hao Feng, Chengming Zhang, Baixi Sun, Haibin Lin, Zhi Zhang, Xin Liu, Dingwen Tao2024-10-20下载Recent years have witnessed a clear trend towards language models with an ever-increasing number of parameters, as well as the growing training overhead and memory usage.
MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage ModelsAhmed Elbakary, Chaouki Ben Issaid, Tamer ElBatt, Karim Seddik, Mehdi Bennis2024-10-20下载In this paper, we introduce a method for fine-tuning Large Language Models (LLMs), inspired by Multi-Task learning in a federated manner. Our approach leverages the structure of each client's model an...
A Bayesian Framework for Clustered Federated LearningPeng Wu, Tales Imbiriba, Pau Closas2024-10-20下载One of the main challenges of federated learning (FL) is handling non-independent and identically distributed (non-IID) client data, which may occur in practice due to unbalanced datasets and use of d...
Heuristic-based Dynamic Leiden Algorithm for Efficient Tracking of Communities on Evolving GraphsSubhajit Sahu2024-10-20下载Community detection, or clustering, identifies groups of nodes in a graph that are more densely connected to each other than to the rest of the network.
EPIC: Efficient Position-Independent Caching for Serving Large Language ModelsJunhao Hu, Wenrui Huang, Weidong Wang, Haoyi Wang, Tiancheng Hu, Qin Zhang, Hao Feng, Xusheng Chen, Yizhou Shan, Tao Xie2024-10-20下载Large Language Models (LLMs) show great capabilities in a wide range of applications, but serving them efficiently becomes increasingly challenging as requests (prompts) become more complex.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Slicing for AI: An Online Learning Framework for Network Slicing Supporting AI ServicesMenna Helmy, Alaa Awad Abdellatif, Naram Mhaisen, Amr Mohamed, Aiman Erbad2024-10-20下载The forthcoming 6G networks will embrace a new realm of AI-driven services that requires innovative network slicing strategies, namely slicing for AI, which involves the creation of customized network...
Wireless Link Quality Estimation Using LSTM ModelYuki Kanto, Kohei Watabe2024-10-20下载In recent years, various services have been provided through high-speed and high-capacity wireless networks on mobile communication devices, necessitating stable communication regardless of indoor or ...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Reinforcement Learning for Dynamic Memory AllocationArisrei Lim, Abhiram Maddukuri2024-10-20下载In recent years, reinforcement learning (RL) has gained popularity and has been applied to a wide range of tasks. One such popular domain where RL has been effective is resource management problems in...

cs.PF - Performance

标题作者发布日期PDF摘要
Real-time Event Joining in Practice With Kafka and FlinkSrijan Saket, Vivek Chandela, Md. Danish Kalim2024-10-20下载Historically, machine learning training pipelines have predominantly relied on batch training models, retraining models every few hours. However, industrial practitioners have proved that real-time tr...
EPIC: Efficient Position-Independent Caching for Serving Large Language ModelsJunhao Hu, Wenrui Huang, Weidong Wang, Haoyi Wang, Tiancheng Hu, Qin Zhang, Hao Feng, Xusheng Chen, Yizhou Shan, Tao Xie2024-10-20下载Large Language Models (LLMs) show great capabilities in a wide range of applications, but serving them efficiently becomes increasingly challenging as requests (prompts) become more complex.

基于 VitePress 构建