Appearance
2024-10-20
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Automated Formal Verification of a Highly-Configurable Register Generator | Shuhang Zhang, Bryan Olmos, Basavaraj Naik | 2024-10-20 | 下载 | Registers in IP blocks of an SoC perform a variety of functions, most of which are essential to the SoC operation. The complexity of register implementation is relatively low when compared with other ... |
| LLC Intra-set Write Balancing | Keshav Krishna, Ayush Verma | 2024-10-20 | 下载 | The increasing use of Non-Volatile Memory (NVM) in computer architecture has brought about new challenges, one of which is the write endurance problem. |
| Fastrack: Fast IO for Secure ML using GPU TEEs | Yongqin Wang, Rachit Rajat, Jonghyun Lee, Tingting Tang, Murali Annavaram | 2024-10-20 | 下载 | As cloud-based ML expands, ensuring data security during training and inference is critical. GPU-based Trusted Execution Environments (TEEs) offer secure, high-performance solutions, with CPU TEEs man... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training | Jinda Jia, Cong Xie, Hanlin Lu, Daoce Wang, Hao Feng, Chengming Zhang, Baixi Sun, Haibin Lin, Zhi Zhang, Xin Liu, Dingwen Tao | 2024-10-20 | 下载 | Recent years have witnessed a clear trend towards language models with an ever-increasing number of parameters, as well as the growing training overhead and memory usage. |
| MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models | Ahmed Elbakary, Chaouki Ben Issaid, Tamer ElBatt, Karim Seddik, Mehdi Bennis | 2024-10-20 | 下载 | In this paper, we introduce a method for fine-tuning Large Language Models (LLMs), inspired by Multi-Task learning in a federated manner. Our approach leverages the structure of each client's model an... |
| A Bayesian Framework for Clustered Federated Learning | Peng Wu, Tales Imbiriba, Pau Closas | 2024-10-20 | 下载 | One of the main challenges of federated learning (FL) is handling non-independent and identically distributed (non-IID) client data, which may occur in practice due to unbalanced datasets and use of d... |
| Heuristic-based Dynamic Leiden Algorithm for Efficient Tracking of Communities on Evolving Graphs | Subhajit Sahu | 2024-10-20 | 下载 | Community detection, or clustering, identifies groups of nodes in a graph that are more densely connected to each other than to the rest of the network. |
| EPIC: Efficient Position-Independent Caching for Serving Large Language Models | Junhao Hu, Wenrui Huang, Weidong Wang, Haoyi Wang, Tiancheng Hu, Qin Zhang, Hao Feng, Xusheng Chen, Yizhou Shan, Tao Xie | 2024-10-20 | 下载 | Large Language Models (LLMs) show great capabilities in a wide range of applications, but serving them efficiently becomes increasingly challenging as requests (prompts) become more complex. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Slicing for AI: An Online Learning Framework for Network Slicing Supporting AI Services | Menna Helmy, Alaa Awad Abdellatif, Naram Mhaisen, Amr Mohamed, Aiman Erbad | 2024-10-20 | 下载 | The forthcoming 6G networks will embrace a new realm of AI-driven services that requires innovative network slicing strategies, namely slicing for AI, which involves the creation of customized network... |
| Wireless Link Quality Estimation Using LSTM Model | Yuki Kanto, Kohei Watabe | 2024-10-20 | 下载 | In recent years, various services have been provided through high-speed and high-capacity wireless networks on mobile communication devices, necessitating stable communication regardless of indoor or ... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Reinforcement Learning for Dynamic Memory Allocation | Arisrei Lim, Abhiram Maddukuri | 2024-10-20 | 下载 | In recent years, reinforcement learning (RL) has gained popularity and has been applied to a wide range of tasks. One such popular domain where RL has been effective is resource management problems in... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Real-time Event Joining in Practice With Kafka and Flink | Srijan Saket, Vivek Chandela, Md. Danish Kalim | 2024-10-20 | 下载 | Historically, machine learning training pipelines have predominantly relied on batch training models, retraining models every few hours. However, industrial practitioners have proved that real-time tr... |
| EPIC: Efficient Position-Independent Caching for Serving Large Language Models | Junhao Hu, Wenrui Huang, Weidong Wang, Haoyi Wang, Tiancheng Hu, Qin Zhang, Hao Feng, Xusheng Chen, Yizhou Shan, Tao Xie | 2024-10-20 | 下载 | Large Language Models (LLMs) show great capabilities in a wide range of applications, but serving them efficiently becomes increasingly challenging as requests (prompts) become more complex. |