Appearance
2024-01-20
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| SRAM Alpha-SER Estimation From Word-Line Voltage Margin Measurements: Design Architecture and Experimental Results | Gabriel Torrens, Ivan de Paul, Bartomeu Alorda, Sebastia Bota, Jaume Segura | 2024-01-20 | 下载 | Experimental results from a 65 nm CMOS commercial technology SRAM test chip reveal a linear correlation between a new electrical parameter -- the word-line voltage margin (VWLVM) -- and the measured c... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Are Your Epochs Too Epic? Batch Free Can Be Harmful | Daewoo Kim, Trevor Brown, Ajay Singh | 2024-01-20 | 下载 | Epoch based memory reclamation (EBR) is one of the most popular techniques for reclaiming memory in lock-free and optimistic locking data structures, due to its ease of use and good performance in pra... |
| BANG: Billion-Scale Approximate Nearest Neighbor Search using a Single GPU | Karthik V., Saim Khan, Somesh Singh, Harsha Vardhan Simhadri, Jyothi Vedurada | 2024-01-20 | 下载 | Approximate Nearest Neighbour Search (ANNS) is a subroutine in algorithms routinely employed in information retrieval, pattern recognition, data mining, image processing, and beyond. |
| CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM Inference | Suyi Li, Hanfeng Lu, Tianyuan Wu, Minchen Yu, Qizhen Weng, Xusheng Chen, Yizhou Shan, Binhang Yuan, Wei Wang | 2024-01-20 | 下载 | Pre-trained large language models (LLMs) often need specialization for domain-specific tasks. Low-Rank Adaptation (LoRA) is a popular approach that adapts a base model to multiple tasks by adding ligh... |
| Programming Distributed Collective Processes in the eXchange Calculus | Giorgio Audrito, Roberto Casadei, Ferruccio Damiani, Gianluca Torta, Mirko Viroli | 2024-01-20 | 下载 | Recent trends like the Internet of Things (IoT) suggest a vision of dense and multi-scale deployments of computing devices in nearly all kinds of environments. |
| PartIR: Composing SPMD Partitioning Strategies for Machine Learning | Sami Alabed, Daniel Belov, Bart Chrzaszcz, Juliana Franco, Dominik Grewe, Dougal Maclaurin, James Molloy, Tom Natan, Tamara Norman, Xiaoyue Pan, Adam Paszke, Norman A. Rink, Michael Schaarschmidt, Timur Sitdikov, Agnieszka Swietlik, Dimitrios Vytiniotis, Joel Wee | 2024-01-20 | 下载 | Training of modern large neural networks (NN) requires a combination of parallelization strategies encompassing data, model, or optimizer sharding. |
| Inference without Interference: Disaggregate LLM Inference for Mixed Downstream Workloads | Cunchen Hu, Heyang Huang, Liangliang Xu, Xusheng Chen, Jiang Xu, Shuang Chen, Hao Feng, Chenxi Wang, Sa Wang, Yungang Bao, Ninghui Sun, Yizhou Shan | 2024-01-20 | 下载 | Transformer-based large language model (LLM) inference serving is now the backbone of many cloud services. LLM inference consists of a prefill phase and a decode phase. |
| Scheduling of Distributed Applications on the Computing Continuum: A Survey | Narges Mehran, Dragi Kimovski, Hermann Hellwagner, Dumitru Roman, Ahmet Soylu, Radu Prodan | 2024-01-20 | 下载 | The demand for distributed applications has significantly increased over the past decade, with improvements in machine learning techniques fueling this growth. |
| Combining Cloud and Mobile Computing for Machine Learning | Ruiqi Xu, Tianchi Zhang | 2024-01-20 | 下载 | Although the computing power of mobile devices is increasing, machine learning models are also growing in size. This trend creates problems for mobile devices due to limitations like their memory capa... |
| FedRKG: A Privacy-preserving Federated Recommendation Framework via Knowledge Graph Enhancement | Dezhong Yao, Tongtong Liu, Qi Cao, Hai Jin | 2024-01-20 | 下载 | Federated Learning (FL) has emerged as a promising approach for preserving data privacy in recommendation systems by training models locally. Recently, Graph Neural Networks (GNN) have gained populari... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| On the Interplay of Artificial Intelligence and Space-Air-Ground Integrated Networks: A Survey | Adilya Bakambekova, Nour Kouzayha, Tareq Al-Naffouri | 2024-01-20 | 下载 | Space-Air-Ground Integrated Networks (SAGINs), which incorporate space and aerial networks with terrestrial wireless systems, are vital enablers of the emerging sixth-generation (6G) wireless networks... |
| Security-Sensitive Task Offloading in Integrated Satellite-Terrestrial Networks | Wenjun Lan, Kongyang Chen, Jiannong Cao, Yikai Li, Ning Li, Qi Chen, Yuvraj Sahni | 2024-01-20 | 下载 | With the rapid development of sixth-generation (6G) communication technology, global communication networks are moving towards the goal of comprehensive and seamless coverage. |