Skip to content

2024-01-20

cs.AR - Architecture

标题作者发布日期PDF摘要
SRAM Alpha-SER Estimation From Word-Line Voltage Margin Measurements: Design Architecture and Experimental ResultsGabriel Torrens, Ivan de Paul, Bartomeu Alorda, Sebastia Bota, Jaume Segura2024-01-20下载Experimental results from a 65 nm CMOS commercial technology SRAM test chip reveal a linear correlation between a new electrical parameter -- the word-line voltage margin (VWLVM) -- and the measured c...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Are Your Epochs Too Epic? Batch Free Can Be HarmfulDaewoo Kim, Trevor Brown, Ajay Singh2024-01-20下载Epoch based memory reclamation (EBR) is one of the most popular techniques for reclaiming memory in lock-free and optimistic locking data structures, due to its ease of use and good performance in pra...
BANG: Billion-Scale Approximate Nearest Neighbor Search using a Single GPUKarthik V., Saim Khan, Somesh Singh, Harsha Vardhan Simhadri, Jyothi Vedurada2024-01-20下载Approximate Nearest Neighbour Search (ANNS) is a subroutine in algorithms routinely employed in information retrieval, pattern recognition, data mining, image processing, and beyond.
CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM InferenceSuyi Li, Hanfeng Lu, Tianyuan Wu, Minchen Yu, Qizhen Weng, Xusheng Chen, Yizhou Shan, Binhang Yuan, Wei Wang2024-01-20下载Pre-trained large language models (LLMs) often need specialization for domain-specific tasks. Low-Rank Adaptation (LoRA) is a popular approach that adapts a base model to multiple tasks by adding ligh...
Programming Distributed Collective Processes in the eXchange CalculusGiorgio Audrito, Roberto Casadei, Ferruccio Damiani, Gianluca Torta, Mirko Viroli2024-01-20下载Recent trends like the Internet of Things (IoT) suggest a vision of dense and multi-scale deployments of computing devices in nearly all kinds of environments.
PartIR: Composing SPMD Partitioning Strategies for Machine LearningSami Alabed, Daniel Belov, Bart Chrzaszcz, Juliana Franco, Dominik Grewe, Dougal Maclaurin, James Molloy, Tom Natan, Tamara Norman, Xiaoyue Pan, Adam Paszke, Norman A. Rink, Michael Schaarschmidt, Timur Sitdikov, Agnieszka Swietlik, Dimitrios Vytiniotis, Joel Wee2024-01-20下载Training of modern large neural networks (NN) requires a combination of parallelization strategies encompassing data, model, or optimizer sharding.
Inference without Interference: Disaggregate LLM Inference for Mixed Downstream WorkloadsCunchen Hu, Heyang Huang, Liangliang Xu, Xusheng Chen, Jiang Xu, Shuang Chen, Hao Feng, Chenxi Wang, Sa Wang, Yungang Bao, Ninghui Sun, Yizhou Shan2024-01-20下载Transformer-based large language model (LLM) inference serving is now the backbone of many cloud services. LLM inference consists of a prefill phase and a decode phase.
Scheduling of Distributed Applications on the Computing Continuum: A SurveyNarges Mehran, Dragi Kimovski, Hermann Hellwagner, Dumitru Roman, Ahmet Soylu, Radu Prodan2024-01-20下载The demand for distributed applications has significantly increased over the past decade, with improvements in machine learning techniques fueling this growth.
Combining Cloud and Mobile Computing for Machine LearningRuiqi Xu, Tianchi Zhang2024-01-20下载Although the computing power of mobile devices is increasing, machine learning models are also growing in size. This trend creates problems for mobile devices due to limitations like their memory capa...
FedRKG: A Privacy-preserving Federated Recommendation Framework via Knowledge Graph EnhancementDezhong Yao, Tongtong Liu, Qi Cao, Hai Jin2024-01-20下载Federated Learning (FL) has emerged as a promising approach for preserving data privacy in recommendation systems by training models locally. Recently, Graph Neural Networks (GNN) have gained populari...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
On the Interplay of Artificial Intelligence and Space-Air-Ground Integrated Networks: A SurveyAdilya Bakambekova, Nour Kouzayha, Tareq Al-Naffouri2024-01-20下载Space-Air-Ground Integrated Networks (SAGINs), which incorporate space and aerial networks with terrestrial wireless systems, are vital enablers of the emerging sixth-generation (6G) wireless networks...
Security-Sensitive Task Offloading in Integrated Satellite-Terrestrial NetworksWenjun Lan, Kongyang Chen, Jiannong Cao, Yikai Li, Ning Li, Qi Chen, Yuvraj Sahni2024-01-20下载With the rapid development of sixth-generation (6G) communication technology, global communication networks are moving towards the goal of comprehensive and seamless coverage.

基于 VitePress 构建