Skip to content

2024-09-01

cs.AR - Architecture

标题作者发布日期PDF摘要
Research on LLM Acceleration Using the High-Performance RISC-V Processor "Xiangshan" (Nanhu Version) Based on the Open-Source Matrix Instruction Set Extension (Vector Dot Product)Xu-Hao Chen, Si-Peng Hu, Hong-Chao Liu, Bo-Ran Liu, Dan Tang, Di Zhao2024-09-01下载Considering the high-performance and low-power requirements of edge AI, this study designs a specialized instruction set processor for edge AI based on the RISC-V instruction set architecture, address...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Redefining Data-Centric Design: A New Approach with a Domain Model and Core Data Ontology for Computational SystemsWilliam Johnson, James Davis, Tara Kelly2024-09-01下载This paper presents an innovative data-centric paradigm for designing computational systems by introducing a new informatics domain model. The proposed model moves away from the conventional node-cent...
Federated Aggregation of Mallows Rankings: A Comparative Analysis of Borda and Lehmer CodingJin Sima, Vishal Rana, Olgica Milenkovic2024-09-01下载Rank aggregation combines multiple ranked lists into a consensus ranking. In fields like biomedical data sharing, rankings may be distributed and require privacy.
RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUsXi Xie, Yuebo Luo, Hongwu Peng, Caiwen Ding2024-09-01下载Top-k selection algorithms are fundamental in a wide range of applications, including high-performance computing, information retrieval, big data processing, and neural network model training.
Container Data Item: An Abstract Datatype for Efficient Container-based Edge ComputingMd Rezwanur Rahman, Tarun Annapareddy, Shirin Ebadi, Varsha Natarajan, Adarsh Srinivasan, Eric Keller, Shivakant Mishra2024-09-01下载We present Container Data Item (CDI), an abstract datatype that allows multiple containers to efficiently operate on a common data item while preserving their strong security and isolation semantics.
Universal Finite-State and Self-Stabilizing Computation in Anonymous Dynamic NetworksGiuseppe A. Di Luna, Giovanni Viglietta2024-09-01下载A communication network is said to be "anonymous" if its agents are indistinguishable from each other; it is "dynamic" if its communication links may appear or disappear unpredictably over time.
HopGNN: Boosting Distributed GNN Training Efficiency via Feature-Centric Model MigrationWeijian Chen, Shuibing He, Haoyang Qu, Xuechen Zhang2024-09-01下载Distributed training of graph neural networks (GNNs) has become a crucial technique for processing large graphs. Prevalent GNN frameworks are model-centric, necessitating the transfer of massive graph...
Average-case optimization analysis for distributed consensus algorithms on regular graphsNhat Trung Nguyen, Alexander Rogozin, Alexander Gasnikov2024-09-01下载The consensus problem in distributed computing involves a network of agents aiming to compute the average of their initial vectors through local communication, represented by an undirected graph.
Fast Prototyping of Distributed Stream Processing Applications with stream2gymMd. Monzurul Amin Ifath, Miguel Neves, Israat Haque2024-09-01下载Stream processing applications have been widely adopted due to real-time data analytics demands, e.g., fraud detection, video analytics, IoT applications.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Reliability-considered Multi-platoon's Groupcasting using the Resource Sharing MethodChung-Ming Huang, Yen-Hung Wu, Duy-Tuan Dao2024-09-01下载In the context of 5G platoon communications, the Platoon Leader Vehicle (PLV) employs groupcasting to transmit control messages to Platoon Member Vehicles (PMVs).
AirCompSim: A Discrete Event Simulator for Air ComputingBaris Yamansavascilar, Atay Ozgovde, Cem Ersoy2024-09-01下载Air components, including UAVs, planes, balloons, and satellites have been widely utilized since the fixed capacity of ground infrastructure cannot meet the dynamic load of the users.
Fast Prototyping of Distributed Stream Processing Applications with stream2gymMd. Monzurul Amin Ifath, Miguel Neves, Israat Haque2024-09-01下载Stream processing applications have been widely adopted due to real-time data analytics demands, e.g., fraud detection, video analytics, IoT applications.

cs.PF - Performance

标题作者发布日期PDF摘要
Scaler: Efficient and Effective Cross Flow AnalysisSteven, Tang, Mingcan Xiang, Yang Wang, Bo Wu, Jianjun Chen, Tongping Liu2024-09-01下载Performance analysis is challenging as different components (e.g.,different libraries, and applications) of a complex system can interact with each other.

基于 VitePress 构建