2024-09-01
cs.AR - Hardware Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Research on LLM Acceleration Using the High-Performance RISC-V Processor "Xiangshan" (Nanhu Version) Based on the Open-Source Matrix Instruction Set Extension (Vector Dot Product) | Xu-Hao Chen, Si-Peng Hu, Hong-Chao Liu, Bo-Ran Liu, Dan Tang, Di Zhao | 2024-09-01 | Download | Considering the high-performance and low-power requirements of edge AI, this study designs a specialized instruction set processor for edge AI based on the RISC-V instruction set architecture, address... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Redefining Data-Centric Design: A New Approach with a Domain Model and Core Data Ontology for Computational Systems | William Johnson, James Davis, Tara Kelly | 2024-09-01 | Download | This paper presents an innovative data-centric paradigm for designing computational systems by introducing a new informatics domain model. The proposed model moves away from the conventional node-cent... |
| Federated Aggregation of Mallows Rankings: A Comparative Analysis of Borda and Lehmer Coding | Jin Sima, Vishal Rana, Olgica Milenkovic | 2024-09-01 | Download | Rank aggregation combines multiple ranked lists into a consensus ranking. In fields like biomedical data sharing, rankings may be distributed and require privacy. |
| RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs | Xi Xie, Yuebo Luo, Hongwu Peng, Caiwen Ding | 2024-09-01 | Download | Top-k selection algorithms are fundamental in a wide range of applications, including high-performance computing, information retrieval, big data processing, and neural network model training. |
| Container Data Item: An Abstract Datatype for Efficient Container-based Edge Computing | Md Rezwanur Rahman, Tarun Annapareddy, Shirin Ebadi, Varsha Natarajan, Adarsh Srinivasan, Eric Keller, Shivakant Mishra | 2024-09-01 | Download | We present Container Data Item (CDI), an abstract datatype that allows multiple containers to efficiently operate on a common data item while preserving their strong security and isolation semantics. |
| Universal Finite-State and Self-Stabilizing Computation in Anonymous Dynamic Networks | Giuseppe A. Di Luna, Giovanni Viglietta | 2024-09-01 | Download | A communication network is said to be "anonymous" if its agents are indistinguishable from each other; it is "dynamic" if its communication links may appear or disappear unpredictably over time. |
| HopGNN: Boosting Distributed GNN Training Efficiency via Feature-Centric Model Migration | Weijian Chen, Shuibing He, Haoyang Qu, Xuechen Zhang | 2024-09-01 | Download | Distributed training of graph neural networks (GNNs) has become a crucial technique for processing large graphs. Prevalent GNN frameworks are model-centric, necessitating the transfer of massive graph... |
| Average-case optimization analysis for distributed consensus algorithms on regular graphs | Nhat Trung Nguyen, Alexander Rogozin, Alexander Gasnikov | 2024-09-01 | Download | The consensus problem in distributed computing involves a network of agents aiming to compute the average of their initial vectors through local communication, represented by an undirected graph. |
| Fast Prototyping of Distributed Stream Processing Applications with stream2gym | Md. Monzurul Amin Ifath, Miguel Neves, Israat Haque | 2024-09-01 | Download | Stream processing applications have been widely adopted due to real-time data analytics demands, e.g., fraud detection, video analytics, IoT applications. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Reliability-considered Multi-platoon's Groupcasting using the Resource Sharing Method | Chung-Ming Huang, Yen-Hung Wu, Duy-Tuan Dao | 2024-09-01 | Download | In the context of 5G platoon communications, the Platoon Leader Vehicle (PLV) employs groupcasting to transmit control messages to Platoon Member Vehicles (PMVs). |
| AirCompSim: A Discrete Event Simulator for Air Computing | Baris Yamansavascilar, Atay Ozgovde, Cem Ersoy | 2024-09-01 | Download | Air components, including UAVs, planes, balloons, and satellites have been widely utilized since the fixed capacity of ground infrastructure cannot meet the dynamic load of the users. |
| Fast Prototyping of Distributed Stream Processing Applications with stream2gym | Md. Monzurul Amin Ifath, Miguel Neves, Israat Haque | 2024-09-01 | Download | Stream processing applications have been widely adopted due to real-time data analytics demands, e.g., fraud detection, video analytics, IoT applications. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Scaler: Efficient and Effective Cross Flow Analysis | Steven Tang, Mingcan Xiang, Yang Wang, Bo Wu, Jianjun Chen, Tongping Liu | 2024-09-01 | Download | Performance analysis is challenging as different components (e.g., different libraries and applications) of a complex system can interact with each other. |