Appearance
2024-05-27
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Multi-qubit Lattice Surgery Scheduling | Allyson Silva, Xiangyi Zhang, Zak Webb, Mia Kramer, Chan Woo Yang, Xiao Liu, Jessica Lemieux, Ka-Wai Chen, Artur Scherer, Pooya Ronagh | 2024-05-27 | 下载 | Fault-tolerant quantum computation using two-dimensional topological quantum error correcting codes can benefit from multi-qubit long-range operations. |
| RTL-Repo: A Benchmark for Evaluating LLMs on Large-Scale RTL Design Projects | Ahmed Allam, Mohamed Shalan | 2024-05-27 | 下载 | Large Language Models (LLMs) have demonstrated potential in assisting with Register Transfer Level (RTL) design tasks. Nevertheless, there remains to be a significant gap in benchmarks that accurately... |
| DR-CGRA: Supporting Loop-Carried Dependencies in CGRAs Without Spilling Intermediate Values | Elad Hadar, Yoav Etsion | 2024-05-27 | 下载 | Coarse-grain reconfigurable architectures (CGRAs) are gaining traction thanks to their performance and power efficiency. Utilizing CGRAs to accelerate the execution of tight loops holds great potentia... |
| Optimized thread-block arrangement in a GPU implementation of a linear solver for atmospheric chemistry mechanisms | Christian Guzman Ruiz, Mario Acosta, Oriol Jorba, Eduardo Cesar Galobardes, Matthew Dawson, Guillermo Oyarzun, Carlos Pérez García-Pando, Kim Serradell | 2024-05-27 | 下载 | Earth system models (ESM) demand significant hardware resources and energy consumption to solve atmospheric chemistry processes. Recent studies have shown improved performance from running these model... |
| Evaluation of computational and energy performance in matrix multiplication algorithms on CPU and GPU using MKL, cuBLAS and SYCL | L. A. Torres, Carlos J. Barrios H, Yves Denneulin | 2024-05-27 | 下载 | Matrix multiplication is fundamental in the backpropagation algorithm used to train deep neural network models. Libraries like Intel's MKL or NVIDIA's cuBLAS implemented new and optimized matrix multi... |
| SWAT: Scalable and Efficient Window Attention-based Transformers Acceleration on FPGAs | Zhenyu Bai, Pranav Dangi, Huize Li, Tulika Mitra | 2024-05-27 | 下载 | Efficiently supporting long context length is crucial for Transformer models. The quadratic complexity of the self-attention computation plagues traditional Transformers. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Adaptive Device-Edge Collaboration on DNN Inference in AIoT: A Digital Twin-Assisted Approach | Shisheng Hu, Mushu Li, Jie Gao, Conghao Zhou, Xuemin Shen | 2024-05-27 | 下载 | Device-edge collaboration on deep neural network (DNN) inference is a promising approach to efficiently utilizing network resources for supporting artificial intelligence of things (AIoT) applications... |
| Optimized thread-block arrangement in a GPU implementation of a linear solver for atmospheric chemistry mechanisms | Christian Guzman Ruiz, Mario Acosta, Oriol Jorba, Eduardo Cesar Galobardes, Matthew Dawson, Guillermo Oyarzun, Carlos Pérez García-Pando, Kim Serradell | 2024-05-27 | 下载 | Earth system models (ESM) demand significant hardware resources and energy consumption to solve atmospheric chemistry processes. Recent studies have shown improved performance from running these model... |
| Evaluation of computational and energy performance in matrix multiplication algorithms on CPU and GPU using MKL, cuBLAS and SYCL | L. A. Torres, Carlos J. Barrios H, Yves Denneulin | 2024-05-27 | 下载 | Matrix multiplication is fundamental in the backpropagation algorithm used to train deep neural network models. Libraries like Intel's MKL or NVIDIA's cuBLAS implemented new and optimized matrix multi... |
| ReStorEdge: An edge computing system with reuse semantics | Adrian-Cristian Nicolaescu, Spyridon Mastorakis, Md Washik Al Azad, David Griffin, Miguel Rio | 2024-05-27 | 下载 | This paper investigates an edge computing system where requests are processed by a set of replicated edge servers. We investigate a class of applications where similar queries produce identical result... |
| Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer Inference | Shengyuan Ye, Jiangsu Du, Liekang Zeng, Wenzhong Ou, Xiaowen Chu, Yutong Lu, Xu Chen | 2024-05-27 | 下载 | Transformer-based models have unlocked a plethora of powerful intelligent applications at the edge, such as voice assistant in smart home. Traditional deployment approaches offload the inference workl... |
| Boolean Gates Based on Liquid Marbles | Luca Cavenaghi, Sandro Erba, Claudio Zandron | 2024-05-27 | 下载 | Liquid Marbles are liquid droplets encapsulated by hydrophobic powder particles. They offer an efficient approach to handling liquids due to their non-wetting nature. |
| Efficient Model Compression for Hierarchical Federated Learning | Xi Zhu, Songcan Yu, Junbo Wang, Qinglin Yang | 2024-05-27 | 下载 | Federated learning (FL), as an emerging collaborative learning paradigm, has garnered significant attention due to its capacity to preserve privacy within distributed learning systems. |
| LiveData -- A Worldwide Data Mesh for Stratified Data | Simone Bocca, Amarsanaa Ganbold, Tsolmon Zundui | 2024-05-27 | 下载 | Data reuse is fundamental for reducing the data integration effort required to build data supporting new applications, especially in data scarcity contexts. |
| Evaluation of Resource-Efficient Crater Detectors on Embedded Systems | Simon Vellas, Bill Psomas, Kalliopi Karadima, Dimitrios Danopoulos, Alexandros Paterakis, George Lentaris, Dimitrios Soudris, Konstantinos Karantzalos | 2024-05-27 | 下载 | Real-time analysis of Martian craters is crucial for mission-critical operations, including safe landings and geological exploration. This work leverages the latest breakthroughs for on-the-edge crate... |
| Federated Learning with Blockchain-Enhanced Machine Unlearning: A Trustworthy Approach | Xuhan Zuo, Minghao Wang, Tianqing Zhu, Lefeng Zhang, Shui Yu, Wanlei Zhou | 2024-05-27 | 下载 | With the growing need to comply with privacy regulations and respond to user data deletion requests, integrating machine unlearning into IoT-based federated learning has become imperative. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Enhancing Resiliency of Integrated Space-Air-Ground-Sea Networks with Renewable Energies: A Use Case After the 2023 Türkiye Earthquake | Bilal Karaman, Ilhan Basturk, Sezai Taskin, Ferdi Kara, Engin Zeydan, Halim Yanikomeroglu | 2024-05-27 | 下载 | Natural disasters can have catastrophic consequences, a poignant example is the series of and magnitude earthquakes that devastated Türkiye on February 6, 2023. |
| Survey of Graph Neural Network for Internet of Things and NextG Networks | Sabarish Krishna Moorthy, Jithin Jagannath | 2024-05-27 | 下载 | The exponential increase in Internet of Things (IoT) devices coupled with 6G pushing towards higher data rates and connected devices has sparked a surge in data. |
| The logistic queue model: theoretical properties and performance evaluation | Franco Coltraro, Marc Ruiz, Luis Velasco | 2024-05-27 | 下载 | The advent of digital twins (DT) for the control and management of communication networks requires accurate and fast methods to estimate key performance indicators (KPI) needed for autonomous decision... |
| ReStorEdge: An edge computing system with reuse semantics | Adrian-Cristian Nicolaescu, Spyridon Mastorakis, Md Washik Al Azad, David Griffin, Miguel Rio | 2024-05-27 | 下载 | This paper investigates an edge computing system where requests are processed by a set of replicated edge servers. We investigate a class of applications where similar queries produce identical result... |
| Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer Inference | Shengyuan Ye, Jiangsu Du, Liekang Zeng, Wenzhong Ou, Xiaowen Chu, Yutong Lu, Xu Chen | 2024-05-27 | 下载 | Transformer-based models have unlocked a plethora of powerful intelligent applications at the edge, such as voice assistant in smart home. Traditional deployment approaches offload the inference workl... |
| WirelessLLM: Empowering Large Language Models Towards Wireless Intelligence | Jiawei Shao, Jingwen Tong, Qiong Wu, Wei Guo, Zijian Li, Zehong Lin, Jun Zhang | 2024-05-27 | 下载 | The rapid evolution of wireless technologies and the growing complexity of network infrastructures necessitate a paradigm shift in how communication networks are designed, configured, and managed. |
| Quantum-safe Edge Applications: How to Secure Computation in Distributed Computing Systems | Claudio Cicconetti, Dario Sabella, Pietro Noviello, Gennaro Davide Paduanelli | 2024-05-27 | 下载 | The advent of distributed computing systems will offer great flexibility for application workloads, while also imposing more attention to security, where the future advent and adoption of quantum tech... |
| An experimental study of the response time in an edge-cloud continuum with ClusterLink | Marc Michalke, Fin Gentzen, Admela Jukan, Kfir Toledo, Etai Lev Ran | 2024-05-27 | 下载 | In this paper, we conduct an experimental study to provide a general sense of the application response time implications that inter-cluster communication experiences at the edge at the example of a sp... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| An Analysis of Performance Bottlenecks in MRI Pre-Processing | Mathieu Dugré, Yohan Chatelain, Tristan Glatard | 2024-05-27 | 下载 | Magnetic Resonance Image (MRI) pre-processing is a critical step for neuroimaging analysis. However, the computational cost of MRI pre-processing pipelines is a major bottleneck for large cohort studi... |
| Optimizing Layout of Recursive Datatypes with Marmoset | Vidush Singhal, Chaitanya Koparkar, Joseph Zullo, Artem Pelenitsyn, Michael Vollmer, Mike Rainey, Ryan Newton, Milind Kulkarni | 2024-05-27 | 下载 | While programmers know that the low-level memory representation of data structures can have significant effects on performance, compiler support to optimize the layout of those structures is an under-... |
| Optimized thread-block arrangement in a GPU implementation of a linear solver for atmospheric chemistry mechanisms | Christian Guzman Ruiz, Mario Acosta, Oriol Jorba, Eduardo Cesar Galobardes, Matthew Dawson, Guillermo Oyarzun, Carlos Pérez García-Pando, Kim Serradell | 2024-05-27 | 下载 | Earth system models (ESM) demand significant hardware resources and energy consumption to solve atmospheric chemistry processes. Recent studies have shown improved performance from running these model... |
| Evaluation of Resource-Efficient Crater Detectors on Embedded Systems | Simon Vellas, Bill Psomas, Kalliopi Karadima, Dimitrios Danopoulos, Alexandros Paterakis, George Lentaris, Dimitrios Soudris, Konstantinos Karantzalos | 2024-05-27 | 下载 | Real-time analysis of Martian craters is crucial for mission-critical operations, including safe landings and geological exploration. This work leverages the latest breakthroughs for on-the-edge crate... |
| LRAMM -- Low precision approximates GEMM via RSVD | Hongyaoxing Gu | 2024-05-27 | 下载 | Matrix multiplication computation acceleration has been a research hotspot across various domains. Due to the characteristics of some applications, approximate matrix multiplication can achieve signif... |