Skip to content

2024-05-27

cs.AR - Architecture

标题作者发布日期PDF摘要
Multi-qubit Lattice Surgery SchedulingAllyson Silva, Xiangyi Zhang, Zak Webb, Mia Kramer, Chan Woo Yang, Xiao Liu, Jessica Lemieux, Ka-Wai Chen, Artur Scherer, Pooya Ronagh2024-05-27下载Fault-tolerant quantum computation using two-dimensional topological quantum error correcting codes can benefit from multi-qubit long-range operations.
RTL-Repo: A Benchmark for Evaluating LLMs on Large-Scale RTL Design ProjectsAhmed Allam, Mohamed Shalan2024-05-27下载Large Language Models (LLMs) have demonstrated potential in assisting with Register Transfer Level (RTL) design tasks. Nevertheless, there remains to be a significant gap in benchmarks that accurately...
DR-CGRA: Supporting Loop-Carried Dependencies in CGRAs Without Spilling Intermediate ValuesElad Hadar, Yoav Etsion2024-05-27下载Coarse-grain reconfigurable architectures (CGRAs) are gaining traction thanks to their performance and power efficiency. Utilizing CGRAs to accelerate the execution of tight loops holds great potentia...
Optimized thread-block arrangement in a GPU implementation of a linear solver for atmospheric chemistry mechanismsChristian Guzman Ruiz, Mario Acosta, Oriol Jorba, Eduardo Cesar Galobardes, Matthew Dawson, Guillermo Oyarzun, Carlos Pérez García-Pando, Kim Serradell2024-05-27下载Earth system models (ESM) demand significant hardware resources and energy consumption to solve atmospheric chemistry processes. Recent studies have shown improved performance from running these model...
Evaluation of computational and energy performance in matrix multiplication algorithms on CPU and GPU using MKL, cuBLAS and SYCLL. A. Torres, Carlos J. Barrios H, Yves Denneulin2024-05-27下载Matrix multiplication is fundamental in the backpropagation algorithm used to train deep neural network models. Libraries like Intel's MKL or NVIDIA's cuBLAS implemented new and optimized matrix multi...
SWAT: Scalable and Efficient Window Attention-based Transformers Acceleration on FPGAsZhenyu Bai, Pranav Dangi, Huize Li, Tulika Mitra2024-05-27下载Efficiently supporting long context length is crucial for Transformer models. The quadratic complexity of the self-attention computation plagues traditional Transformers.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Adaptive Device-Edge Collaboration on DNN Inference in AIoT: A Digital Twin-Assisted ApproachShisheng Hu, Mushu Li, Jie Gao, Conghao Zhou, Xuemin Shen2024-05-27下载Device-edge collaboration on deep neural network (DNN) inference is a promising approach to efficiently utilizing network resources for supporting artificial intelligence of things (AIoT) applications...
Optimized thread-block arrangement in a GPU implementation of a linear solver for atmospheric chemistry mechanismsChristian Guzman Ruiz, Mario Acosta, Oriol Jorba, Eduardo Cesar Galobardes, Matthew Dawson, Guillermo Oyarzun, Carlos Pérez García-Pando, Kim Serradell2024-05-27下载Earth system models (ESM) demand significant hardware resources and energy consumption to solve atmospheric chemistry processes. Recent studies have shown improved performance from running these model...
Evaluation of computational and energy performance in matrix multiplication algorithms on CPU and GPU using MKL, cuBLAS and SYCLL. A. Torres, Carlos J. Barrios H, Yves Denneulin2024-05-27下载Matrix multiplication is fundamental in the backpropagation algorithm used to train deep neural network models. Libraries like Intel's MKL or NVIDIA's cuBLAS implemented new and optimized matrix multi...
ReStorEdge: An edge computing system with reuse semanticsAdrian-Cristian Nicolaescu, Spyridon Mastorakis, Md Washik Al Azad, David Griffin, Miguel Rio2024-05-27下载This paper investigates an edge computing system where requests are processed by a set of replicated edge servers. We investigate a class of applications where similar queries produce identical result...
Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer InferenceShengyuan Ye, Jiangsu Du, Liekang Zeng, Wenzhong Ou, Xiaowen Chu, Yutong Lu, Xu Chen2024-05-27下载Transformer-based models have unlocked a plethora of powerful intelligent applications at the edge, such as voice assistant in smart home. Traditional deployment approaches offload the inference workl...
Boolean Gates Based on Liquid MarblesLuca Cavenaghi, Sandro Erba, Claudio Zandron2024-05-27下载Liquid Marbles are liquid droplets encapsulated by hydrophobic powder particles. They offer an efficient approach to handling liquids due to their non-wetting nature.
Efficient Model Compression for Hierarchical Federated LearningXi Zhu, Songcan Yu, Junbo Wang, Qinglin Yang2024-05-27下载Federated learning (FL), as an emerging collaborative learning paradigm, has garnered significant attention due to its capacity to preserve privacy within distributed learning systems.
LiveData -- A Worldwide Data Mesh for Stratified DataSimone Bocca, Amarsanaa Ganbold, Tsolmon Zundui2024-05-27下载Data reuse is fundamental for reducing the data integration effort required to build data supporting new applications, especially in data scarcity contexts.
Evaluation of Resource-Efficient Crater Detectors on Embedded SystemsSimon Vellas, Bill Psomas, Kalliopi Karadima, Dimitrios Danopoulos, Alexandros Paterakis, George Lentaris, Dimitrios Soudris, Konstantinos Karantzalos2024-05-27下载Real-time analysis of Martian craters is crucial for mission-critical operations, including safe landings and geological exploration. This work leverages the latest breakthroughs for on-the-edge crate...
Federated Learning with Blockchain-Enhanced Machine Unlearning: A Trustworthy ApproachXuhan Zuo, Minghao Wang, Tianqing Zhu, Lefeng Zhang, Shui Yu, Wanlei Zhou2024-05-27下载With the growing need to comply with privacy regulations and respond to user data deletion requests, integrating machine unlearning into IoT-based federated learning has become imperative.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Enhancing Resiliency of Integrated Space-Air-Ground-Sea Networks with Renewable Energies: A Use Case After the 2023 Türkiye EarthquakeBilal Karaman, Ilhan Basturk, Sezai Taskin, Ferdi Kara, Engin Zeydan, Halim Yanikomeroglu2024-05-27下载Natural disasters can have catastrophic consequences, a poignant example is the series of 7.77.7 and 7.67.6 magnitude earthquakes that devastated Türkiye on February 6, 2023.
Survey of Graph Neural Network for Internet of Things and NextG NetworksSabarish Krishna Moorthy, Jithin Jagannath2024-05-27下载The exponential increase in Internet of Things (IoT) devices coupled with 6G pushing towards higher data rates and connected devices has sparked a surge in data.
The logistic queue model: theoretical properties and performance evaluationFranco Coltraro, Marc Ruiz, Luis Velasco2024-05-27下载The advent of digital twins (DT) for the control and management of communication networks requires accurate and fast methods to estimate key performance indicators (KPI) needed for autonomous decision...
ReStorEdge: An edge computing system with reuse semanticsAdrian-Cristian Nicolaescu, Spyridon Mastorakis, Md Washik Al Azad, David Griffin, Miguel Rio2024-05-27下载This paper investigates an edge computing system where requests are processed by a set of replicated edge servers. We investigate a class of applications where similar queries produce identical result...
Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer InferenceShengyuan Ye, Jiangsu Du, Liekang Zeng, Wenzhong Ou, Xiaowen Chu, Yutong Lu, Xu Chen2024-05-27下载Transformer-based models have unlocked a plethora of powerful intelligent applications at the edge, such as voice assistant in smart home. Traditional deployment approaches offload the inference workl...
WirelessLLM: Empowering Large Language Models Towards Wireless IntelligenceJiawei Shao, Jingwen Tong, Qiong Wu, Wei Guo, Zijian Li, Zehong Lin, Jun Zhang2024-05-27下载The rapid evolution of wireless technologies and the growing complexity of network infrastructures necessitate a paradigm shift in how communication networks are designed, configured, and managed.
Quantum-safe Edge Applications: How to Secure Computation in Distributed Computing SystemsClaudio Cicconetti, Dario Sabella, Pietro Noviello, Gennaro Davide Paduanelli2024-05-27下载The advent of distributed computing systems will offer great flexibility for application workloads, while also imposing more attention to security, where the future advent and adoption of quantum tech...
An experimental study of the response time in an edge-cloud continuum with ClusterLinkMarc Michalke, Fin Gentzen, Admela Jukan, Kfir Toledo, Etai Lev Ran2024-05-27下载In this paper, we conduct an experimental study to provide a general sense of the application response time implications that inter-cluster communication experiences at the edge at the example of a sp...

cs.PF - Performance

标题作者发布日期PDF摘要
An Analysis of Performance Bottlenecks in MRI Pre-ProcessingMathieu Dugré, Yohan Chatelain, Tristan Glatard2024-05-27下载Magnetic Resonance Image (MRI) pre-processing is a critical step for neuroimaging analysis. However, the computational cost of MRI pre-processing pipelines is a major bottleneck for large cohort studi...
Optimizing Layout of Recursive Datatypes with MarmosetVidush Singhal, Chaitanya Koparkar, Joseph Zullo, Artem Pelenitsyn, Michael Vollmer, Mike Rainey, Ryan Newton, Milind Kulkarni2024-05-27下载While programmers know that the low-level memory representation of data structures can have significant effects on performance, compiler support to optimize the layout of those structures is an under-...
Optimized thread-block arrangement in a GPU implementation of a linear solver for atmospheric chemistry mechanismsChristian Guzman Ruiz, Mario Acosta, Oriol Jorba, Eduardo Cesar Galobardes, Matthew Dawson, Guillermo Oyarzun, Carlos Pérez García-Pando, Kim Serradell2024-05-27下载Earth system models (ESM) demand significant hardware resources and energy consumption to solve atmospheric chemistry processes. Recent studies have shown improved performance from running these model...
Evaluation of Resource-Efficient Crater Detectors on Embedded SystemsSimon Vellas, Bill Psomas, Kalliopi Karadima, Dimitrios Danopoulos, Alexandros Paterakis, George Lentaris, Dimitrios Soudris, Konstantinos Karantzalos2024-05-27下载Real-time analysis of Martian craters is crucial for mission-critical operations, including safe landings and geological exploration. This work leverages the latest breakthroughs for on-the-edge crate...
LRAMM -- Low precision approximates GEMM via RSVDHongyaoxing Gu2024-05-27下载Matrix multiplication computation acceleration has been a research hotspot across various domains. Due to the characteristics of some applications, approximate matrix multiplication can achieve signif...

基于 VitePress 构建