2024-01-22
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| ACS: Concurrent Kernel Execution on Irregular, Input-Dependent Computational Graphs | Sankeerth Durvasula, Adrian Zhao, Raymond Kiguru, Yushi Guan, Zhonghan Chen, Nandita Vijaykumar | 2024-01-22 | Download | GPUs are widely used to accelerate many important classes of workloads today. However, we observe that several important emerging classes of workloads, including simulation engines for deep reinforcem... |
| Retrieval-Guided Reinforcement Learning for Boolean Circuit Minimization | Animesh Basak Chowdhury, Marco Romanelli, Benjamin Tan, Ramesh Karri, Siddharth Garg | 2024-01-22 | Download | Logic synthesis, a pivotal stage in chip design, entails optimizing chip specifications encoded in hardware description languages like Verilog into highly efficient implementations using Boolean logic... |
| An Irredundant and Compressed Data Layout to Optimize Bandwidth Utilization of FPGA Accelerators | Corentin Ferry, Nicolas Derumigny, Steven Derrien, Sanjay Rajopadhye | 2024-01-22 | Download | Memory bandwidth is known to be a performance bottleneck for FPGA accelerators, especially when they deal with large multi-dimensional data-sets. |
| BETA: Binarized Energy-Efficient Transformer Accelerator at the Edge | Yuhao Ji, Chao Fang, Zhongfeng Wang | 2024-01-22 | Download | Existing binary Transformers are promising in edge deployment due to their compact model size, low computational complexity, and considerable inference accuracy. |
| Accelerating Seed Location Filtering in DNA Read Mapping Using a Commercial Compute-in-SRAM Architecture | Courtney Golden, Dan Ilan, Nicholas Cebry, Christopher Batten | 2024-01-22 | Download | DNA sequence alignment is an important workload in computational genomics. Reference-guided DNA assembly involves aligning many read sequences against candidate locations in a long reference genome. |
| Zero-Space Cost Fault Tolerance for Transformer-based Language Models on ReRAM | Bingbing Li, Geng Yuan, Zigeng Wang, Shaoyi Huang, Hongwu Peng, Payman Behnam, Wujie Wen, Hang Liu, Caiwen Ding | 2024-01-22 | Download | Resistive Random Access Memory (ReRAM) has emerged as a promising platform for deep neural networks (DNNs) due to its support for parallel in-situ matrix-vector multiplication. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Learning Recovery Strategies for Dynamic Self-healing in Reactive Systems | Mateo Sanabria, Ivana Dusparic, Nicolas Cardozo | 2024-01-22 | Download | Self-healing systems depend on following a set of predefined instructions to recover from a known failure state. Failure states are generally detected based on domain specific specialized metrics. |
| Efficient Collaborations through Weight-Driven Coalition Dynamics in Federated Learning Systems | Mohammed El Hanjri, Hamza Reguieg, Adil Attiaoui, Amine Abouaomar, Abdellatif Kobbane, Mohamed El Kamili | 2024-01-22 | Download | In the era of the Internet of Things (IoT), decentralized paradigms for machine learning are gaining prominence. In this paper, we introduce a federated learning model that capitalizes on the Euclidea... |
| Uncoded Storage Coded Transmission Elastic Computing with Straggler Tolerance in Heterogeneous Systems | Xi Zhong, Joerg Kliewer, Mingyue Ji | 2024-01-22 | Download | In 2018, Yang et al. introduced a novel and effective approach, using maximum distance separable (MDS) codes, to mitigate the impact of elasticity in cloud computing systems. |
| Centralization in Block Building and Proposer-Builder Separation | Maryam Bahrani, Pranav Garimidi, Tim Roughgarden | 2024-01-22 | Download | The goal of this paper is to rigorously interrogate conventional wisdom about centralization in block-building (due to, e.g., MEV and private order flow) and the outsourcing of block-building by valid... |
| Alya towards Exascale: Optimal OpenACC Performance of the Navier-Stokes Finite Element Assembly on GPUs | Herbert Owen, Dominik Ernst, Thomas Gruber, Oriol Lemkuhl, Guillaume Houzeaux, Lucas Gasparino, Gerhard Wellein | 2024-01-22 | Download | This paper addresses the challenge of providing portable and highly efficient code structures for CPU and GPU architectures. We choose the assembly of the right-hand term in the incompressible flow mo... |
| LLM-based policy generation for intent-based management of applications | Kristina Dzeparoska, Jieyu Lin, Ali Tizghadam, Alberto Leon-Garcia | 2024-01-22 | Download | Automated management requires decomposing high-level user requests, such as intents, to an abstraction that the system can understand and execute. |
| TurboSVM-FL: Boosting Federated Learning through SVM Aggregation for Lazy Clients | Mengdi Wang, Anna Bodonhelyi, Efe Bozkir, Enkelejda Kasneci | 2024-01-22 | Download | Federated learning is a distributed collaborative machine learning paradigm that has gained strong momentum in recent years. In federated learning, a central server periodically coordinates models wit... |
| Tight Bounds on the Message Complexity of Distributed Tree Verification | Shay Kutten, Peter Robinson, Ming Ming Tan | 2024-01-22 | Download | We consider the message complexity of verifying whether a given subgraph of the communication network forms a tree with specific properties both in the KT-ρ (nodes know their ρ-hop neighborhood, i... |
| Accelerating Causal Algorithms for Industrial-scale Data: A Distributed Computing Approach with Ray Framework | Vishal Verma, Vinod Reddy, Jaiprakash Ravi | 2024-01-22 | Download | The increasing need for causal analysis in large-scale industrial datasets necessitates the development of efficient and scalable causal algorithms for real-world applications. |
| Navigating the Maize: Cyclic and conditional computational graphs for molecular simulation | Thomas Löhr, Michele Assante, Michael Dodds, Lili Cao, Mikhail Kabeshov, Jon-Paul Janet, Marco Klähn, Ola Engkvist | 2024-01-22 | Download | Many computational chemistry and molecular simulation workflows can be expressed as graphs. This abstraction is useful to modularize and potentially reuse existing components, as well as provide paral... |
| Self-Balancing Semi-Hierarchical PCNs for CBDCs | Marco Benedetti, Francesco De Sclavis, Marco Favorito, Giuseppe Galano, Sara Giammusso, Antonio Muci, Matteo Nardelli | 2024-01-22 | Download | We introduce a family of PCNs (Payment Channel Networks) characterized by a semi-hierarchical topology and a custom set of channel rebalancing strategies. |
| Integrated Sensing, Communication, and Computing: An Information-oriented Resource Transaction Mechanism | Ning Chen, Zhipeng Cheng, Xuwei Fan, Zhang Liu, Bangzhen Huang, Jie Yang, Yifeng Zhao, Lianfen Huang | 2024-01-22 | Download | Information acquisition from target perception represents the key enabling technology of the Internet of Automatic Vehicles (IoAV), which is essential for the decision-making and control operation of ... |
| Transformers with Attentive Federated Aggregation for Time Series Stock Forecasting | Chu Myaet Thwal, Ye Lin Tun, Kitae Kim, Seong-Bae Park, Choong Seon Hong | 2024-01-22 | Download | Recent innovations in transformers have shown their superior performance in natural language processing (NLP) and computer vision (CV). The ability to capture long-range dependencies and interactions ... |
| Attention on Personalized Clinical Decision Support System: Federated Learning Approach | Chu Myaet Thwal, Kyi Thar, Ye Lin Tun, Choong Seon Hong | 2024-01-22 | Download | Health management has become a primary problem as new kinds of diseases and complex symptoms are introduced to a rapidly growing modern society. |
| OnDev-LCT: On-Device Lightweight Convolutional Transformers towards federated learning | Chu Myaet Thwal, Minh N. H. Nguyen, Ye Lin Tun, Seong Tae Kim, My T. Thai, Choong Seon Hong | 2024-01-22 | Download | Federated learning (FL) has emerged as a promising approach to collaboratively train machine learning models across multiple edge devices while preserving privacy. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Data-oriented Coordinated Uplink Transmission for Massive IoT System | Jyri Hämäläinen, Rui Dinis, Mehmet C. Ilter | 2024-01-22 | Download | Recently, the paradigm of massive ultra-reliable low-latency IoT communications (URLLC-IoT) has gained growing interest. Reliable delay-critical uplink transmission in IoT is a challenging task since ... |
| Fast and Scalable Network Slicing by Integrating Deep Learning with Lagrangian Methods | Tianlun Hu, Qi Liao, Qiang Liu, Antonio Massaro, Georg Carle | 2024-01-22 | Download | Network slicing is a key technique in 5G and beyond for efficiently supporting diverse services. Many network slicing solutions rely on deep learning to manage complex and high-dimensional resource al... |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| SyzRetrospector: A Large-Scale Retrospective Study of Syzbot | Joseph Bursey, Ardalan Amiri Sani, Zhiyun Qian | 2024-01-22 | Download | Over the past 6 years, Syzbot has fuzzed the Linux kernel day and night to report over 5570 bugs, of which 4604 have been patched [11]. While this is impressive, we have found the average time to find... |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Alya towards Exascale: Optimal OpenACC Performance of the Navier-Stokes Finite Element Assembly on GPUs | Herbert Owen, Dominik Ernst, Thomas Gruber, Oriol Lemkuhl, Guillaume Houzeaux, Lucas Gasparino, Gerhard Wellein | 2024-01-22 | Download | This paper addresses the challenge of providing portable and highly efficient code structures for CPU and GPU architectures. We choose the assembly of the right-hand term in the incompressible flow mo... |
| Systematic Performance Evaluation Framework for LEO Mega-Constellation Satellite Networks | Yu Wang, Chuili Kong, Xian Meng, Hejia Luo, Ke-Xin Li, Jun Wang | 2024-01-22 | Download | Low Earth orbit (LEO) mega-constellation satellite networks have shown great potential to extend the coverage capability of conventional terrestrial networks. |