2024-04-03
cs.AR - Architecture
| Title | Authors | Published | Link | Abstract |
|---|---|---|---|---|
| QED: Scalable Verification of Hardware Memory Consistency | Gokulan Ravi, Xiaokang Qiu, Mithuna Thottethodi, T. N. Vijaykumar | 2024-04-03 | Download | Memory consistency model (MCM) issues in out-of-order-issue microprocessor-based shared-memory systems are notoriously non-intuitive and a source of hardware design bugs. |
| Spin-NeuroMem: A Low-Power Neuromorphic Associative Memory Design Based on Spintronic Devices | Siqing Fu, Lizhou Wu, Tiejun Li, Chunyuan Zhang, Jianmin Zhang, Sheng Ma | 2024-04-03 | Download | Biologically-inspired computing models have made significant progress in recent years, but the conventional von Neumann architecture is inefficient for the large-scale matrix operations and massive pa... |
| Block-SSD: A New Block-Based Blocking SSD Architecture | Ryan Wong, Arjun Tyagi, Sungjun Cho, Pratik Sampat, Yiqiu Sun | 2024-04-03 | Download | Computer science and related fields (e.g., computer engineering, computer hardware engineering, electrical engineering, electrical and computer engineering, computer systems engineering) often draw in... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Link | Abstract |
|---|---|---|---|---|
| Reducing the Impact of I/O Contention in Numerical Weather Prediction Workflows at Scale Using DAOS | Nicolau Manubens, Simon D. Smart, Emanuele Danovaro, Tiago Quintino, Adrian Jackson | 2024-04-03 | Download | Operational Numerical Weather Prediction (NWP) workflows are highly data-intensive. Data volumes have increased by many orders of magnitude over the last 40 years, and are expected to continue to do s... |
| vPALs: Towards Verified Performance-aware Learning System For Resource Management | Guoliang He, Gingfung Yeung, Sheriffo Ceesay, Adam Barker | 2024-04-03 | Download | Accurately predicting task performance at runtime in a cluster is advantageous for a resource management system to determine whether a task should be migrated due to performance degradation caused by ... |
| GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU | Zhongming Yu, Genghan Zhang, Hanxian Huang, Xin Chen, Jishen Zhao | 2024-04-03 | Download | In recent years, Graph Neural Networks (GNNs) have ignited a surge of innovation, significantly enhancing the processing of geometric data structures such as graphs, point clouds, and meshes. |
| Staging Blocked Evaluation over Structured Sparse Matrices | Pratyush Das, Amirhossein Basareh, Adhitha Dias, Artem Pelenitsyn, Kirshanthan Sundararajah, Milind Kulkarni, Ben Delaware | 2024-04-03 | Download | The matrices used in many computational settings are naturally sparse, holding a small percentage of nonzero elements. Storing such matrices in specialized sparse formats enables algorithms that avoid... |
| Scalable quantum detector tomography by high-performance computing | Timon Schapeler, Robert Schade, Michael Lass, Christian Plessl, Tim J. Bartley | 2024-04-03 | Download | At large scales, quantum systems may become advantageous over their classical counterparts at performing certain tasks. Developing tools to analyse these systems at the relevant scales, in a manner co... |
| A Survey on Error-Bounded Lossy Compression for Scientific Datasets | Sheng Di, Jinyang Liu, Kai Zhao, Xin Liang, Robert Underwood, Zhaorui Zhang, Milan Shah, Yafan Huang, Jiajun Huang, Xiaodong Yu, Congrong Ren, Hanqi Guo, Grant Wilkins, Dingwen Tao, Jiannan Tian, Sian Jin, Zizhe Jian, Daoce Wang, MD Hasanur Rahman, Boyuan Zhang, Shihui Song, Jon C. Calhoun, Guanpeng Li, Kazutomo Yoshii, Khalid Ayed Alharthi, Franck Cappello | 2024-04-03 | Download | Error-bounded lossy compression has been effective in significantly reducing the data storage/transfer burden while preserving the reconstructed data fidelity very well. |
| Optimizing the Deployment of Tiny Transformers on Low-Power MCUs | Victor J. B. Jung, Alessio Burrello, Moritz Scherer, Francesco Conti, Luca Benini | 2024-04-03 | Download | Transformer networks are rapidly becoming SotA in many fields, such as NLP and CV. Similarly to CNN, there is a strong push for deploying Transformer models at the extreme edge, ultimately fitting the... |
| History Trees and Their Applications | Giovanni Viglietta | 2024-04-03 | Download | In the theoretical study of distributed communication networks, "history trees" are a discrete structure that naturally models the concept that anonymous agents become distinguishable upon receiving d... |
| Vocabulary Attack to Hijack Large Language Model Applications | Patrick Levi, Christoph P. Neumann | 2024-04-03 | Download | The fast advancements in Large Language Models (LLMs) are driving an increasing number of applications. Together with the growing number of users, we also see an increasing number of attackers who try... |
| Speed, power and cost implications for GPU acceleration of Computational Fluid Dynamics on HPC systems | Zachary Cooper-Baldock, Brenda Vara Almirall, Kiao Inthavong | 2024-04-03 | Download | Computational Fluid Dynamics (CFD) is the simulation of fluid flow undertaken with the use of computational hardware. The underlying equations are computationally challenging to solve and necessitate ... |
| MOPAR: A Model Partitioning Framework for Deep Learning Inference Services on Serverless Platforms | Jiaang Duan, Shiyou Qian, Dingyu Yang, Hanwen Hu, Jian Cao, Guangtao Xue | 2024-04-03 | Download | With its elastic power and a pay-as-you-go cost model, the deployment of deep learning inference services (DLISs) on serverless platforms is emerging as a prevalent trend. |
| Optimal Batch Allocation for Wireless Federated Learning | Jaeyoung Song, Sang-Woon Jeon | 2024-04-03 | Download | Federated learning aims to construct a global model that fits the dataset distributed across local devices without direct access to private data, leveraging communication between a server and the loca... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Link | Abstract |
|---|---|---|---|---|
| Robust Federated Learning for Wireless Networks: A Demonstration with Channel Estimation | Zexin Fang, Bin Han, Hans D. Schotten | 2024-04-03 | Download | Federated learning (FL) offers a privacy-preserving collaborative approach for training models in wireless networks, with channel estimation emerging as a promising application. |
| Traffic Divergence Theory: An Analysis Formalism for Dynamic Networks | Matin Macktoobian, Zhan Shu, Qing Zhao | 2024-04-03 | Download | Traffic dynamics is universally crucial in analyzing and designing almost any network. This article introduces a novel theoretical approach to analyzing network traffic dynamics. |
| Autonomous Vehicle Networks for More Reliable Truck Tracking in Challenged High Mountain Roads, Tunnels and Bridges Environments | Junhao Chen, Milena Radenkovic | 2024-04-03 | Download | The popularity of online shopping has challenged the existing express tracking. How to provide customers with reliable and stable express tracking has become one of the important issues that express c... |
| When Digital Twin Meets Generative AI: Intelligent Closed-Loop Network Management | Xinyu Huang, Haojun Yang, Conghao Zhou, Mingcheng He, Xuemin Shen, Weihua Zhuang | 2024-04-03 | Download | Generative artificial intelligence (GAI) and digital twin (DT) are advanced data processing and virtualization technologies to revolutionize communication networks. |
| Exploring Opportunistic Routing for Remote Sea Emergencies | Cleon Liew, Milena Radenkovic | 2024-04-03 | Download | This paper explores the Opportunistic Routing Protocols in the context of remote sea emergency scenarios, using the MH370 plane crash as a case study (OppNetMH370). |
| Fully Decentralized Task Offloading in Multi-Access Edge Computing Systems | Shubham Aggarwal, Muhammad Aneeq uz Zaman, Melih Bastopcu, Sennur Ulukus, Tamer Başar | 2024-04-03 | Download | We consider the problem of task offloading in multi-access edge computing (MEC) systems constituting devices assisted by an edge server (ES), where the devices can split task execution between a l... |
| A Universal Deep Neural Network for Signal Detection in Wireless Communication Systems | Khalid Albagami, Nguyen Van Huynh, Geoffrey Ye Li | 2024-04-03 | Download | Recently, deep learning (DL) has been emerging as a promising approach for channel estimation and signal detection in wireless communications. |
| DRL-Based RAT Selection in a Hybrid Vehicular Communication Network | Badreddine Yacine Yacheur, Toufik Ahmed, Mohamed Mosbah | 2024-04-03 | Download | Cooperative intelligent transport systems rely on a set of Vehicle-to-Everything (V2X) applications to enhance road safety. Emerging new V2X applications like Advanced Driver Assistance Systems (ADASs... |
cs.PF - Performance
| Title | Authors | Published | Link | Abstract |
|---|---|---|---|---|
| Staging Blocked Evaluation over Structured Sparse Matrices | Pratyush Das, Amirhossein Basareh, Adhitha Dias, Artem Pelenitsyn, Kirshanthan Sundararajah, Milind Kulkarni, Ben Delaware | 2024-04-03 | Download | The matrices used in many computational settings are naturally sparse, holding a small percentage of nonzero elements. Storing such matrices in specialized sparse formats enables algorithms that avoid... |
| Investigation of Energy-efficient AI Model Architectures and Compression Techniques for "Green" Fetal Brain Segmentation | Szymon Mazurek, Monika Pytlarz, Sylwia Malec, Alessandro Crimi | 2024-04-03 | Download | Artificial intelligence has contributed to advancements across various industries. However, the rapid growth of artificial intelligence technologies also raises concerns about their environmental imp... |
| Optimizing the Deployment of Tiny Transformers on Low-Power MCUs | Victor J. B. Jung, Alessio Burrello, Moritz Scherer, Francesco Conti, Luca Benini | 2024-04-03 | Download | Transformer networks are rapidly becoming SotA in many fields, such as NLP and CV. Similarly to CNN, there is a strong push for deploying Transformer models at the extreme edge, ultimately fitting the... |
| Speed, power and cost implications for GPU acceleration of Computational Fluid Dynamics on HPC systems | Zachary Cooper-Baldock, Brenda Vara Almirall, Kiao Inthavong | 2024-04-03 | Download | Computational Fluid Dynamics (CFD) is the simulation of fluid flow undertaken with the use of computational hardware. The underlying equations are computationally challenging to solve and necessitate ... |