Skip to content

2024-04-03

cs.AR - Architecture

标题作者发布日期PDF摘要
QED: Scalable Verification of Hardware Memory ConsistencyGokulan Ravi, Xiaokang Qiu, Mithuna Thottethodi, T. N. Vijaykumar2024-04-03下载Memory consistency model (MCM) issues in out-of-order-issue microprocessor-based shared-memory systems are notoriously non-intuitive and a source of hardware design bugs.
Spin-NeuroMem: A Low-Power Neuromorphic Associative Memory Design Based on Spintronic DevicesSiqing Fu, Lizhou Wu, Tiejun Li, Chunyuan Zhang, Jianmin Zhang, Sheng Ma2024-04-03下载Biologically-inspired computing models have made significant progress in recent years, but the conventional von Neumann architecture is inefficient for the large-scale matrix operations and massive pa...
Block-SSD: A New Block-Based Blocking SSD ArchitectureRyan Wong, Arjun Tyagi, Sungjun Cho, Pratik Sampat, Yiqiu Sun2024-04-03下载Computer science and related fields (e.g., computer engineering, computer hardware engineering, electrical engineering, electrical and computer engineering, computer systems engineering) often draw in...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Reducing the Impact of I/O Contention in Numerical Weather Prediction Workflows at Scale Using DAOSNicolau Manubens, Simon D. Smart, Emanuele Danovaro, Tiago Quintino, Adrian Jackson2024-04-03下载Operational Numerical Weather Prediction (NWP) workflows are highly data-intensive. Data volumes have increased by many orders of magnitude over the last 40 years, and are expected to continue to do s...
vPALs: Towards Verified Performance-aware Learning System For Resource ManagementGuoliang He, Gingfung Yeung, Sheriffo Ceesay, Adam Barker2024-04-03下载Accurately predicting task performance at runtime in a cluster is advantageous for a resource management system to determine whether a task should be migrated due to performance degradation caused by ...
GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPUZhongming Yu, Genghan Zhang, Hanxian Huang, Xin Chen, Jishen Zhao2024-04-03下载In recent years, Graph Neural Networks (GNNs) have ignited a surge of innovation, significantly enhancing the processing of geometric data structures such as graphs, point clouds, and meshes.
Staging Blocked Evaluation over Structured Sparse MatricesPratyush Das, Amirhossein Basareh, Adhitha Dias, Artem Pelenitsyn, Kirshanthan Sundararajah, Milind Kulkarni, Ben Delaware2024-04-03下载The matrices used in many computational settings are naturally sparse, holding a small percentage of nonzero elements. Storing such matrices in specialized sparse formats enables algorithms that avoid...
Scalable quantum detector tomography by high-performance computingTimon Schapeler, Robert Schade, Michael Lass, Christian Plessl, Tim J. Bartley2024-04-03下载At large scales, quantum systems may become advantageous over their classical counterparts at performing certain tasks. Developing tools to analyse these systems at the relevant scales, in a manner co...
A Survey on Error-Bounded Lossy Compression for Scientific DatasetsSheng Di, Jinyang Liu, Kai Zhao, Xin Liang, Robert Underwood, Zhaorui Zhang, Milan Shah, Yafan Huang, Jiajun Huang, Xiaodong Yu, Congrong Ren, Hanqi Guo, Grant Wilkins, Dingwen Tao, Jiannan Tian, Sian Jin, Zizhe Jian, Daoce Wang, MD Hasanur Rahman, Boyuan Zhang, Shihui Song, Jon C. Calhoun, Guanpeng Li, Kazutomo Yoshii, Khalid Ayed Alharthi, Franck Cappello2024-04-03下载Error-bounded lossy compression has been effective in significantly reducing the data storage/transfer burden while preserving the reconstructed data fidelity very well.
Optimizing the Deployment of Tiny Transformers on Low-Power MCUsVictor J. B. Jung, Alessio Burrello, Moritz Scherer, Francesco Conti, Luca Benini2024-04-03下载Transformer networks are rapidly becoming SotA in many fields, such as NLP and CV. Similarly to CNN, there is a strong push for deploying Transformer models at the extreme edge, ultimately fitting the...
History Trees and Their ApplicationsGiovanni Viglietta2024-04-03下载In the theoretical study of distributed communication networks, "history trees" are a discrete structure that naturally models the concept that anonymous agents become distinguishable upon receiving d...
Vocabulary Attack to Hijack Large Language Model ApplicationsPatrick Levi, Christoph P. Neumann2024-04-03下载The fast advancements in Large Language Models (LLMs) are driving an increasing number of applications. Together with the growing number of users, we also see an increasing number of attackers who try...
Speed, power and cost implications for GPU acceleration of Computational Fluid Dynamics on HPC systemsZachary Cooper-Baldock, Brenda Vara Almirall, Kiao Inthavong2024-04-03下载Computational Fluid Dynamics (CFD) is the simulation of fluid flow undertaken with the use of computational hardware. The underlying equations are computationally challenging to solve and necessitate ...
MOPAR: A Model Partitioning Framework for Deep Learning Inference Services on Serverless PlatformsJiaang Duan, Shiyou Qian, Dingyu Yang, Hanwen Hu, Jian Cao, Guangtao Xue2024-04-03下载With its elastic power and a pay-as-you-go cost model, the deployment of deep learning inference services (DLISs) on serverless platforms is emerging as a prevalent trend.
Optimal Batch Allocation for Wireless Federated LearningJaeyoung Song, Sang-Woon Jeon2024-04-03下载Federated learning aims to construct a global model that fits the dataset distributed across local devices without direct access to private data, leveraging communication between a server and the loca...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Robust Federated Learning for Wireless Networks: A Demonstration with Channel EstimationZexin Fang, Bin Han, Hans D. Schotten2024-04-03下载Federated learning (FL) offers a privacy-preserving collaborative approach for training models in wireless networks, with channel estimation emerging as a promising application.
Traffic Divergence Theory: An Analysis Formalism for Dynamic NetworksMatin Macktoobian, Zhan Shu, Qing Zhao2024-04-03下载Traffic dynamics is universally crucial in analyzing and designing almost any network. This article introduces a novel theoretical approach to analyzing network traffic dynamics.
Autonomous Vehicle Networks for More Reliable Truck Tracking in Challenged High Mountain Roads, Tunnels and Bridges EnvironmentsJunhao Chen, Milena Radenkovic2024-04-03下载The popularity of online shopping has challenged the existing express tracking. How to provide customers with reliable and stable express tracking has become one of the important issues that express c...
When Digital Twin Meets Generative AI: Intelligent Closed-Loop Network ManagementXinyu Huang, Haojun Yang, Conghao Zhou, Mingcheng He, Xuemin Shen, Weihua Zhuang2024-04-03下载Generative artificial intelligence (GAI) and digital twin (DT) are advanced data processing and virtualization technologies to revolutionize communication networks.
Exploring Opportunistic Routing for Remote Sea EmergenciesCleon Liew, Milena Radenkovic2024-04-03下载This paper explores the Opportunistic Routing Protocols in the context of remote sea emergency scenarios, using the MH370 plane crash as a case study (OppNetMH370).
Fully Decentralized Task Offloading in Multi-Access Edge Computing SystemsShubham Aggarwal, Muhammad Aneeq uz Zaman, Melih Bastopcu, Sennur Ulukus, Tamer Başar2024-04-03下载We consider the problem of task offloading in multi-access edge computing (MEC) systems constituting NN devices assisted by an edge server (ES), where the devices can split task execution between a l...
A Universal Deep Neural Network for Signal Detection in Wireless Communication SystemsKhalid Albagami, Nguyen Van Huynh, Geoffrey Ye Li2024-04-03下载Recently, deep learning (DL) has been emerging as a promising approach for channel estimation and signal detection in wireless communications.
DRL-Based RAT Selection in a Hybrid Vehicular Communication NetworkBadreddine Yacine Yacheur, Toufik Ahmed, Mohamed Mosbah2024-04-03下载Cooperative intelligent transport systems rely on a set of Vehicle-to-Everything (V2X) applications to enhance road safety. Emerging new V2X applications like Advanced Driver Assistance Systems (ADASs...

cs.PF - Performance

标题作者发布日期PDF摘要
Staging Blocked Evaluation over Structured Sparse MatricesPratyush Das, Amirhossein Basareh, Adhitha Dias, Artem Pelenitsyn, Kirshanthan Sundararajah, Milind Kulkarni, Ben Delaware2024-04-03下载The matrices used in many computational settings are naturally sparse, holding a small percentage of nonzero elements. Storing such matrices in specialized sparse formats enables algorithms that avoid...
Investigation of Energy-efficient AI Model Architectures and Compression Techniques for "Green" Fetal Brain SegmentationSzymon Mazurek, Monika Pytlarz, Sylwia Malec, Alessandro Crimi2024-04-03下载Artificial intelligence have contributed to advancements across various industries. However, the rapid growth of artificial intelligence technologies also raises concerns about their environmental imp...
Optimizing the Deployment of Tiny Transformers on Low-Power MCUsVictor J. B. Jung, Alessio Burrello, Moritz Scherer, Francesco Conti, Luca Benini2024-04-03下载Transformer networks are rapidly becoming SotA in many fields, such as NLP and CV. Similarly to CNN, there is a strong push for deploying Transformer models at the extreme edge, ultimately fitting the...
Speed, power and cost implications for GPU acceleration of Computational Fluid Dynamics on HPC systemsZachary Cooper-Baldock, Brenda Vara Almirall, Kiao Inthavong2024-04-03下载Computational Fluid Dynamics (CFD) is the simulation of fluid flow undertaken with the use of computational hardware. The underlying equations are computationally challenging to solve and necessitate ...

基于 VitePress 构建