Skip to content

2024-03-12

cs.AR - Architecture

标题作者发布日期PDF摘要
Improving Memory Dependence Prediction with Static AnalysisLuke Panayi, Rohan Gandhi, Jim Whittaker, Vassilios Chouliaras, Martin Berger, Paul Kelly2024-03-12下载This paper explores the potential of communicating information gained by static analysis from compilers to Out-of-Order (OoO) machines, focusing on the memory dependence predictor (MDP).
Low-Energy On-Device Personalization for MCUsYushan Huang, Ranya Aloufi, Xavier Cadet, Yuchen Zhao, Payam Barnaghi, Hamed Haddadi2024-03-12下载Microcontroller Units (MCUs) are ideal platforms for edge applications due to their low cost and energy consumption, and are widely used in various applications, including personalized machine learnin...
Performance Analysis of Matrix Multiplication for Deep Learning on the EdgeCristian Ramírez, Adrián Castelló, Héctor Martínez, Enrique S. Quintana-Ortí2024-03-12下载The devices designed for the Internet-of-Things encompass a large variety of distinct processor architectures, forming a highly heterogeneous zoo.
Enabling Unstructured Sparse Acceleration on Structured Sparse AcceleratorsGeonhwa Jeong, Po-An Tsai, Abhimanyu R. Bambhaniya, Stephen W. Keckler, Tushar Krishna2024-03-12下载Exploiting sparsity in deep neural networks (DNNs) has been a promising area for meeting the growing computation requirements. To minimize the overhead of sparse acceleration, hardware designers have ...
The Dawn of AI-Native EDA: Opportunities and Challenges of Large Circuit ModelsLei Chen, Yiqi Chen, Zhufei Chu, Wenji Fang, Tsung-Yi Ho, Ru Huang, Yu Huang, Sadaf Khan, Min Li, Xingquan Li, Yu Li, Yun Liang, Jinwei Liu, Yi Liu, Yibo Lin, Guojie Luo, Zhengyuan Shi, Guangyu Sun, Dimitrios Tsaras, Runsheng Wang, Ziyi Wang, Xinming Wei, Zhiyao Xie, Qiang Xu, Chenhao Xue, Junchi Yan, Jun Yang, Bei Yu, Mingxuan Yuan, Evangeline F. Y. Young, Xuan Zeng, Haoyi Zhang, Zuodong Zhang, Yuxiang Zhao, Hui-Ling Zhen, Ziyang Zheng, Binwu Zhu, Keren Zhu, Sunan Zou2024-03-12下载Within the Electronic Design Automation (EDA) domain, AI-driven solutions have emerged as formidable tools, yet they typically augment rather than redefine existing methodologies.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and DimensionalityAdrian Perez Dieguez, Min Choi, Mahmut Okyay, Mauro Del Ben, Bryan M. Wong, Khaled Z. Ibrahim2024-03-12下载Tuning searches are pivotal in High-Performance Computing (HPC), addressing complex optimization challenges in computational applications. The complexity arises not only from finely tuning parameters ...
Efficient Language Model Architectures for Differentially Private Federated LearningJae Hun Ro, Srinadh Bhojanapalli, Zheng Xu, Yanxiang Zhang, Ananda Theertha Suresh2024-03-12下载Cross-device federated learning (FL) is a technique that trains a model on data distributed across typically millions of edge devices without data leaving the devices.
SCALHEALTH: Scalable Blockchain Integration for Secure IoT Healthcare SystemsMehrzad Mohammadi, Reza Javan, Mohammad Beheshti-Atashgah, Mohammad Reza Aref2024-03-12下载Internet of Things (IoT) devices are capable of allowing for far-reaching access to and evaluation of patient data to monitor health and diagnose from a distance.
Efficient Fault Tolerance for Pipelined Query Engines via Write-ahead LineageZiheng Wang, Alex Aiken2024-03-12下载Modern distributed pipelined query engines either do not support intra-query fault tolerance or employ high-overhead approaches such as persisting intermediate outputs or checkpointing state.
Accelerating Biclique Counting on GPULinshan Qiu, Zhonggen Li, Xiangyu Ke, Lu Chen, Yunjun Gao2024-03-12下载Counting (p,q)-bicliques in bipartite graphs poses a foundational challenge with broad applications, from densest subgraph discovery in algorithmic research to personalized content recommendation in p...
MPCPA: Multi-Center Privacy Computing with Predictions Aggregation based on Denoising Diffusion Probabilistic ModelGuibo Luo, Hanwen Zhang, Xiuling Wang, Mingzhi Chen, Yuesheng Zhu2024-03-12下载Privacy-preserving computing is crucial for multi-center machine learning in many applications such as healthcare and finance. In this paper a Multi-center Privacy Computing framework with Predictions...
Characterization of Large Language Model Development in the DatacenterQinghao Hu, Zhisheng Ye, Zerui Wang, Guoteng Wang, Meng Zhang, Qiaoling Chen, Peng Sun, Dahua Lin, Xiaolin Wang, Yingwei Luo, Yonggang Wen, Tianwei Zhang2024-03-12下载Large Language Models (LLMs) have presented impressive performance across several transformative tasks. However, it is non-trivial to efficiently utilize large-scale cluster resources to develop LLMs,...
Communication Optimization for Distributed Training: Architecture, Advances, and OpportunitiesYunze Wei, Tianshuo Hu, Cong Liang, Yong Cui2024-03-12下载The past few years have witnessed the flourishing of large-scale deep neural network models with ever-growing parameter numbers. Training such large-scale models typically requires massive memory and ...
Towards a Dynamic Future with Adaptable Computing and Network Convergence (ACNC)Masoud Shokrnezhad, Hao Yu, Tarik Taleb, Richard Li, Kyunghan Lee, Jaeseung Song, Cedric Westphal2024-03-12下载In the context of advancing 6G, a substantial paradigm shift is anticipated, highlighting comprehensive everything-to-everything interactions characterized by numerous connections and stringent adhere...
Measuring Data Similarity for Efficient Federated Learning: A Feasibility StudyFernanda Famá, Charalampos Kalalas, Sandra Lagen, Paolo Dini2024-03-12下载In multiple federated learning schemes, a random subset of clients sends in each round their model updates to the server for aggregation. Although this client selection strategy aims to reduce communi...
GPU-Accelerated Vecchia Approximations of Gaussian Processes for Geospatial Data using Batched Matrix ComputationsQilong Pan, Sameh Abdulah, Marc G. Genton, David E. Keyes, Hatem Ltaief, Ying Sun2024-03-12下载Gaussian processes (GPs) are commonly used for geospatial analysis, but they suffer from high computational complexity when dealing with massive data.
Polylog-Competitive Deterministic Local Routing and SchedulingBernhard Haeupler, Shyamal Patel, Antti Roeyskoe, Cliff Stein, Goran Zuzic2024-03-12下载This paper addresses point-to-point packet routing in undirected networks, which is the most important communication primitive in most networks.
Atomicity and Abstraction for Cross-Blockchain InteractionsHuaixi Lu, Akshay Jajoo, Kedar S. Namjoshi2024-03-12下载A blockchain facilitates secure and atomic transactions between mutually untrusting parties on that chain. Today, there are multiple blockchains with differing interfaces and security properties.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Zero-Rating, One Big Mess: Analyzing Differential Pricing Practices of European MNOsGabriel Karl Gegenhuber, Wilfried Mayer, Edgar Weippl2024-03-12下载Zero-rating, the practice of not billing data traffic that belongs to certain applications, has become popular within the mobile ecosystem around the globe.
Online Digital Twin-Empowered Content Resale Mechanism in Age of Information-Aware Edge Caching NetworksYuhan Yi, Guanglin Zhang, Hai Jiang2024-03-12下载For users requesting popular contents from content providers, edge caching can alleviate backhaul pressure and enhance the quality of experience of users.
From Files to Streams: Revisiting Web History and Exploring Potentials for Future ProspectsLucas Vogel, Thomas Springer, Matthias Wählisch2024-03-12下载Over the last 30 years, the World Wide Web has changed significantly. In this paper, we argue that common practices to prepare web pages for delivery conflict with many efforts to present content with...
Emerging Technologies for 6G Non-Terrestrial-Networks: From Academia to Industrial ApplicationsCong T. Nguyen, Yuris Mulya Saputra, Nguyen Van Huynh, Tan N. Nguyen, Dinh Thai Hoang, Diep N Nguyen, Van-Quan Pham, Miroslav Voznak, Symeon Chatzinotas, Dinh-Hieu Tran2024-03-12下载Terrestrial networks form the fundamental infrastructure of modern communication systems, serving more than 4 billion users globally. However, terrestrial networks are facing a wide range of challenge...
Adapting LoRaWAN to the Open-RAN ArchitectureSobhi Alfayoumi, Joan Melia-Segui, Xavier Vilajosana2024-03-12下载This article proposes O-LoRaWAN, an adaptation of the LoRaWAN architecture into a modular network architecture based on the Open RAN (O-RAN) principles.
Towards a Dynamic Future with Adaptable Computing and Network Convergence (ACNC)Masoud Shokrnezhad, Hao Yu, Tarik Taleb, Richard Li, Kyunghan Lee, Jaeseung Song, Cedric Westphal2024-03-12下载In the context of advancing 6G, a substantial paradigm shift is anticipated, highlighting comprehensive everything-to-everything interactions characterized by numerous connections and stringent adhere...
A Survey on Federated Learning in Intelligent Transportation SystemsRongqing Zhang, Hanqiu Wang, Bing Li, Xiang Cheng, Liuqing Yang2024-03-12下载The development of Intelligent Transportation System (ITS) has brought about comprehensive urban traffic information that not only provides convenience to urban residents in their daily lives but also...
Discrete-Time Modeling and Handover Analysis of Intelligent Reflecting Surface-Assisted NetworksHaoyan Wei, Hongtao Zhang2024-03-12下载Owning to the reflection gain and double path loss featured by intelligent reflecting surface (IRS) channels, handover (HO) locations become irregular and the signal strength fluctuates sharply with v...
Imitation Learning for Adaptive Video Streaming with Future Adversarial Information Bottleneck PrincipleShuoyao Wang, Jiawei Lin, Fangwei Ye2024-03-12下载Adaptive video streaming plays a crucial role in ensuring high-quality video streaming services. Despite extensive research efforts devoted to Adaptive BitRate (ABR) techniques, the current reinforcem...

基于 VitePress 构建