
2024-07-16

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Characterizing and Understanding HGNN Training on GPUs | Dengke Han, Mingyu Yan, Xiaochun Ye, Dongrui Fan | 2024-07-16 | Download | Owing to their remarkable representation capabilities for heterogeneous graph data, Heterogeneous Graph Neural Networks (HGNNs) have been widely adopted in many critical real-world domains such as rec... |
| Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC) | Seyed Nima Omidsajedi, Rekha Reddy, Jianming Yi, Jan Herbst, Christoph Lipps, Hans Dieter Schotten | 2024-07-16 | Download | Almost in every heavily computation-dependent application, from 6G communication systems to autonomous driving platforms, a large portion of computing should be near to the client side. |
| ApproxPilot: A GNN-based Accelerator Approximation Framework | Qing Zhang, Cheng Liu, Siting Liu, Yajuan Hui, Huawei Li, Xiaowei Li | 2024-07-16 | Download | A typical optimization of customized accelerators for error-tolerant applications such as multimedia, recognition, and classification is to replace traditional arithmetic units like multipliers and ad... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| The Latency Price of Threshold Cryptosystem in Blockchains | Zhuolun Xiang, Sourav Das, Zekun Li, Zhoujun Ma, Alexander Spiegelman | 2024-07-16 | Download | Threshold cryptography is essential for many blockchain protocols. For example, many protocols rely on threshold common coin to implement asynchronous consensus, leader elections, and provide support ... |
| Building AI Agents for Autonomous Clouds: Challenges and Design Principles | Manish Shetty, Yinfang Chen, Gagan Somashekar, Minghua Ma, Yogesh Simmhan, Xuchao Zhang, Jonathan Mace, Dax Vandevoorde, Pedro Las-Casas, Shachee Mishra Gupta, Suman Nath, Chetan Bansal, Saravan Rajmohan | 2024-07-16 | Download | The rapid growth in the use of Large Language Models (LLMs) and AI Agents as part of software development and deployment is revolutionizing the information technology landscape. |
| Gaming and Blockchain: Hype and Reality | Max McGuinness | 2024-07-16 | Download | This paper explores the adoption of blockchain technology in the gaming industry. While supporters affirm that distributed ledger technology has potential to revolutionize gaming economies and provide... |
| MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training | Pinxue Zhao, Hailin Zhang, Fangcheng Fu, Xiaonan Nie, Qibin Liu, Fang Yang, Yuanbo Peng, Dian Jiao, Shuaipeng Li, Jinbao Xue, Yangyu Tao, Bin Cui | 2024-07-16 | Download | Nowadays, Large Language Models (LLMs) have been trained using extended context lengths to foster more creative applications. However, long context training poses great challenges considering the cons... |
| Hydra: Brokering Cloud and HPC Resources to Support the Execution of Heterogeneous Workloads at Scale | Aymen Alsaadi, Shantenu Jha, Matteo Turilli | 2024-07-16 | Download | Scientific discovery increasingly depends on middleware that enables the execution of heterogeneous workflows on heterogeneous platforms. One of the main challenges is to design software components tha... |
| zIA: a GenAI-powered local auntie assists tourists in Italy | Alexio Cassani, Michele Ruberl, Antonio Salis, Giacomo Giannese, Gianluca Boanelli | 2024-07-16 | Download | The Tourism and Destination Management Organization (DMO) industry is rapidly evolving to adapt to new technologies and traveler expectations. |
| Scalable and Reliable Over-the-Air Federated Edge Learning | Maximilian Egger, Christoph Hofmeister, Cem Kaya, Rawad Bitar, Antonia Wachter-Zeh | 2024-07-16 | Download | Federated edge learning (FEEL) has emerged as a core paradigm for large-scale optimization. However, FEEL still suffers from a communication bottleneck due to the transmission of high-dimensional mode... |
| PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation | Branden Butler, Sixing Yu, Arya Mazaheri, Ali Jannesari | 2024-07-16 | Download | Inference of Large Language Models (LLMs) across computer clusters has become a focal point of research in recent times, with many acceleration techniques taking inspiration from CPU speculative execu... |
| Enhancing Split Computing and Early Exit Applications through Predefined Sparsity | Luigi Capogrosso, Enrico Fraccaroli, Giulio Petrozziello, Francesco Setti, Samarjit Chakraborty, Franco Fummi, Marco Cristani | 2024-07-16 | Download | In the past decade, Deep Neural Networks (DNNs) achieved state-of-the-art performance in a broad range of problems, spanning from object classification and action recognition to smart building and hea... |
| Self-Regulating Random Walks for Resilient Decentralized Learning on Graphs | Maximilian Egger, Rawad Bitar, Ghadir Ayache, Antonia Wachter-Zeh, Salim El Rouayheb | 2024-07-16 | Download | Consider the setting of multiple random walks (RWs) on a graph executing a certain computational task. For instance, in decentralized learning via RWs, a model is updated at each iteration based on th... |
| Revolutionizing MRI Data Processing Using FSL: Preliminary Findings with the Fugaku Supercomputer | Tianxiang Lyu, Wataru Uchida, Zhe Sun, Christina Andica, Keita Tokuda, Rui Zou, Jie Mao, Keigo Shimoji, Koji Kamagata, Mitsuhisa Sato, Ryutaro Himeno, Shigeki Aoki | 2024-07-16 | Download | The amount of Magnetic resonance imaging data has grown tremendously recently, creating an urgent need to accelerate data processing, which requires substantial computational resources and time. |
| Reducing Tail Latencies Through Environment- and Neighbour-aware Thread Management | Andrew Jeffery, Chris Jensen, Richard Mortier | 2024-07-16 | Download | Application tail latency is a key metric for many services, with high latencies being linked directly to loss of revenue. Modern deeply-nested micro-service architectures exacerbate tail latencies, in... |
| Finite State Machines-Based Path-Following Collaborative Computing Strategy for Emergency UAV Swarms | Jialin Hu, Zhiyuan Ren, Wenchi Cheng | 2024-07-16 | Download | Offloading services to UAV swarms for delay-sensitive tasks in Emergency UAV Networks (EUN) can greatly enhance rescue efficiency. Most task-offloading strategies assumed that UAVs were location-fixed... |
| Bringing Auto-tuning to HIP: Analysis of Tuning Impact and Difficulty on AMD and Nvidia GPUs | Milo Lurati, Stijn Heldens, Alessio Sclocco, Ben van Werkhoven | 2024-07-16 | Download | Many studies have focused on developing and improving auto-tuning algorithms for Nvidia Graphics Processing Units (GPUs), but the effectiveness and efficiency of these approaches on AMD devices have h... |
| Performance Analysis of Internet of Vehicles Mesh Networks Based on Actual Switch Models | Jialin Hu, Zhiyuan Ren, Wenchi Cheng, Zhiliang Shuai, Zhao Li | 2024-07-16 | Download | The rapid growth of the automotive industry has exacerbated the conflict between the complex traffic environment, increasing communication demands, and limited resources. |
| Fast Iterative Graph Computing with Updated Neighbor States | Yijie Zhou, Shufeng Gong, Feng Yao, Hanzhang Chen, Song Yu, Pengxi Liu, Yanfeng Zhang, Ge Yu, Jeffrey Xu Yu | 2024-07-16 | Download | Enhancing the efficiency of iterative computation on graphs has garnered considerable attention in both industry and academia. Nonetheless, the majority of efforts focus on expediting iterative comput... |
| Cloud-based Semi-Quantum Money | Yichi Zhang, Siyuan Jin, Yuhan Huang, Bei Zeng, Qiming Shao | 2024-07-16 | Download | In the 1970s, Wiesner introduced the concept of quantum money, where quantum states generated according to specific rules function as currency. |
| Octopus: Experiences with a Hybrid Event-Driven Architecture for Distributed Scientific Computing | Haochen Pan, Ryan Chard, Sicheng Zhou, Alok Kamatar, Rafael Vescovi, Valérie Hayot-Sasson, André Bauer, Maxime Gonthier, Kyle Chard, Ian Foster | 2024-07-16 | Download | Scientific research increasingly relies on distributed computational resources, storage systems, networks, and instruments, ranging from HPC and cloud systems to edge devices. |
| Paralleling and Accelerating Arc Consistency Enforcement with Recurrent Tensor Computations | Mingqi Yang | 2024-07-16 | Download | We propose a new arc consistency enforcement paradigm that transforms arc consistency enforcement into recurrent tensor operations. In each iteration of the recurrence, all involved processes can be f... |
| Detection of Global Anomalies on Distributed IoT Edges with Device-to-Device Communication | Hideya Ochiai, Riku Nishihata, Eisuke Tomiyama, Yuwei Sun, Hiroshi Esaki | 2024-07-16 | Download | Anomaly detection is an important function in IoT applications for finding outliers caused by abnormal events. Anomaly detection sometimes comes with high-frequency data sampling which should be carri... |
| Edge-Mapping of Service Function Trees for Sensor Event Processing | Babar Shahzaad, Alistair Barros, Colin Fidge | 2024-07-16 | Download | Fog computing offers increased performance and efficiency for Industrial Internet of Things (IIoT) applications through distributed data processing in nearby proximity to sensors. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A UAV-assisted Wireless Localization Challenge on AERPAW | Paul Kudyba, Jaya Sravani Mandapaka, Weijie Wang, Logan McCorkendale, Zachary McCorkendale, Mathias Kidane, Haijian Sun, Eric Adams, Kamesh Namuduri, Fraida Fund, Mihail Sichitiu, Ozgur Ozdemir | 2024-07-16 | Download | As wireless researchers are tasked to enable wireless communication as infrastructure in more dynamic aerial settings, there is a growing need for large-scale experimental platforms that provide reali... |
| An Overview and Solution for Democratizing AI Workflows at the Network Edge | Andrej Čop, Blaž Bertalanič, Carolina Fortuna | 2024-07-16 | Download | With the process of democratization of the network edge, hardware and software for networks are becoming available to the public, overcoming the confines of traditional cloud providers and network ope... |
| Performance Analysis of Internet of Vehicles Mesh Networks Based on Actual Switch Models | Jialin Hu, Zhiyuan Ren, Wenchi Cheng, Zhiliang Shuai, Zhao Li | 2024-07-16 | Download | The rapid growth of the automotive industry has exacerbated the conflict between the complex traffic environment, increasing communication demands, and limited resources. |
| Spatial-spectral Cell-free Networks: A Large-scale Case Study | Zesheng Zhu, Lifeng Wang, Xin Wang, Dongming Wang, Kai-Kit Wong | 2024-07-16 | Download | This paper studies the large-scale cell-free networks where dense distributed access points (APs) serve many users. As a promising next-generation network architecture, cell-free networks enable ultra... |
| Analytical Performance Estimations for Quantum Repeater Network Scenarios | Allen Zang, Joaquin Chung, Rajkumar Kettimuthu, Martin Suchara, Tian Zhong | 2024-07-16 | Download | Quantum repeater chains will form the backbone of future quantum networks that distribute entanglement between network nodes. Therefore, it is important to understand the entanglement distribution per... |
| Digital Twin Vehicular Edge Computing Network: Task Offloading and Resource Allocation | Yu Xie, Qiong Wu, Pingyi Fan | 2024-07-16 | Download | With the increasing demand for multiple applications on the internet of vehicles, vehicles are required to carry out multiple computing tasks in real time. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Characterizing and Understanding HGNN Training on GPUs | Dengke Han, Mingyu Yan, Xiaochun Ye, Dongrui Fan | 2024-07-16 | Download | Owing to their remarkable representation capabilities for heterogeneous graph data, Heterogeneous Graph Neural Networks (HGNNs) have been widely adopted in many critical real-world domains such as rec... |
| Estimating the Energy Footprint of Software Systems: a Primer | Fernando Castor | 2024-07-16 | Download | In Green Software Development, quantifying the energy footprint of a software system is one of the most basic activities. This document provides a high-level overview of how the energy footprint of a... |
| Reducing Tail Latencies Through Environment- and Neighbour-aware Thread Management | Andrew Jeffery, Chris Jensen, Richard Mortier | 2024-07-16 | Download | Application tail latency is a key metric for many services, with high latencies being linked directly to loss of revenue. Modern deeply-nested micro-service architectures exacerbate tail latencies, in... |
| Bringing Auto-tuning to HIP: Analysis of Tuning Impact and Difficulty on AMD and Nvidia GPUs | Milo Lurati, Stijn Heldens, Alessio Sclocco, Ben van Werkhoven | 2024-07-16 | Download | Many studies have focused on developing and improving auto-tuning algorithms for Nvidia Graphics Processing Units (GPUs), but the effectiveness and efficiency of these approaches on AMD devices have h... |
| Analytical Performance Estimations for Quantum Repeater Network Scenarios | Allen Zang, Joaquin Chung, Rajkumar Kettimuthu, Martin Suchara, Tian Zhong | 2024-07-16 | Download | Quantum repeater chains will form the backbone of future quantum networks that distribute entanglement between network nodes. Therefore, it is important to understand the entanglement distribution per... |
