2024-07-16

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Characterizing and Understanding HGNN Training on GPUs	Dengke Han, Mingyu Yan, Xiaochun Ye, Dongrui Fan	2024-07-16	下载	Owing to their remarkable representation capabilities for heterogeneous graph data, Heterogeneous Graph Neural Networks (HGNNs) have been widely adopted in many critical real-world domains such as rec...
Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC)	Seyed Nima Omidsajedi, Rekha Reddy, Jianming Yi, Jan Herbst, Christoph Lipps, Hans Dieter Schotten	2024-07-16	下载	Almost in every heavily computation-dependent application, from 6G communication systems to autonomous driving platforms, a large portion of computing should be near to the client side.
ApproxPilot: A GNN-based Accelerator Approximation Framework	Qing Zhang, Cheng Liu, Siting Liu, Yajuan Hui, Huawei Li, Xiaowei Li	2024-07-16	下载	A typical optimization of customized accelerators for error-tolerant applications such as multimedia, recognition, and classification is to replace traditional arithmetic units like multipliers and ad...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
The Latency Price of Threshold Cryptosystem in Blockchains	Zhuolun Xiang, Sourav Das, Zekun Li, Zhoujun Ma, Alexander Spiegelman	2024-07-16	下载	Threshold cryptography is essential for many blockchain protocols. For example, many protocols rely on threshold common coin to implement asynchronous consensus, leader elections, and provide support ...
Building AI Agents for Autonomous Clouds: Challenges and Design Principles	Manish Shetty, Yinfang Chen, Gagan Somashekar, Minghua Ma, Yogesh Simmhan, Xuchao Zhang, Jonathan Mace, Dax Vandevoorde, Pedro Las-Casas, Shachee Mishra Gupta, Suman Nath, Chetan Bansal, Saravan Rajmohan	2024-07-16	下载	The rapid growth in the use of Large Language Models (LLMs) and AI Agents as part of software development and deployment is revolutionizing the information technology landscape.
Gaming and Blockchain: Hype and Reality	Max McGuinness	2024-07-16	下载	This paper explores the adoption of blockchain technology in the gaming industry. While supporters affirm that distributed ledger technology has potential to revolutionize gaming economies and provide...
MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training	Pinxue Zhao, Hailin Zhang, Fangcheng Fu, Xiaonan Nie, Qibin Liu, Fang Yang, Yuanbo Peng, Dian Jiao, Shuaipeng Li, Jinbao Xue, Yangyu Tao, Bin Cui	2024-07-16	下载	Nowadays, Large Language Models (LLMs) have been trained using extended context lengths to foster more creative applications. However, long context training poses great challenges considering the cons...
Hydra: Brokering Cloud and HPC Resources to Support the Execution of Heterogeneous Workloads at Scale	Aymen Alsaadi, Shantenu Jha, Matteo Turilli	2024-07-16	下载	Scientific discovery increasingly depends on middleware that enables the execution of heterogeneous workflows on heterogeneous platforms One of the main challenges is to design software components tha...
zIA: a GenAI-powered local auntie assists tourists in Italy	Alexio Cassani, Michele Ruberl, Antonio Salis, Giacomo Giannese, Gianluca Boanelli	2024-07-16	下载	The Tourism and Destination Management Organization (DMO) industry is rapidly evolving to adapt to new technologies and traveler expectations.
Scalable and Reliable Over-the-Air Federated Edge Learning	Maximilian Egger, Christoph Hofmeister, Cem Kaya, Rawad Bitar, Antonia Wachter-Zeh	2024-07-16	下载	Federated edge learning (FEEL) has emerged as a core paradigm for large-scale optimization. However, FEEL still suffers from a communication bottleneck due to the transmission of high-dimensional mode...
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation	Branden Butler, Sixing Yu, Arya Mazaheri, Ali Jannesari	2024-07-16	下载	Inference of Large Language Models (LLMs) across computer clusters has become a focal point of research in recent times, with many acceleration techniques taking inspiration from CPU speculative execu...
Enhancing Split Computing and Early Exit Applications through Predefined Sparsity	Luigi Capogrosso, Enrico Fraccaroli, Giulio Petrozziello, Francesco Setti, Samarjit Chakraborty, Franco Fummi, Marco Cristani	2024-07-16	下载	In the past decade, Deep Neural Networks (DNNs) achieved state-of-the-art performance in a broad range of problems, spanning from object classification and action recognition to smart building and hea...
Self-Regulating Random Walks for Resilient Decentralized Learning on Graphs	Maximilian Egger, Rawad Bitar, Ghadir Ayache, Antonia Wachter-Zeh, Salim El Rouayheb	2024-07-16	下载	Consider the setting of multiple random walks (RWs) on a graph executing a certain computational task. For instance, in decentralized learning via RWs, a model is updated at each iteration based on th...
Revolutionizing MRI Data Processing Using FSL: Preliminary Findings with the Fugaku Supercomputer	Tianxiang Lyu, Wataru Uchida, Zhe Sun, Christina Andica, Keita Tokuda, Rui Zou, Jie Mao, Keigo Shimoji, Koji Kamagata, Mitsuhisa Sato, Ryutaro Himeno, Shigeki Aoki	2024-07-16	下载	The amount of Magnetic resonance imaging data has grown tremendously recently, creating an urgent need to accelerate data processing, which requires substantial computational resources and time.
Reducing Tail Latencies Through Environment- and Neighbour-aware Thread Management	Andrew Jeffery, Chris Jensen, Richard Mortier	2024-07-16	下载	Application tail latency is a key metric for many services, with high latencies being linked directly to loss of revenue. Modern deeply-nested micro-service architectures exacerbate tail latencies, in...
Finite State Machines-Based Path-Following Collaborative Computing Strategy for Emergency UAV Swarms	Jialin Hu, Zhiyuan Ren, Wenchi Cheng	2024-07-16	下载	Offloading services to UAV swarms for delay-sensitive tasks in Emergency UAV Networks (EUN) can greatly enhance rescue efficiency. Most task-offloading strategies assumed that UAVs were location-fixed...
Bringing Auto-tuning to HIP: Analysis of Tuning Impact and Difficulty on AMD and Nvidia GPUs	Milo Lurati, Stijn Heldens, Alessio Sclocco, Ben van Werkhoven	2024-07-16	下载	Many studies have focused on developing and improving auto-tuning algorithms for Nvidia Graphics Processing Units (GPUs), but the effectiveness and efficiency of these approaches on AMD devices have h...
Performance Analysis of Internet of Vehicles Mesh Networks Based on Actual Switch Models	Jialin Hu, Zhiyuan Ren, Wenchi Cheng, Zhiliang Shuai, Zhao Li	2024-07-16	下载	The rapid growth of the automotive industry has exacerbated the conflict between the complex traffic environment, increasing communication demands, and limited resources.
Fast Iterative Graph Computing with Updated Neighbor States	Yijie Zhou, Shufeng Gong, Feng Yao, Hanzhang Chen, Song Yu, Pengxi Liu, Yanfeng Zhang, Ge Yu, Jeffrey Xu Yu	2024-07-16	下载	Enhancing the efficiency of iterative computation on graphs has garnered considerable attention in both industry and academia. Nonetheless, the majority of efforts focus on expediting iterative comput...
Cloud-based Semi-Quantum Money	Yichi Zhang, Siyuan Jin, Yuhan Huang, Bei Zeng, Qiming Shao	2024-07-16	下载	In the 1970s, Wiesner introduced the concept of quantum money, where quantum states generated according to specific rules function as currency.
Octopus: Experiences with a Hybrid Event-Driven Architecture for Distributed Scientific Computing	Haochen Pan, Ryan Chard, Sicheng Zhou, Alok Kamatar, Rafael Vescovi, Valérie Hayot-Sasson, André Bauer, Maxime Gonthier, Kyle Chard, Ian Foster	2024-07-16	下载	Scientific research increasingly relies on distributed computational resources, storage systems, networks, and instruments, ranging from HPC and cloud systems to edge devices.
Paralleling and Accelerating Arc Consistency Enforcement with Recurrent Tensor Computations	Mingqi Yang	2024-07-16	下载	We propose a new arc consistency enforcement paradigm that transforms arc consistency enforcement into recurrent tensor operations. In each iteration of the recurrence, all involved processes can be f...
Detection of Global Anomalies on Distributed IoT Edges with Device-to-Device Communication	Hideya Ochiai, Riku Nishihata, Eisuke Tomiyama, Yuwei Sun, Hiroshi Esaki	2024-07-16	下载	Anomaly detection is an important function in IoT applications for finding outliers caused by abnormal events. Anomaly detection sometimes comes with high-frequency data sampling which should be carri...
Edge-Mapping of Service Function Trees for Sensor Event Processing	Babar Shahzaad, Alistair Barros, Colin Fidge	2024-07-16	下载	Fog computing offers increased performance and efficiency for Industrial Internet of Things (IIoT) applications through distributed data processing in nearby proximity to sensors.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
A UAV-assisted Wireless Localization Challenge on AERPAW	Paul Kudyba, Jaya Sravani Mandapaka, Weijie Wang, Logan McCorkendale, Zachary McCorkendale, Mathias Kidane, Haijian Sun, Eric Adams, Kamesh Namuduri, Fraida Fund, Mihail Sichitiu, Ozgur Ozdemir	2024-07-16	下载	As wireless researchers are tasked to enable wireless communication as infrastructure in more dynamic aerial settings, there is a growing need for large-scale experimental platforms that provide reali...
An Overview and Solution for Democratizing AI Workflows at the Network Edge	Andrej Čop, Blaž Bertalanič, Carolina Fortuna	2024-07-16	下载	With the process of democratization of the network edge, hardware and software for networks are becoming available to the public, overcoming the confines of traditional cloud providers and network ope...
Performance Analysis of Internet of Vehicles Mesh Networks Based on Actual Switch Models	Jialin Hu, Zhiyuan Ren, Wenchi Cheng, Zhiliang Shuai, Zhao Li	2024-07-16	下载	The rapid growth of the automotive industry has exacerbated the conflict between the complex traffic environment, increasing communication demands, and limited resources.
Spatial-spectral Cell-free Networks: A Large-scale Case Study	Zesheng Zhu, Lifeng Wang, Xin Wang, Dongming Wang, Kai-Kit Wong	2024-07-16	下载	This paper studies the large-scale cell-free networks where dense distributed access points (APs) serve many users. As a promising next-generation network architecture, cell-free networks enable ultra...
Analytical Performance Estimations for Quantum Repeater Network Scenarios	Allen Zang, Joaquin Chung, Rajkumar Kettimuthu, Martin Suchara, Tian Zhong	2024-07-16	下载	Quantum repeater chains will form the backbone of future quantum networks that distribute entanglement between network nodes. Therefore, it is important to understand the entanglement distribution per...
Digital Twin Vehicular Edge Computing Network: Task Offloading and Resource Allocation	Yu Xie, Qiong Wu, Pingyi Fan	2024-07-16	下载	With the increasing demand for multiple applications on internet of vehicles. It requires vehicles to carry out multiple computing tasks in real time.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Characterizing and Understanding HGNN Training on GPUs	Dengke Han, Mingyu Yan, Xiaochun Ye, Dongrui Fan	2024-07-16	下载	Owing to their remarkable representation capabilities for heterogeneous graph data, Heterogeneous Graph Neural Networks (HGNNs) have been widely adopted in many critical real-world domains such as rec...
Estimating the Energy Footprint of Software Systems: a Primer	Fernando Castor	2024-07-16	下载	In Green Software Development, quantifying the energy footprint of a software system is one of the most basic activities. This documents provides a high-level overview of how the energy footprint of a...
Reducing Tail Latencies Through Environment- and Neighbour-aware Thread Management	Andrew Jeffery, Chris Jensen, Richard Mortier	2024-07-16	下载	Application tail latency is a key metric for many services, with high latencies being linked directly to loss of revenue. Modern deeply-nested micro-service architectures exacerbate tail latencies, in...
Bringing Auto-tuning to HIP: Analysis of Tuning Impact and Difficulty on AMD and Nvidia GPUs	Milo Lurati, Stijn Heldens, Alessio Sclocco, Ben van Werkhoven	2024-07-16	下载	Many studies have focused on developing and improving auto-tuning algorithms for Nvidia Graphics Processing Units (GPUs), but the effectiveness and efficiency of these approaches on AMD devices have h...
Analytical Performance Estimations for Quantum Repeater Network Scenarios	Allen Zang, Joaquin Chung, Rajkumar Kettimuthu, Martin Suchara, Tian Zhong	2024-07-16	下载	Quantum repeater chains will form the backbone of future quantum networks that distribute entanglement between network nodes. Therefore, it is important to understand the entanglement distribution per...

2024-07-16 ​

cs.AR - Architecture ​

cs.DC - Distributed, Parallel, and Cluster Computing ​

cs.NI - Networking and Internet Architecture ​

cs.PF - Performance ​

2024-07-16

cs.AR - Architecture

cs.DC - Distributed, Parallel, and Cluster Computing

cs.NI - Networking and Internet Architecture

cs.PF - Performance