Appearance
2024-07-16
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Characterizing and Understanding HGNN Training on GPUs | Dengke Han, Mingyu Yan, Xiaochun Ye, Dongrui Fan | 2024-07-16 | 下载 | Owing to their remarkable representation capabilities for heterogeneous graph data, Heterogeneous Graph Neural Networks (HGNNs) have been widely adopted in many critical real-world domains such as rec... |
| Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC) | Seyed Nima Omidsajedi, Rekha Reddy, Jianming Yi, Jan Herbst, Christoph Lipps, Hans Dieter Schotten | 2024-07-16 | 下载 | Almost in every heavily computation-dependent application, from 6G communication systems to autonomous driving platforms, a large portion of computing should be near to the client side. |
| ApproxPilot: A GNN-based Accelerator Approximation Framework | Qing Zhang, Cheng Liu, Siting Liu, Yajuan Hui, Huawei Li, Xiaowei Li | 2024-07-16 | 下载 | A typical optimization of customized accelerators for error-tolerant applications such as multimedia, recognition, and classification is to replace traditional arithmetic units like multipliers and ad... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| The Latency Price of Threshold Cryptosystem in Blockchains | Zhuolun Xiang, Sourav Das, Zekun Li, Zhoujun Ma, Alexander Spiegelman | 2024-07-16 | 下载 | Threshold cryptography is essential for many blockchain protocols. For example, many protocols rely on threshold common coin to implement asynchronous consensus, leader elections, and provide support ... |
| Building AI Agents for Autonomous Clouds: Challenges and Design Principles | Manish Shetty, Yinfang Chen, Gagan Somashekar, Minghua Ma, Yogesh Simmhan, Xuchao Zhang, Jonathan Mace, Dax Vandevoorde, Pedro Las-Casas, Shachee Mishra Gupta, Suman Nath, Chetan Bansal, Saravan Rajmohan | 2024-07-16 | 下载 | The rapid growth in the use of Large Language Models (LLMs) and AI Agents as part of software development and deployment is revolutionizing the information technology landscape. |
| Gaming and Blockchain: Hype and Reality | Max McGuinness | 2024-07-16 | 下载 | This paper explores the adoption of blockchain technology in the gaming industry. While supporters affirm that distributed ledger technology has potential to revolutionize gaming economies and provide... |
| MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training | Pinxue Zhao, Hailin Zhang, Fangcheng Fu, Xiaonan Nie, Qibin Liu, Fang Yang, Yuanbo Peng, Dian Jiao, Shuaipeng Li, Jinbao Xue, Yangyu Tao, Bin Cui | 2024-07-16 | 下载 | Nowadays, Large Language Models (LLMs) have been trained using extended context lengths to foster more creative applications. However, long context training poses great challenges considering the cons... |
| Hydra: Brokering Cloud and HPC Resources to Support the Execution of Heterogeneous Workloads at Scale | Aymen Alsaadi, Shantenu Jha, Matteo Turilli | 2024-07-16 | 下载 | Scientific discovery increasingly depends on middleware that enables the execution of heterogeneous workflows on heterogeneous platforms One of the main challenges is to design software components tha... |
| zIA: a GenAI-powered local auntie assists tourists in Italy | Alexio Cassani, Michele Ruberl, Antonio Salis, Giacomo Giannese, Gianluca Boanelli | 2024-07-16 | 下载 | The Tourism and Destination Management Organization (DMO) industry is rapidly evolving to adapt to new technologies and traveler expectations. |
| Scalable and Reliable Over-the-Air Federated Edge Learning | Maximilian Egger, Christoph Hofmeister, Cem Kaya, Rawad Bitar, Antonia Wachter-Zeh | 2024-07-16 | 下载 | Federated edge learning (FEEL) has emerged as a core paradigm for large-scale optimization. However, FEEL still suffers from a communication bottleneck due to the transmission of high-dimensional mode... |
| PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation | Branden Butler, Sixing Yu, Arya Mazaheri, Ali Jannesari | 2024-07-16 | 下载 | Inference of Large Language Models (LLMs) across computer clusters has become a focal point of research in recent times, with many acceleration techniques taking inspiration from CPU speculative execu... |
| Enhancing Split Computing and Early Exit Applications through Predefined Sparsity | Luigi Capogrosso, Enrico Fraccaroli, Giulio Petrozziello, Francesco Setti, Samarjit Chakraborty, Franco Fummi, Marco Cristani | 2024-07-16 | 下载 | In the past decade, Deep Neural Networks (DNNs) achieved state-of-the-art performance in a broad range of problems, spanning from object classification and action recognition to smart building and hea... |
| Self-Regulating Random Walks for Resilient Decentralized Learning on Graphs | Maximilian Egger, Rawad Bitar, Ghadir Ayache, Antonia Wachter-Zeh, Salim El Rouayheb | 2024-07-16 | 下载 | Consider the setting of multiple random walks (RWs) on a graph executing a certain computational task. For instance, in decentralized learning via RWs, a model is updated at each iteration based on th... |
| Revolutionizing MRI Data Processing Using FSL: Preliminary Findings with the Fugaku Supercomputer | Tianxiang Lyu, Wataru Uchida, Zhe Sun, Christina Andica, Keita Tokuda, Rui Zou, Jie Mao, Keigo Shimoji, Koji Kamagata, Mitsuhisa Sato, Ryutaro Himeno, Shigeki Aoki | 2024-07-16 | 下载 | The amount of Magnetic resonance imaging data has grown tremendously recently, creating an urgent need to accelerate data processing, which requires substantial computational resources and time. |
| Reducing Tail Latencies Through Environment- and Neighbour-aware Thread Management | Andrew Jeffery, Chris Jensen, Richard Mortier | 2024-07-16 | 下载 | Application tail latency is a key metric for many services, with high latencies being linked directly to loss of revenue. Modern deeply-nested micro-service architectures exacerbate tail latencies, in... |
| Finite State Machines-Based Path-Following Collaborative Computing Strategy for Emergency UAV Swarms | Jialin Hu, Zhiyuan Ren, Wenchi Cheng | 2024-07-16 | 下载 | Offloading services to UAV swarms for delay-sensitive tasks in Emergency UAV Networks (EUN) can greatly enhance rescue efficiency. Most task-offloading strategies assumed that UAVs were location-fixed... |
| Bringing Auto-tuning to HIP: Analysis of Tuning Impact and Difficulty on AMD and Nvidia GPUs | Milo Lurati, Stijn Heldens, Alessio Sclocco, Ben van Werkhoven | 2024-07-16 | 下载 | Many studies have focused on developing and improving auto-tuning algorithms for Nvidia Graphics Processing Units (GPUs), but the effectiveness and efficiency of these approaches on AMD devices have h... |
| Performance Analysis of Internet of Vehicles Mesh Networks Based on Actual Switch Models | Jialin Hu, Zhiyuan Ren, Wenchi Cheng, Zhiliang Shuai, Zhao Li | 2024-07-16 | 下载 | The rapid growth of the automotive industry has exacerbated the conflict between the complex traffic environment, increasing communication demands, and limited resources. |
| Fast Iterative Graph Computing with Updated Neighbor States | Yijie Zhou, Shufeng Gong, Feng Yao, Hanzhang Chen, Song Yu, Pengxi Liu, Yanfeng Zhang, Ge Yu, Jeffrey Xu Yu | 2024-07-16 | 下载 | Enhancing the efficiency of iterative computation on graphs has garnered considerable attention in both industry and academia. Nonetheless, the majority of efforts focus on expediting iterative comput... |
| Cloud-based Semi-Quantum Money | Yichi Zhang, Siyuan Jin, Yuhan Huang, Bei Zeng, Qiming Shao | 2024-07-16 | 下载 | In the 1970s, Wiesner introduced the concept of quantum money, where quantum states generated according to specific rules function as currency. |
| Octopus: Experiences with a Hybrid Event-Driven Architecture for Distributed Scientific Computing | Haochen Pan, Ryan Chard, Sicheng Zhou, Alok Kamatar, Rafael Vescovi, Valérie Hayot-Sasson, André Bauer, Maxime Gonthier, Kyle Chard, Ian Foster | 2024-07-16 | 下载 | Scientific research increasingly relies on distributed computational resources, storage systems, networks, and instruments, ranging from HPC and cloud systems to edge devices. |
| Paralleling and Accelerating Arc Consistency Enforcement with Recurrent Tensor Computations | Mingqi Yang | 2024-07-16 | 下载 | We propose a new arc consistency enforcement paradigm that transforms arc consistency enforcement into recurrent tensor operations. In each iteration of the recurrence, all involved processes can be f... |
| Detection of Global Anomalies on Distributed IoT Edges with Device-to-Device Communication | Hideya Ochiai, Riku Nishihata, Eisuke Tomiyama, Yuwei Sun, Hiroshi Esaki | 2024-07-16 | 下载 | Anomaly detection is an important function in IoT applications for finding outliers caused by abnormal events. Anomaly detection sometimes comes with high-frequency data sampling which should be carri... |
| Edge-Mapping of Service Function Trees for Sensor Event Processing | Babar Shahzaad, Alistair Barros, Colin Fidge | 2024-07-16 | 下载 | Fog computing offers increased performance and efficiency for Industrial Internet of Things (IIoT) applications through distributed data processing in nearby proximity to sensors. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A UAV-assisted Wireless Localization Challenge on AERPAW | Paul Kudyba, Jaya Sravani Mandapaka, Weijie Wang, Logan McCorkendale, Zachary McCorkendale, Mathias Kidane, Haijian Sun, Eric Adams, Kamesh Namuduri, Fraida Fund, Mihail Sichitiu, Ozgur Ozdemir | 2024-07-16 | 下载 | As wireless researchers are tasked to enable wireless communication as infrastructure in more dynamic aerial settings, there is a growing need for large-scale experimental platforms that provide reali... |
| An Overview and Solution for Democratizing AI Workflows at the Network Edge | Andrej Čop, Blaž Bertalanič, Carolina Fortuna | 2024-07-16 | 下载 | With the process of democratization of the network edge, hardware and software for networks are becoming available to the public, overcoming the confines of traditional cloud providers and network ope... |
| Performance Analysis of Internet of Vehicles Mesh Networks Based on Actual Switch Models | Jialin Hu, Zhiyuan Ren, Wenchi Cheng, Zhiliang Shuai, Zhao Li | 2024-07-16 | 下载 | The rapid growth of the automotive industry has exacerbated the conflict between the complex traffic environment, increasing communication demands, and limited resources. |
| Spatial-spectral Cell-free Networks: A Large-scale Case Study | Zesheng Zhu, Lifeng Wang, Xin Wang, Dongming Wang, Kai-Kit Wong | 2024-07-16 | 下载 | This paper studies the large-scale cell-free networks where dense distributed access points (APs) serve many users. As a promising next-generation network architecture, cell-free networks enable ultra... |
| Analytical Performance Estimations for Quantum Repeater Network Scenarios | Allen Zang, Joaquin Chung, Rajkumar Kettimuthu, Martin Suchara, Tian Zhong | 2024-07-16 | 下载 | Quantum repeater chains will form the backbone of future quantum networks that distribute entanglement between network nodes. Therefore, it is important to understand the entanglement distribution per... |
| Digital Twin Vehicular Edge Computing Network: Task Offloading and Resource Allocation | Yu Xie, Qiong Wu, Pingyi Fan | 2024-07-16 | 下载 | With the increasing demand for multiple applications on internet of vehicles. It requires vehicles to carry out multiple computing tasks in real time. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Characterizing and Understanding HGNN Training on GPUs | Dengke Han, Mingyu Yan, Xiaochun Ye, Dongrui Fan | 2024-07-16 | 下载 | Owing to their remarkable representation capabilities for heterogeneous graph data, Heterogeneous Graph Neural Networks (HGNNs) have been widely adopted in many critical real-world domains such as rec... |
| Estimating the Energy Footprint of Software Systems: a Primer | Fernando Castor | 2024-07-16 | 下载 | In Green Software Development, quantifying the energy footprint of a software system is one of the most basic activities. This documents provides a high-level overview of how the energy footprint of a... |
| Reducing Tail Latencies Through Environment- and Neighbour-aware Thread Management | Andrew Jeffery, Chris Jensen, Richard Mortier | 2024-07-16 | 下载 | Application tail latency is a key metric for many services, with high latencies being linked directly to loss of revenue. Modern deeply-nested micro-service architectures exacerbate tail latencies, in... |
| Bringing Auto-tuning to HIP: Analysis of Tuning Impact and Difficulty on AMD and Nvidia GPUs | Milo Lurati, Stijn Heldens, Alessio Sclocco, Ben van Werkhoven | 2024-07-16 | 下载 | Many studies have focused on developing and improving auto-tuning algorithms for Nvidia Graphics Processing Units (GPUs), but the effectiveness and efficiency of these approaches on AMD devices have h... |
| Analytical Performance Estimations for Quantum Repeater Network Scenarios | Allen Zang, Joaquin Chung, Rajkumar Kettimuthu, Martin Suchara, Tian Zhong | 2024-07-16 | 下载 | Quantum repeater chains will form the backbone of future quantum networks that distribute entanglement between network nodes. Therefore, it is important to understand the entanglement distribution per... |