Appearance
2024-09-09
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| RayFlex: An Open-Source RTL Implementation of the Hardware Ray Tracer Datapath | Fangjia Shen, Aaron Barnes, Anusuya Nallathambi, Timothy G. Rogers | 2024-09-09 | 下载 | The advent of hardware ray tracing (RT) units has brought unprecedented realism to real-time rendered computer graphics. However, the potential of these units extends beyond graphics, offering acceler... |
| Hardware Acceleration of Kolmogorov-Arnold Network (KAN) for Lightweight Edge Inference | Wei-Hsing Huang, Jianwei Jia, Yuyao Kong, Faaiq Waqar, Tai-Hao Wen, Meng-Fan Chang, Shimeng Yu | 2024-09-09 | 下载 | Recently, a novel model named Kolmogorov-Arnold Networks (KAN) has been proposed with the potential to achieve the functionality of traditional deep neural networks (DNNs) using orders of magnitude fe... |
| Fast Generation of Custom Floating-Point Spatial Filters on FPGAs | Nelson Campos, Eran Edirisinghe, Salva Chesnokov, Daniel Larkin | 2024-09-09 | 下载 | Convolutional Neural Networks (CNNs) have been utilised in many image and video processing applications. The convolution operator, also known as a spatial filter, is usually a linear operation, but th... |
| The Quest to Build Trust Earlier in Digital Design | Benjamin Tan | 2024-09-09 | 下载 | The ever-rising complexity of computer systems presents challenges for maintaining security and trust throughout their lifetime. As hardware forms the foundation of a secure system, we need tools and ... |
| DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects | Xu Zhang, Ke Liu, Yisong Chang, Ke Zhang, Mingyu Chen | 2024-09-09 | 下载 | Emerging interconnects, such as CXL and NVLink, have been integrated into the intra-host topology to scale more accelerators and facilitate efficient communication between them, such as GPUs. |
| The Unseen AI Disruptions for Power Grids: LLM-Induced Transients | Yuzhuo Li, Mariam Mughees, Yize Chen, Yunwei Ryan Li | 2024-09-09 | 下载 | Recent breakthroughs of large language models (LLMs) have exhibited superior capability across major industries and stimulated multi-hundred-billion-dollar investment in AI-centric data centers in the... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| DNA sequence alignment: An assignment for OpenMP, MPI, and CUDA/OpenCL | Arturo Gonzalez-Escribano, Diego García-Álvarez, Jesús Cámara | 2024-09-09 | 下载 | We present an assignment for a full Parallel Computing course. Since 2017/2018, we have proposed a different problem each academic year to illustrate various methodologies for approaching the same com... |
| A Thorough Investigation of Content-Defined Chunking Algorithms for Data Deduplication | Marcel Gregoriadis, Leonhard Balduf, Björn Scheuermann, Johan Pouwelse | 2024-09-09 | 下载 | Data deduplication emerged as a powerful solution for reducing storage and bandwidth costs in cloud settings by eliminating redundancies at the level of chunks. |
| OciorCOOL: Faster Byzantine Agreement and Reliable Broadcast | Jinyuan Chen | 2024-09-09 | 下载 | COOL (Chen'21) is an error-free and deterministic Byzantine agreement protocol that achieves consensus on an -bit message with a communication complexity of bits ... |
| FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations | Ziyao Wang, Zheyu Shen, Yexiao He, Guoheng Sun, Hongyi Wang, Lingjuan Lyu, Ang Li | 2024-09-09 | 下载 | The rapid development of Large Language Models (LLMs) has been pivotal in advancing AI, with pre-trained LLMs being adaptable to diverse downstream tasks through fine-tuning. |
| NeurLZ: An Online Neural Learning-Based Method to Enhance Scientific Lossy Compression | Wenqi Jia, Zhewen Hu, Youyuan Liu, Boyuan Zhang, Jinzhen Wang, Jinyang Liu, Wei Niu, Stavros Kalafatis, Junzhou Huang, Sian Jin, Daoce Wang, Jiannan Tian, Miao Yin | 2024-09-09 | 下载 | Large-scale scientific simulations generate massive datasets, posing challenges for storage and I/O. Traditional lossy compression struggles to advance more in balancing compression ratio, data qualit... |
| Consensus-based Distributed Quantum Kernel Learning for Speech Recognition | Kuan-Cheng Chen, Wenxuan Ma, Xiaotian Xu | 2024-09-09 | 下载 | This paper presents a Consensus-based Distributed Quantum Kernel Learning (CDQKL) framework aimed at improving speech recognition through distributed quantum computing. |
| Model Input Verification of Large Scale Simulations | Rumyana Neykova, Derek Groen | 2024-09-09 | 下载 | Reliable simulations are critical for analyzing and understanding complex systems, but their accuracy depends on correct input data. Incorrect inputs such as invalid or out-of-range values, missing da... |
| CoBo: Collaborative Learning via Bilevel Optimization | Diba Hashemi, Lie He, Martin Jaggi | 2024-09-09 | 下载 | Collaborative learning is an important tool to train multiple clients more effectively by enabling communication among clients. Identifying helpful clients, however, presents challenging and often int... |
| Scalable Time-Series Causal Discovery with Approximate Causal Ordering | Ziyang Jiao, Ce Guo, Wayne Luk | 2024-09-09 | 下载 | Causal discovery in time-series data presents a significant computational challenge. Standard algorithms are often prohibitively expensive for datasets with many variables or samples. |
| DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects | Xu Zhang, Ke Liu, Yisong Chang, Ke Zhang, Mingyu Chen | 2024-09-09 | 下载 | Emerging interconnects, such as CXL and NVLink, have been integrated into the intra-host topology to scale more accelerators and facilitate efficient communication between them, such as GPUs. |
| Towards Practical Overlay Networks for Decentralized Federated Learning | Yifan Hua, Jinlong Pang, Xiaoxue Zhang, Yi Liu, Xiaofeng Shi, Bao Wang, Yang Liu, Chen Qian | 2024-09-09 | 下载 | Decentralized federated learning (DFL) uses peer-to-peer communication to avoid the single point of failure problem in federated learning and has been considered an attractive solution for machine lea... |
| Joint Model Assignment and Resource Allocation for Cost-Effective Mobile Generative Services | Shuangwei Gao, Peng Yang, Yuxin Kong, Feng Lyu, Ning Zhang | 2024-09-09 | 下载 | Artificial Intelligence Generated Content (AIGC) services can efficiently satisfy user-specified content creation demands, but the high computational requirements pose various challenges to supporting... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Positioning of a Next Generation Mobile Cell to Maximise Aggregate Network Capacity | Paulo Furtado Correia, Andre Coelho, Manuel Ricardo | 2024-09-09 | 下载 | In wireless communications, the need to cover operation areas, such as seaports, is at the forefront of discussion, especially regarding network capacity provisioning. |
| When Learning Meets Dynamics: Distributed User Connectivity Maximization in UAV-Based Communication Networks | Bowei Li, Saugat Tripathi, Salman Hosain, Ran Zhang, Jiang, Xie, Miao Wang | 2024-09-09 | 下载 | Distributed management over Unmanned Aerial Vehicle (UAV) based communication networks (UCNs) has attracted increasing research attention. In this work, we study a distributed user connectivity maximi... |
| Coordinated Sampling in SDNs with Dynamic Flow Rates | Soroosh Esmaeilian, Mahdi Dolati, Sogand Sadrhaghighi, Majid Ghaderi | 2024-09-09 | 下载 | Traffic sampling has become an indispensable tool in network management. While there exists a plethora of sampling systems, they generally assume flow rates are stable and predictable over a sampling ... |
| Optimizing Vehicular Users Association in Urban Mobile Networks | Geymerson S. Ramos, Razvan Stanica, Rian G. S. Pinheiro, Andre L. L. Aquino | 2024-09-09 | 下载 | This study aims to optimize vehicular user association to base stations in a mobile network. We propose an efficient heuristic solution that considers the base station average handover frequency, the ... |
| Towards Resilient 6G O-RAN: An Energy-Efficient URLLC Resource Allocation Framework | Rana M. Sohaib, Syed Tariq Shah, Poonam Yadav | 2024-09-09 | 下载 | The demands of ultra-reliable low-latency communication (URLLC) in ``NextG" cellular networks necessitate innovative approaches for efficient resource utilisation. |
| Adaptive Multi-Layer Deployment for A Digital Twin Empowered Satellite-Terrestrial Integrated Network | Yihong Tao, Bo Lei, Haoyang Shi, Jingkai Chen, Xing Zhang | 2024-09-09 | 下载 | With the development of satellite communication technology, satellite-terrestrial integrated networks (STIN), which integrate satellite networks and ground networks, can realize seamless global covera... |
| Validation of Practicality for CSI Sensing Utilizing Machine Learning | Tomoya Tanaka, Ayumu Yabuki, Mizuki Funakoshi, Ryo Yonemoto | 2024-09-09 | 下载 | In this study, we leveraged Channel State Information (CSI), commonly utilized in WLAN communication, as training data to develop and evaluate five distinct machine learning models for recognizing hum... |
| Towards Practical Overlay Networks for Decentralized Federated Learning | Yifan Hua, Jinlong Pang, Xiaoxue Zhang, Yi Liu, Xiaofeng Shi, Bao Wang, Yang Liu, Chen Qian | 2024-09-09 | 下载 | Decentralized federated learning (DFL) uses peer-to-peer communication to avoid the single point of failure problem in federated learning and has been considered an attractive solution for machine lea... |
| Robotic Ad-Hoc Networks | Marius Silaghi, Khulud Alawaji, Mohammed Alghamdi, Akram Alghanmi, Ameerah Alsulami | 2024-09-09 | 下载 | Practical robotic adhoc networks (RANETs), a type of mobile wireless adhoc networks (WANETs) supporting the WiFi-Direct modes common in internet of things and phone devices, is proposed based on a str... |
| How We Lost The Internet | Micah Beck, Terry Moore | 2024-09-09 | 下载 | In this paper we reexamine an assumption that underpinned the development of the Internet architecture, namely that a stateless and loosely synchronous point-to-point datagram delivery service would b... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Scalable Time-Series Causal Discovery with Approximate Causal Ordering | Ziyang Jiao, Ce Guo, Wayne Luk | 2024-09-09 | 下载 | Causal discovery in time-series data presents a significant computational challenge. Standard algorithms are often prohibitively expensive for datasets with many variables or samples. |
| The Unseen AI Disruptions for Power Grids: LLM-Induced Transients | Yuzhuo Li, Mariam Mughees, Yize Chen, Yunwei Ryan Li | 2024-09-09 | 下载 | Recent breakthroughs of large language models (LLMs) have exhibited superior capability across major industries and stimulated multi-hundred-billion-dollar investment in AI-centric data centers in the... |