Skip to content

2024-09-09

cs.AR - Architecture

标题作者发布日期PDF摘要
RayFlex: An Open-Source RTL Implementation of the Hardware Ray Tracer DatapathFangjia Shen, Aaron Barnes, Anusuya Nallathambi, Timothy G. Rogers2024-09-09下载The advent of hardware ray tracing (RT) units has brought unprecedented realism to real-time rendered computer graphics. However, the potential of these units extends beyond graphics, offering acceler...
Hardware Acceleration of Kolmogorov-Arnold Network (KAN) for Lightweight Edge InferenceWei-Hsing Huang, Jianwei Jia, Yuyao Kong, Faaiq Waqar, Tai-Hao Wen, Meng-Fan Chang, Shimeng Yu2024-09-09下载Recently, a novel model named Kolmogorov-Arnold Networks (KAN) has been proposed with the potential to achieve the functionality of traditional deep neural networks (DNNs) using orders of magnitude fe...
Fast Generation of Custom Floating-Point Spatial Filters on FPGAsNelson Campos, Eran Edirisinghe, Salva Chesnokov, Daniel Larkin2024-09-09下载Convolutional Neural Networks (CNNs) have been utilised in many image and video processing applications. The convolution operator, also known as a spatial filter, is usually a linear operation, but th...
The Quest to Build Trust Earlier in Digital DesignBenjamin Tan2024-09-09下载The ever-rising complexity of computer systems presents challenges for maintaining security and trust throughout their lifetime. As hardware forms the foundation of a secure system, we need tools and ...
DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid InterconnectsXu Zhang, Ke Liu, Yisong Chang, Ke Zhang, Mingyu Chen2024-09-09下载Emerging interconnects, such as CXL and NVLink, have been integrated into the intra-host topology to scale more accelerators and facilitate efficient communication between them, such as GPUs.
The Unseen AI Disruptions for Power Grids: LLM-Induced TransientsYuzhuo Li, Mariam Mughees, Yize Chen, Yunwei Ryan Li2024-09-09下载Recent breakthroughs of large language models (LLMs) have exhibited superior capability across major industries and stimulated multi-hundred-billion-dollar investment in AI-centric data centers in the...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
DNA sequence alignment: An assignment for OpenMP, MPI, and CUDA/OpenCLArturo Gonzalez-Escribano, Diego García-Álvarez, Jesús Cámara2024-09-09下载We present an assignment for a full Parallel Computing course. Since 2017/2018, we have proposed a different problem each academic year to illustrate various methodologies for approaching the same com...
A Thorough Investigation of Content-Defined Chunking Algorithms for Data DeduplicationMarcel Gregoriadis, Leonhard Balduf, Björn Scheuermann, Johan Pouwelse2024-09-09下载Data deduplication emerged as a powerful solution for reducing storage and bandwidth costs in cloud settings by eliminating redundancies at the level of chunks.
OciorCOOL: Faster Byzantine Agreement and Reliable BroadcastJinyuan Chen2024-09-09下载COOL (Chen'21) is an error-free and deterministic Byzantine agreement protocol that achieves consensus on an \ell-bit message with a communication complexity of O(max{n,ntlogt})O(\max\{n\ell, n t \log t \}) bits ...
FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank AdaptationsZiyao Wang, Zheyu Shen, Yexiao He, Guoheng Sun, Hongyi Wang, Lingjuan Lyu, Ang Li2024-09-09下载The rapid development of Large Language Models (LLMs) has been pivotal in advancing AI, with pre-trained LLMs being adaptable to diverse downstream tasks through fine-tuning.
NeurLZ: An Online Neural Learning-Based Method to Enhance Scientific Lossy CompressionWenqi Jia, Zhewen Hu, Youyuan Liu, Boyuan Zhang, Jinzhen Wang, Jinyang Liu, Wei Niu, Stavros Kalafatis, Junzhou Huang, Sian Jin, Daoce Wang, Jiannan Tian, Miao Yin2024-09-09下载Large-scale scientific simulations generate massive datasets, posing challenges for storage and I/O. Traditional lossy compression struggles to advance more in balancing compression ratio, data qualit...
Consensus-based Distributed Quantum Kernel Learning for Speech RecognitionKuan-Cheng Chen, Wenxuan Ma, Xiaotian Xu2024-09-09下载This paper presents a Consensus-based Distributed Quantum Kernel Learning (CDQKL) framework aimed at improving speech recognition through distributed quantum computing.
Model Input Verification of Large Scale SimulationsRumyana Neykova, Derek Groen2024-09-09下载Reliable simulations are critical for analyzing and understanding complex systems, but their accuracy depends on correct input data. Incorrect inputs such as invalid or out-of-range values, missing da...
CoBo: Collaborative Learning via Bilevel OptimizationDiba Hashemi, Lie He, Martin Jaggi2024-09-09下载Collaborative learning is an important tool to train multiple clients more effectively by enabling communication among clients. Identifying helpful clients, however, presents challenging and often int...
Scalable Time-Series Causal Discovery with Approximate Causal OrderingZiyang Jiao, Ce Guo, Wayne Luk2024-09-09下载Causal discovery in time-series data presents a significant computational challenge. Standard algorithms are often prohibitively expensive for datasets with many variables or samples.
DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid InterconnectsXu Zhang, Ke Liu, Yisong Chang, Ke Zhang, Mingyu Chen2024-09-09下载Emerging interconnects, such as CXL and NVLink, have been integrated into the intra-host topology to scale more accelerators and facilitate efficient communication between them, such as GPUs.
Towards Practical Overlay Networks for Decentralized Federated LearningYifan Hua, Jinlong Pang, Xiaoxue Zhang, Yi Liu, Xiaofeng Shi, Bao Wang, Yang Liu, Chen Qian2024-09-09下载Decentralized federated learning (DFL) uses peer-to-peer communication to avoid the single point of failure problem in federated learning and has been considered an attractive solution for machine lea...
Joint Model Assignment and Resource Allocation for Cost-Effective Mobile Generative ServicesShuangwei Gao, Peng Yang, Yuxin Kong, Feng Lyu, Ning Zhang2024-09-09下载Artificial Intelligence Generated Content (AIGC) services can efficiently satisfy user-specified content creation demands, but the high computational requirements pose various challenges to supporting...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Positioning of a Next Generation Mobile Cell to Maximise Aggregate Network CapacityPaulo Furtado Correia, Andre Coelho, Manuel Ricardo2024-09-09下载In wireless communications, the need to cover operation areas, such as seaports, is at the forefront of discussion, especially regarding network capacity provisioning.
When Learning Meets Dynamics: Distributed User Connectivity Maximization in UAV-Based Communication NetworksBowei Li, Saugat Tripathi, Salman Hosain, Ran Zhang, Jiang, Xie, Miao Wang2024-09-09下载Distributed management over Unmanned Aerial Vehicle (UAV) based communication networks (UCNs) has attracted increasing research attention. In this work, we study a distributed user connectivity maximi...
Coordinated Sampling in SDNs with Dynamic Flow RatesSoroosh Esmaeilian, Mahdi Dolati, Sogand Sadrhaghighi, Majid Ghaderi2024-09-09下载Traffic sampling has become an indispensable tool in network management. While there exists a plethora of sampling systems, they generally assume flow rates are stable and predictable over a sampling ...
Optimizing Vehicular Users Association in Urban Mobile NetworksGeymerson S. Ramos, Razvan Stanica, Rian G. S. Pinheiro, Andre L. L. Aquino2024-09-09下载This study aims to optimize vehicular user association to base stations in a mobile network. We propose an efficient heuristic solution that considers the base station average handover frequency, the ...
Towards Resilient 6G O-RAN: An Energy-Efficient URLLC Resource Allocation FrameworkRana M. Sohaib, Syed Tariq Shah, Poonam Yadav2024-09-09下载The demands of ultra-reliable low-latency communication (URLLC) in ``NextG" cellular networks necessitate innovative approaches for efficient resource utilisation.
Adaptive Multi-Layer Deployment for A Digital Twin Empowered Satellite-Terrestrial Integrated NetworkYihong Tao, Bo Lei, Haoyang Shi, Jingkai Chen, Xing Zhang2024-09-09下载With the development of satellite communication technology, satellite-terrestrial integrated networks (STIN), which integrate satellite networks and ground networks, can realize seamless global covera...
Validation of Practicality for CSI Sensing Utilizing Machine LearningTomoya Tanaka, Ayumu Yabuki, Mizuki Funakoshi, Ryo Yonemoto2024-09-09下载In this study, we leveraged Channel State Information (CSI), commonly utilized in WLAN communication, as training data to develop and evaluate five distinct machine learning models for recognizing hum...
Towards Practical Overlay Networks for Decentralized Federated LearningYifan Hua, Jinlong Pang, Xiaoxue Zhang, Yi Liu, Xiaofeng Shi, Bao Wang, Yang Liu, Chen Qian2024-09-09下载Decentralized federated learning (DFL) uses peer-to-peer communication to avoid the single point of failure problem in federated learning and has been considered an attractive solution for machine lea...
Robotic Ad-Hoc NetworksMarius Silaghi, Khulud Alawaji, Mohammed Alghamdi, Akram Alghanmi, Ameerah Alsulami2024-09-09下载Practical robotic adhoc networks (RANETs), a type of mobile wireless adhoc networks (WANETs) supporting the WiFi-Direct modes common in internet of things and phone devices, is proposed based on a str...
How We Lost The InternetMicah Beck, Terry Moore2024-09-09下载In this paper we reexamine an assumption that underpinned the development of the Internet architecture, namely that a stateless and loosely synchronous point-to-point datagram delivery service would b...

cs.PF - Performance

标题作者发布日期PDF摘要
Scalable Time-Series Causal Discovery with Approximate Causal OrderingZiyang Jiao, Ce Guo, Wayne Luk2024-09-09下载Causal discovery in time-series data presents a significant computational challenge. Standard algorithms are often prohibitively expensive for datasets with many variables or samples.
The Unseen AI Disruptions for Power Grids: LLM-Induced TransientsYuzhuo Li, Mariam Mughees, Yize Chen, Yunwei Ryan Li2024-09-09下载Recent breakthroughs of large language models (LLMs) have exhibited superior capability across major industries and stimulated multi-hundred-billion-dollar investment in AI-centric data centers in the...

基于 VitePress 构建