
2024-03-02

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Less is More: Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits | Chenhui Deng, Zichao Yue, Cunxi Yu, Gokce Sarar, Ryan Carey, Rajeev Jain, Zhiru Zhang | 2024-03-02 | Download | While graph neural networks (GNNs) have gained popularity for learning circuit representations in various electronic design automation (EDA) tasks, they face challenges in scalability when applied to ... |
| Performance evaluation of acceleration of convolutional layers on OpenEdgeCGRA | Nicolò Carpentieri, Juan Sapriza, Davide Schiavone, Daniele Jahier Pagliari, David Atienza, Maurizio Martina, Alessio Burrello | 2024-03-02 | Download | Recently, efficiently deploying deep learning solutions on the edge has received increasing attention. New platforms are emerging to support the increasing demand for flexibility and high performance. |
| Low Complexity Deep Learning Augmented Wireless Channel Estimation for Pilot-Based OFDM on Zynq System on Chip | Animesh Sharma, Syed Asrar Ul Haq, Sumit J. Darak | 2024-03-02 | Download | Channel estimation (CE) is one of the critical signal-processing tasks of the wireless physical layer (PHY). Recent deep learning (DL) based CE have outperformed statistical approaches such as least-s... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Rate-limited Shuffling for Distributed Computing | Shanuja Sasi, Onur Günlü | 2024-03-02 | Download | This paper studies the shuffling phase in a distributed computing model with rate-limited links between nodes. Each node is connected to all other nodes via a noiseless broadcast link with a finite ca... |
| Summary Paper: Use Case on Building Collaborative Safe Autonomous Systems-A Robotdog for Guiding Visually Impaired People | Aman Malhotra, Selma Saidi | 2024-03-02 | Download | This is a summary paper of a use case of a Robotdog dedicated to guide visually impaired people in complex environment like a smart intersection. |
| Defending Against Data Reconstruction Attacks in Federated Learning: An Information Theory Approach | Qi Tan, Qi Li, Yi Zhao, Zhuotao Liu, Xiaobing Guo, Ke Xu | 2024-03-02 | Download | Federated Learning (FL) trains a black-box and high-dimensional model among different clients by exchanging parameters instead of direct data sharing, which mitigates the privacy leak incurred by mach... |
| GSL-LPA: Fast Label Propagation Algorithm (LPA) for Community Detection with no Internally-Disconnected Communities | Subhajit Sahu | 2024-03-02 | Download | Community detection is the problem of identifying tightly connected clusters of nodes within a network. Efficient parallel algorithms for this play a crucial role in various applications, especially a... |
| HeteGen: Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices | Xuanlei Zhao, Bin Jia, Haotian Zhou, Ziming Liu, Shenggan Cheng, Yang You | 2024-03-02 | Download | In recent times, the emergence of Large Language Models (LLMs) has resulted in increasingly larger model size, posing challenges for inference on low-resource devices. |
| LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization | Juntao Zhao, Borui Wan, Yanghua Peng, Haibin Lin, Chuan Wu | 2024-03-02 | Download | Recent breakthroughs in Large-scale language models (LLMs) have demonstrated impressive performance on various tasks. The immense sizes of LLMs have led to very high resource demand and cost for runni... |
| Beyond Inference: Performance Analysis of DNN Server Overheads for Computer Vision | Ahmed F. AbouElhamayed, Susanne Balle, Deshanand Singh, Mohamed S. Abdelfattah | 2024-03-02 | Download | Deep neural network (DNN) inference has become an important part of many data-center workloads. This has prompted focused efforts to design ever-faster deep learning accelerators such as GPUs and TPUs... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Superflows: A New Tool for Forensic Network Flow Analysis | Michael Collins, Jyotirmoy V. Deshmukh, Dristi Dinesh, Mukund Raghothaman, Srivatsan Ravi, Yuan Xia | 2024-03-02 | Download | Network security analysts gather data from diverse sources, from high-level summaries of network flow and traffic volumes to low-level details such as service logs from servers and the contents of ind... |
| Experimental Evaluation of the ETSI DCC Adaptive Approach and Related Algorithms | Oscar Amador, Ignacio Soto, Maria Calderon, Manuel Urueña | 2024-03-02 | Download | Decentralized Congestion Control (DCC) mechanisms have been a core part of protocol stacks for vehicular networks since their inception and standardization. |
| Misconfiguration in O-RAN: Analysis of the impact of AI/ML | Noe Yungaicela-Naula, Vishal Sharma, Sandra Scott-Hayward | 2024-03-02 | Download | User demand on network communication infrastructure has never been greater with applications such as extended reality, holographic telepresence, and wireless brain-computer interfaces challenging curr... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| HeteGen: Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices | Xuanlei Zhao, Bin Jia, Haotian Zhou, Ziming Liu, Shenggan Cheng, Yang You | 2024-03-02 | Download | In recent times, the emergence of Large Language Models (LLMs) has resulted in increasingly larger model size, posing challenges for inference on low-resource devices. |
| GraphMini: Accelerating Graph Pattern Matching Using Auxiliary Graphs | Juelin Liu, Sandeep Polisetty, Hui Guan, Marco Serafini | 2024-03-02 | Download | Graph pattern matching is a fundamental problem encountered by many common graph mining tasks and the basic building block of several graph mining systems. |
