Skip to content

2024-06-16

cs.AR - Architecture

标题作者发布日期PDF摘要
Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime RequantizationJungi Lee, Wonbeom Lee, Jaewoong Sim2024-06-16下载Large language models (LLMs) demonstrate outstanding performance in various tasks in machine learning and have thus become one of the most important workloads in today's computing landscape.
Optimization of Armv9 architecture general large language model inference performance based on Llama.cppLonghao Chen, Yina Zhao, Qiangjun Xie, Qinghua Sheng2024-06-16下载This article optimizes the inference performance of the Qwen-1.8B model by performing Int8 quantization, vectorizing some operators in llama.cpp, and modifying the compilation script to improve the co...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete ModalityLiwei Che, Jiaqi Wang, Xinyue Liu, Fenglong Ma2024-06-16下载Federated learning (FL) has obtained tremendous progress in providing collaborative training solutions for distributed data silos with privacy guarantees.
M-SET: Multi-Drone Swarm Intelligence Experimentation with Collision Avoidance RealismChuhao Qin, Alexander Robins, Callum Lillywhite-Roake, Adam Pearce, Hritik Mehta, Scott James, Tsz Ho Wong, Evangelos Pournaras2024-06-16下载Distributed sensing by cooperative drone swarms is crucial for several Smart City applications, such as traffic monitoring and disaster response.
Linkage on Security, Privacy and Fairness in Federated Learning: New Balances and New PerspectivesLinlin Wang, Tianqing Zhu, Wanlei Zhou, Philip S. Yu2024-06-16下载Federated learning is fast becoming a popular paradigm for applications involving mobile devices, banking systems, healthcare, and IoT systems.
Knowledge Distillation in Federated Learning: a Survey on Long Lasting Challenges and New SolutionsLaiqiao Qin, Tianqing Zhu, Wanlei Zhou, Philip S. Yu2024-06-16下载Federated Learning (FL) is a distributed and privacy-preserving machine learning paradigm that coordinates multiple clients to train a model while keeping the raw data localized.
Design and Optimization of Hierarchical Gradient Coding for Distributed Learning at Edge DevicesWeiheng Tang, Jingyi Li, Lin Chen, Xu Chen2024-06-16下载Edge computing has recently emerged as a promising paradigm to boost the performance of distributed learning by leveraging the distributed resources at edge nodes.
Federated Learning Optimization: A Comparative Study of Data and Model Exchange Strategies in Dynamic NetworksAlka Luqman, Yeow Wei Liang Brandon, Anupam Chattopadhyay2024-06-16下载The promise and proliferation of large-scale dynamic federated learning gives rise to a prominent open question - is it prudent to share data or model across nodes, if efficiency of transmission and f...
PWDFT-SW: Extending the Limit of Plane-Wave DFT Calculations to 16K Atoms on the New Sunway SupercomputerQingcai Jiang, Zhenwei Cao, Junshi Chen, Xinming Qin, Wei Hu, Hong An, Jinlong Yang2024-06-16下载First-principles density functional theory (DFT) with plane wave (PW) basis set is the most widely used method in quantum mechanical material simulations due to its advantages in accuracy and universa...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Make Your Home Safe: Time-aware Unsupervised User Behavior Anomaly Detection in Smart Homes via Loss-guided MaskJingyu Xiao, Zhiyao Xu, Qingsong Zou, Qing Li, Dan Zhao, Dong Fang, Ruoyu Li, Wenxin Tang, Kang Li, Xudong Zuo, Penghui Hu, Yong Jiang, Zixuan Weng, Michael R. Lyv2024-06-16下载Smart homes, powered by the Internet of Things, offer great convenience but also pose security concerns due to abnormal behaviors, such as improper operations of users and potential attacks from malic...
LEO Satellite Networks Assisted Geo-distributed Data ProcessingZhiyuan Zhao, Zhe Chen, Zheng Lin, Wenjun Zhu, Kun Qiu, Chaoqun You, Yue Gao2024-06-16下载Nowadays, the increasing deployment of edge clouds globally provides users with low-latency services. However, connecting an edge cloud to a core cloud via optic cables in terrestrial networks poses s...
Design and Optimization of Hierarchical Gradient Coding for Distributed Learning at Edge DevicesWeiheng Tang, Jingyi Li, Lin Chen, Xu Chen2024-06-16下载Edge computing has recently emerged as a promising paradigm to boost the performance of distributed learning by leveraging the distributed resources at edge nodes.

cs.PF - Performance

标题作者发布日期PDF摘要
Optimization of Armv9 architecture general large language model inference performance based on Llama.cppLonghao Chen, Yina Zhao, Qiangjun Xie, Qinghua Sheng2024-06-16下载This article optimizes the inference performance of the Qwen-1.8B model by performing Int8 quantization, vectorizing some operators in llama.cpp, and modifying the compilation script to improve the co...

基于 VitePress 构建