Skip to content

2024-05-25

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
FPsPIN: An FPGA-based Open-Hardware Research Platform for Processing in the NetworkTimo Schneider, Pengcheng Xu, Torsten Hoefler2024-05-25下载In the era of post-Moore computing, network offload emerges as a solution to two challenges: the imperative for low-latency communication and the push towards hardware specialisation.
Analyzing the Attack Surface and Threats of Industrial Internet of Things DevicesSimon Liebl, Leah Lathrop, Ulrich Raithel, Andreas Aßmuth, Ian Ferguson, Matthias Söllner2024-05-25下载The growing connectivity of industrial devices as a result of the Internet of Things is increasing the risks to Industrial Control Systems. Since attacks on such devices can also cause damage to peopl...
Achieving Observability on Fog Computing with the use of open-source toolsBreno Costa, Abhik Banerjee, Prem Prakash Jayaraman, Leonardo R. Carvalho, João Bachiega, Aleteia Araujo2024-05-25下载Fog computing can provide computational resources and low-latency communication at the network edge. But with it comes uncertainties that must be managed in order to guarantee Service Level Agreements...
TURNIP: A "Nondeterministic" GPU Runtime with CPU RAM OffloadZhimin Ding, Jiawen Yao, Brianna Barrow, Tania Lorido Botran, Christopher Jermaine, Yuxin Tang, Jiehui Li, Xinyu Yao, Sleem Mahmoud Abdelghafar, Daniel Bourgeois2024-05-25下载An obvious way to alleviate memory difficulties in GPU-based AI computing is via CPU offload, where data are moved between GPU and CPU RAM, so inexpensive CPU RAM is used to increase the amount of sto...
HETHUB: A Distributed Training System with Heterogeneous Cluster for Large-Scale ModelsSi Xu, Zixiao Huang, Yan Zeng, Shengen Yan, Xuefei Ning, Quanlu Zhang, Haolin Ye, Sipei Gu, Chunsheng Shui, Zhezheng Lin, Hao Zhang, Sheng Wang, Guohao Dai, Yu Wang2024-05-25下载Training large-scale models relies on a vast number of computing resources. For example, training the GPT-4 model (1.8 trillion parameters) requires 25000 A100 GPUs .
Boolean Matrix Multiplication for Highly Clustered Data on the Congested CliqueAndrzej Lingas2024-05-25下载We present a protocol for the Boolean matrix product of two n×bn\times b Boolean matrices on the congested clique designed for the situation when the rows of the first matrix or the columns of the seco...
An Experimental Study of Different Aggregation Schemes in Semi-Asynchronous Federated LearningYunbo Li, Jiaping Gui, Yue Wu2024-05-25下载Federated learning is highly valued due to its high-performance computing in distributed environments while safeguarding data privacy. To address resource heterogeneity, researchers have proposed a se...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
FPsPIN: An FPGA-based Open-Hardware Research Platform for Processing in the NetworkTimo Schneider, Pengcheng Xu, Torsten Hoefler2024-05-25下载In the era of post-Moore computing, network offload emerges as a solution to two challenges: the imperative for low-latency communication and the push towards hardware specialisation.
A Simulation Study of Source Routing for Load Balancing in Software-Defined Satellite NetworksF. Bergamini2024-05-25下载In the next generation network, the satellite network will play a fundamental role, in overcoming the limitation of the terrestrial network. Nonetheless, the satellite-terrestrial network integration ...

cs.PF - Performance

标题作者发布日期PDF摘要
FPsPIN: An FPGA-based Open-Hardware Research Platform for Processing in the NetworkTimo Schneider, Pengcheng Xu, Torsten Hoefler2024-05-25下载In the era of post-Moore computing, network offload emerges as a solution to two challenges: the imperative for low-latency communication and the push towards hardware specialisation.
An Experimental Study of Different Aggregation Schemes in Semi-Asynchronous Federated LearningYunbo Li, Jiaping Gui, Yue Wu2024-05-25下载Federated learning is highly valued due to its high-performance computing in distributed environments while safeguarding data privacy. To address resource heterogeneity, researchers have proposed a se...

基于 VitePress 构建