2024-03-01

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing	Guseul Heo, Sangyeop Lee, Jaehong Cho, Hyunmin Choi, Sanghyeon Lee, Hyungkyu Ham, Gwangsun Kim, Divya Mahajan, Jongse Park	2024-03-01	下载	Modern transformer-based Large Language Models (LLMs) are constructed with a series of decoder blocks. Each block comprises three key components: (1) QKV generation, (2) multi-head attention, and (3) ...
Attacking Delay-based PUFs with Minimal Adversary Model	Hongming Fei, Owen Millwood, Prosanta Gope, Jack Miskelly, Biplab Sikdar	2024-03-01	下载	Physically Unclonable Functions (PUFs) provide a streamlined solution for lightweight device authentication. Delay-based Arbiter PUFs, with their ease of implementation and vast challenge space, have ...
SFQ counter-based precomputation for large-scale cryogenic VQE machines	Yosuke Ueno, Satoshi Imamura, Yuna Tomida, Teruo Tanimoto, Masamitsu Tanaka, Yutaka Tabuchi, Koji Inoue, Hiroshi Nakamura	2024-03-01	下载	The variational quantum eigensolver (VQE) is a promising candidate that brings practical benefits from quantum computing. However, the required bandwidth in/out of a cryostat is a limiting factor to s...
FTTN: Feature-Targeted Testing for Numerical Properties of NVIDIA & AMD Matrix Accelerators	Xinyi Li, Ang Li, Bo Fang, Katarzyna Swirydowicz, Ignacio Laguna, Ganesh Gopalakrishnan	2024-03-01	下载	NVIDIA Tensor Cores and AMD Matrix Cores (together called Matrix Accelerators) are of growing interest in high-performance computing and machine learning owing to their high performance.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
A Sufficient Epistemic Condition for Solving Stabilizing Agreement	Giorgio Cignarale, Stephan Felber, Hugo Rincon Galeana	2024-03-01	下载	In this paper we provide a first-ever epistemic formulation of stabilizing agreement, defined as the non-terminating variant of the well established consensus problem.
A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning	Chenghao Lyu, Qi Fan, Philippe Guyard, Yanlei Diao	2024-03-01	下载	As Spark becomes a common big data analytics platform, its growing complexity makes automatic tuning of numerous parameters critical for performance.
Neural Acceleration of Incomplete Cholesky Preconditioners	Joshua Dennis Booth, Hongyang Sun, Trevor Garnett	2024-03-01	下载	The solution of a sparse system of linear equations is ubiquitous in scientific applications. Iterative methods, such as the Preconditioned Conjugate Gradient method (PCG), are normally chosen over di...
Containerization in Multi-Cloud Environment: Roles, Strategies, Challenges, and Solutions for Effective Implementation	Muhammad Waseem, Aakash Ahmad, Peng Liang, Muhammad Azeem Akbar, Arif Ali Khan, Iftikhar Ahmad, Manu Setälä, Tommi Mikkonen	2024-03-01	下载	Containerization in multi-cloud environments has received significant attention in recent years both from academic research and industrial development perspectives.
Are Unikernels Ready for Serverless on the Edge?	Felix Moebius, Tobias Pfandzelter, David Bermbach	2024-03-01	下载	Function-as-a-Service (FaaS) is a promising edge computing execution model but requires secure sandboxing mechanisms to isolate workloads from multiple tenants on constrained infrastructure.
Jiagu: Optimizing Serverless Computing Resource Utilization with Harmonized Efficiency and Practicability	Qingyuan Liu, Yanning Yang, Dong Du, Yubin Xia, Ping Zhang, Jia Feng, James Larus, Haibo Chen	2024-03-01	下载	Current serverless platforms struggle to optimize resource utilization due to their dynamic and fine-grained nature. Conventional techniques like overcommitment and autoscaling fall short, often sacri...
Training Computer Scientists for the Challenges of Hybrid Quantum-Classical Computing	Vincenzo De Maio, Meerzhan Kanatbekova, Felix Zilk, Nicolai Friis, Tobias Guggemos, Ivona Brandic	2024-03-01	下载	As we enter the post-Moore era, we experience the rise of various non-von-Neumann-architectures to address the increasing computational demand for modern applications, with quantum computing being amo...
FedRDMA: Communication-Efficient Cross-Silo Federated LLM via Chunked RDMA Transmission	Zeling Zhang, Dongqi Cai, Yiran Zhang, Mengwei Xu, Shangguang Wang, Ao Zhou	2024-03-01	下载	Communication overhead is a significant bottleneck in federated learning (FL), which has been exaggerated with the increasing size of AI models.
Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation	Liang Luo, Buyun Zhang, Michael Tsang, Yinbin Ma, Ching-Hsiang Chu, Yuxin Chen, Shen Li, Yuchen Hao, Yanli Zhao, Guna Lakshminarayanan, Ellie Dingqiao Wen, Jongsoo Park, Dheevatsa Mudigere, Maxim Naumov	2024-03-01	下载	We study a mismatch between the deep learning recommendation models' flat architecture, common distributed training paradigm and hierarchical data center topology.
WindGP: Efficient Graph Partitioning on Heterogenous Machines	Li Zeng, Haohan Huang, Binfan Zheng, Kang Yang, Shengcheng Shao, Jinhua Zhou, Jun Xie, Rongqian Zhao, Xin Chen	2024-03-01	下载	Graph Partitioning is widely used in many real-world applications such as fraud detection and social network analysis, in order to enable the distributed graph computing on large graphs.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Design and Performance Evaluation of SEANet, a Software-defined Networking Platform for the Internet of Underwater Things	Deniz Unal, Sara Falleni, Kerem Enhos, Emrecan Demirors, Stefano Basagni, Tommaso Melodia	2024-03-01	下载	This paper presents the design and performance evaluation of the SEANet platform, a software-defined acoustic modem designed for enhancing underwater networking and Internet of Underwater Things (IoUT...
Toward Autonomous Cooperation in Heterogeneous Nanosatellite Constellations Using Dynamic Graph Neural Networks	Guillem Casadesus-Vila, Joan-Adria Ruiz-de-Azua, Eduard Alarcon	2024-03-01	下载	The upcoming landscape of Earth Observation missions will defined by networked heterogeneous nanosatellite constellations required to meet strict mission requirements, such as revisit times and spatia...
Exploring Upper-6GHz and mmWave in Real-World 5G Networks: A Direct on-Field Comparison	Marcello Morini, Eugenio Moro, Ilario Filippini, Antonio Capone, Danilo De Donno	2024-03-01	下载	The spectrum crunch challenge poses a vital threat to the progress of cellular networks and recently prompted the inclusion of millimeter wave (mmWave) and Upper 6GHz (U6G) in the 3GPP standards.
Hercules: Heterogeneous Requirements Congestion Control Protocol	Neta Rozen-Schiff, Itzcak Pechtalt, Amit Navon, Leon Bruckman	2024-03-01	下载	Future network services present a significant challenge for network providers due to high number and high variety of co-existing requirements.
Comparative Study of Simulators for Vehicular Networks	Rida Saghir, Thenuka Karunathilake, Anna Förster	2024-03-01	下载	Vehicular Adhoc networks (VANETs) are composed of vehicles connected with wireless links to exchange data. VANETs have become the backbone of the Intelligent Transportation Systems (ITS) in smart citi...
FedRDMA: Communication-Efficient Cross-Silo Federated LLM via Chunked RDMA Transmission	Zeling Zhang, Dongqi Cai, Yiran Zhang, Mengwei Xu, Shangguang Wang, Ao Zhou	2024-03-01	下载	Communication overhead is a significant bottleneck in federated learning (FL), which has been exaggerated with the increasing size of AI models.
Yodel: A Layer 3.5 Name-Based Multicast Network Architecture For The Future Internet	Morteza Moghaddassian, Alberto Leon-Garcia	2024-03-01	下载	Multicasting refers to the ability of transmitting data to multiple recipients without data sources needing to provide more than one copy of the data to the network.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
An Experimental Study of Low-Latency Video Streaming over 5G	Imran Khan, Tuyen X. Tran, Matti Hiltunen, Theodore Karagioules, Dimitrios Koutsonikolas	2024-03-01	下载	Low-latency video streaming over 5G has become rapidly popular over the last few years due to its increased usage in hosting virtual events, online education, webinars, and all-hands meetings.