2024-03-04

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
On Latency Predictors for Neural Architecture Search	Yash Akhauri, Mohamed S. Abdelfattah	2024-03-04	下载	Efficient deployment of neural networks (NN) requires the co-optimization of accuracy and latency. For example, hardware-aware neural architecture search has been used to automatically find NN archite...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
MPI Errors Detection using GNN Embedding and Vector Embedding over LLVM IR	Jad El Karchi, Hanze Chen, Ali TehraniJamsaz, Ali Jannesari, Mihail Popov, Emmanuelle Saillard	2024-03-04	下载	Identifying errors in parallel MPI programs is a challenging task. Despite the growing number of verification tools, debugging parallel programs remains a significant challenge.
Hybrid quantum programming with PennyLane Lightning on HPC platforms	Ali Asadi, Amintor Dusko, Chae-Yeun Park, Vincent Michaud-Rioux, Isidor Schoch, Shuli Shu, Trevor Vincent, Lee James O'Riordan	2024-03-04	下载	We introduce PennyLane's Lightning suite, a collection of high-performance state-vector simulators targeting CPU, GPU, and HPC-native architectures and workloads.
A Survey on Federated Unlearning: Challenges and Opportunities	Hyejun Jeong, Shiqing Ma, Amir Houmansadr	2024-03-04	下载	Federated learning (FL), introduced in 2017, facilitates collaborative learning between non-trusting parties with no need for the parties to explicitly share their data among themselves.
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve	Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Alexey Tumanov, Ramachandran Ramjee	2024-03-04	下载	Each LLM serving request goes through two phases. The first is prefill which processes the entire input prompt and produces the first output token and the second is decode which generates the rest of ...
Quantum Computing: Vision and Challenges	Sukhpal Singh Gill, Oktay Cetinkaya, Stefano Marrone, Daniel Claudino, David Haunschild, Leon Schlote, Huaming Wu, Carlo Ottaviani, Xiaoyuan Liu, Sree Pragna Machupalli, Kamalpreet Kaur, Priyansh Arora, Ji Liu, Ahmed Farouk, Houbing Herbert Song, Steve Uhlig, Kotagiri Ramamohanarao	2024-03-04	下载	The recent development of quantum computing, which uses entanglement, superposition, and other quantum fundamental concepts, can provide substantial processing advantages over traditional computing.
Demeter: Resource-Efficient Distributed Stream Processing under Dynamic Loads with Multi-Configuration Optimization	Morgan Geldenhuys, Dominik Scheinert, Odej Kao, Lauritz Thamsen	2024-03-04	下载	Distributed Stream Processing (DSP) focuses on the near real-time processing of large streams of unbounded data. To increase processing capacities, DSP systems are able to dynamically scale across a c...
Daedalus: Self-Adaptive Horizontal Autoscaling for Resource Efficiency of Distributed Stream Processing Systems	Benjamin J. J. Pfister, Dominik Scheinert, Morgan K. Geldenhuys, Odej Kao	2024-03-04	下载	Distributed Stream Processing (DSP) systems are capable of processing large streams of unbounded data, offering high throughput and low latencies.
Inference Acceleration for Large Language Models on CPUs	Ditto PS, Jithin VG, Adarsh MS	2024-03-04	下载	In recent years, large language models have demonstrated remarkable performance across various natural language processing (NLP) tasks. However, deploying these models for real-world applications ofte...
Online Locality Meets Distributed Quantum Computing	Amirreza Akbari, Xavier Coiteux-Roy, Francesco d'Amore, François Le Gall, Henrik Lievonen, Darya Melnyk, Augusto Modanese, Shreyas Pai, Marc-Olivier Renou, Václav Rozhoň, Jukka Suomela	2024-03-04	下载	We connect three distinct lines of research that have recently explored extensions of the classical LOCAL model of distributed computing: A. distributed quantum computing and non-signaling distributio...
DéjàVu: KV-cache Streaming for Fast, Fault-tolerant Generative LLM Serving	Foteini Strati, Sara Mcallister, Amar Phanishayee, Jakub Tarnawski, Ana Klimovic	2024-03-04	下载	Distributed LLM serving is costly and often underutilizes hardware accelerators due to three key challenges: bubbles in pipeline-parallel deployments caused by the bimodal latency of prompt and token ...
Graph neural network for in-network placement of real-time metaverse tasks in next-generation network	Sulaiman Muhammad Rashid, Ibrahim Aliyu, Il-Kwon Jeong, Tai-Won Um, Jinsul Kim	2024-03-04	下载	This study addresses the challenge of real-time metaverse applications by proposing an in-network placement and task-offloading solution for delay-constrained computing tasks in next-generation networ...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Magnetic Localization for In-Body Nano-Communication Medical Systems	Krzysztof Skos, Albert Diez Comas, Josep Miquel Jornet, Pawel Kulakowski	2024-03-04	下载	Nano-machines circulating inside the human body, collecting data on tissue conditions, represent a vital part of next-generation medical diagnostic systems.
Software-defined optical networking applications enabled by programmable integrated photonics	Zhenyun Xie, David Sánchez-Jácome, Luis Torrijos-Morán, Daniel Pérez-López	2024-03-04	下载	Data center networks are experiencing unprecedented exponential growth, mostly driven by the continuous computing demands in machine learning and artificial intelligence algorithms.
Probabilistic Fault-Tolerant Robust Traffic Grooming in OTN-over-DWDM Networks	Dimitrios Michael Manias, Joe Naoum-Sawaya, Abbas Javadtalab, Abdallah Shami	2024-03-04	下载	The development of next-generation networks is revolutionizing network operators' management and orchestration practices worldwide. The critical services supported by these networks require increasing...
Towards Intent-Based Network Management: Large Language Models for Intent Extraction in 5G Core Networks	Dimitrios Michael Manias, Ali Chouman, Abdallah Shami	2024-03-04	下载	The integration of Machine Learning and Artificial Intelligence (ML/AI) into fifth-generation (5G) networks has made evident the limitations of network intelligence with ever-increasing, strenuous req...
I DPID It My Way! A Covert Timing Channel in Software-Defined Networks	Robert Krösche, Kashyap Thimmaraju, Liron Schiff, Stefan Schmid	2024-03-04	下载	Software-defined networking is considered a promising new paradigm, enabling more reliable and formally verifiable communication networks. However, this paper shows that the separation of the control ...
MTS: Bringing Multi-Tenancy to Virtual Networking	Kashyap Thimmaraju, Saad Hermak, Gábor Rétvári, Stefan Schmid	2024-03-04	下载	Multi-tenant cloud computing provides great benefits in terms of resource sharing, elastic pricing, and scalability, however, it also changes the security landscape and introduces the need for strong ...
Towards Fair and Efficient Learning-based Congestion Control	Xudong Liao, Han Tian, Chaoliang Zeng, Xinchen Wan, Kai Chen	2024-03-04	下载	Recent years have witnessed a plethora of learning-based solutions for congestion control (CC) that demonstrate better performance over traditional TCP schemes.
Graph neural network for in-network placement of real-time metaverse tasks in next-generation network	Sulaiman Muhammad Rashid, Ibrahim Aliyu, Il-Kwon Jeong, Tai-Won Um, Jinsul Kim	2024-03-04	下载	This study addresses the challenge of real-time metaverse applications by proposing an in-network placement and task-offloading solution for delay-constrained computing tasks in next-generation networ...
Towards Memory-Efficient Traffic Policing in Time-Sensitive Networking	Xuyan Jiang, Xiangrui Yang, Tongqing Zhou, Wenwen Fu, Wei Quan, Yihao Jiao, Yinhan Sun, Zhigang Sun	2024-03-04	下载	Time-Sensitive Networking (TSN) is an emerging real-time Ethernet technology that provides deterministic communication for time-critical traffic.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
On Latency Predictors for Neural Architecture Search	Yash Akhauri, Mohamed S. Abdelfattah	2024-03-04	下载	Efficient deployment of neural networks (NN) requires the co-optimization of accuracy and latency. For example, hardware-aware neural architecture search has been used to automatically find NN archite...