2024-03-04

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| On Latency Predictors for Neural Architecture Search | Yash Akhauri, Mohamed S. Abdelfattah | 2024-03-04 | Download | Efficient deployment of neural networks (NN) requires the co-optimization of accuracy and latency. For example, hardware-aware neural architecture search has been used to automatically find NN archite... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| MPI Errors Detection using GNN Embedding and Vector Embedding over LLVM IR | Jad El Karchi, Hanze Chen, Ali TehraniJamsaz, Ali Jannesari, Mihail Popov, Emmanuelle Saillard | 2024-03-04 | Download | Identifying errors in parallel MPI programs is a challenging task. Despite the growing number of verification tools, debugging parallel programs remains a significant challenge. |
| Hybrid quantum programming with PennyLane Lightning on HPC platforms | Ali Asadi, Amintor Dusko, Chae-Yeun Park, Vincent Michaud-Rioux, Isidor Schoch, Shuli Shu, Trevor Vincent, Lee James O'Riordan | 2024-03-04 | Download | We introduce PennyLane's Lightning suite, a collection of high-performance state-vector simulators targeting CPU, GPU, and HPC-native architectures and workloads. |
| A Survey on Federated Unlearning: Challenges and Opportunities | Hyejun Jeong, Shiqing Ma, Amir Houmansadr | 2024-03-04 | Download | Federated learning (FL), introduced in 2017, facilitates collaborative learning between non-trusting parties with no need for the parties to explicitly share their data among themselves. |
| Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve | Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Alexey Tumanov, Ramachandran Ramjee | 2024-03-04 | Download | Each LLM serving request goes through two phases. The first is prefill which processes the entire input prompt and produces the first output token and the second is decode which generates the rest of ... |
| Quantum Computing: Vision and Challenges | Sukhpal Singh Gill, Oktay Cetinkaya, Stefano Marrone, Daniel Claudino, David Haunschild, Leon Schlote, Huaming Wu, Carlo Ottaviani, Xiaoyuan Liu, Sree Pragna Machupalli, Kamalpreet Kaur, Priyansh Arora, Ji Liu, Ahmed Farouk, Houbing Herbert Song, Steve Uhlig, Kotagiri Ramamohanarao | 2024-03-04 | Download | The recent development of quantum computing, which uses entanglement, superposition, and other quantum fundamental concepts, can provide substantial processing advantages over traditional computing. |
| Demeter: Resource-Efficient Distributed Stream Processing under Dynamic Loads with Multi-Configuration Optimization | Morgan Geldenhuys, Dominik Scheinert, Odej Kao, Lauritz Thamsen | 2024-03-04 | Download | Distributed Stream Processing (DSP) focuses on the near real-time processing of large streams of unbounded data. To increase processing capacities, DSP systems are able to dynamically scale across a c... |
| Daedalus: Self-Adaptive Horizontal Autoscaling for Resource Efficiency of Distributed Stream Processing Systems | Benjamin J. J. Pfister, Dominik Scheinert, Morgan K. Geldenhuys, Odej Kao | 2024-03-04 | Download | Distributed Stream Processing (DSP) systems are capable of processing large streams of unbounded data, offering high throughput and low latencies. |
| Inference Acceleration for Large Language Models on CPUs | Ditto PS, Jithin VG, Adarsh MS | 2024-03-04 | Download | In recent years, large language models have demonstrated remarkable performance across various natural language processing (NLP) tasks. However, deploying these models for real-world applications ofte... |
| Online Locality Meets Distributed Quantum Computing | Amirreza Akbari, Xavier Coiteux-Roy, Francesco d'Amore, François Le Gall, Henrik Lievonen, Darya Melnyk, Augusto Modanese, Shreyas Pai, Marc-Olivier Renou, Václav Rozhoň, Jukka Suomela | 2024-03-04 | Download | We connect three distinct lines of research that have recently explored extensions of the classical LOCAL model of distributed computing: A. distributed quantum computing and non-signaling distributio... |
| DéjàVu: KV-cache Streaming for Fast, Fault-tolerant Generative LLM Serving | Foteini Strati, Sara Mcallister, Amar Phanishayee, Jakub Tarnawski, Ana Klimovic | 2024-03-04 | Download | Distributed LLM serving is costly and often underutilizes hardware accelerators due to three key challenges: bubbles in pipeline-parallel deployments caused by the bimodal latency of prompt and token ... |
| Graph neural network for in-network placement of real-time metaverse tasks in next-generation network | Sulaiman Muhammad Rashid, Ibrahim Aliyu, Il-Kwon Jeong, Tai-Won Um, Jinsul Kim | 2024-03-04 | Download | This study addresses the challenge of real-time metaverse applications by proposing an in-network placement and task-offloading solution for delay-constrained computing tasks in next-generation networ... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Magnetic Localization for In-Body Nano-Communication Medical Systems | Krzysztof Skos, Albert Diez Comas, Josep Miquel Jornet, Pawel Kulakowski | 2024-03-04 | Download | Nano-machines circulating inside the human body, collecting data on tissue conditions, represent a vital part of next-generation medical diagnostic systems. |
| Software-defined optical networking applications enabled by programmable integrated photonics | Zhenyun Xie, David Sánchez-Jácome, Luis Torrijos-Morán, Daniel Pérez-López | 2024-03-04 | Download | Data center networks are experiencing unprecedented exponential growth, mostly driven by the continuous computing demands in machine learning and artificial intelligence algorithms. |
| Probabilistic Fault-Tolerant Robust Traffic Grooming in OTN-over-DWDM Networks | Dimitrios Michael Manias, Joe Naoum-Sawaya, Abbas Javadtalab, Abdallah Shami | 2024-03-04 | Download | The development of next-generation networks is revolutionizing network operators' management and orchestration practices worldwide. The critical services supported by these networks require increasing... |
| Towards Intent-Based Network Management: Large Language Models for Intent Extraction in 5G Core Networks | Dimitrios Michael Manias, Ali Chouman, Abdallah Shami | 2024-03-04 | Download | The integration of Machine Learning and Artificial Intelligence (ML/AI) into fifth-generation (5G) networks has made evident the limitations of network intelligence with ever-increasing, strenuous req... |
| I DPID It My Way! A Covert Timing Channel in Software-Defined Networks | Robert Krösche, Kashyap Thimmaraju, Liron Schiff, Stefan Schmid | 2024-03-04 | Download | Software-defined networking is considered a promising new paradigm, enabling more reliable and formally verifiable communication networks. However, this paper shows that the separation of the control ... |
| MTS: Bringing Multi-Tenancy to Virtual Networking | Kashyap Thimmaraju, Saad Hermak, Gábor Rétvári, Stefan Schmid | 2024-03-04 | Download | Multi-tenant cloud computing provides great benefits in terms of resource sharing, elastic pricing, and scalability, however, it also changes the security landscape and introduces the need for strong ... |
| Towards Fair and Efficient Learning-based Congestion Control | Xudong Liao, Han Tian, Chaoliang Zeng, Xinchen Wan, Kai Chen | 2024-03-04 | Download | Recent years have witnessed a plethora of learning-based solutions for congestion control (CC) that demonstrate better performance over traditional TCP schemes. |
| Graph neural network for in-network placement of real-time metaverse tasks in next-generation network | Sulaiman Muhammad Rashid, Ibrahim Aliyu, Il-Kwon Jeong, Tai-Won Um, Jinsul Kim | 2024-03-04 | Download | This study addresses the challenge of real-time metaverse applications by proposing an in-network placement and task-offloading solution for delay-constrained computing tasks in next-generation networ... |
| Towards Memory-Efficient Traffic Policing in Time-Sensitive Networking | Xuyan Jiang, Xiangrui Yang, Tongqing Zhou, Wenwen Fu, Wei Quan, Yihao Jiao, Yinhan Sun, Zhigang Sun | 2024-03-04 | Download | Time-Sensitive Networking (TSN) is an emerging real-time Ethernet technology that provides deterministic communication for time-critical traffic. |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| On Latency Predictors for Neural Architecture Search | Yash Akhauri, Mohamed S. Abdelfattah | 2024-03-04 | Download | Efficient deployment of neural networks (NN) requires the co-optimization of accuracy and latency. For example, hardware-aware neural architecture search has been used to automatically find NN archite... |
