2024-03-04
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| On Latency Predictors for Neural Architecture Search | Yash Akhauri, Mohamed S. Abdelfattah | 2024-03-04 | Download | Efficient deployment of neural networks (NN) requires the co-optimization of accuracy and latency. For example, hardware-aware neural architecture search has been used to automatically find NN archite... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| MPI Errors Detection using GNN Embedding and Vector Embedding over LLVM IR | Jad El Karchi, Hanze Chen, Ali TehraniJamsaz, Ali Jannesari, Mihail Popov, Emmanuelle Saillard | 2024-03-04 | Download | Identifying errors in parallel MPI programs is a challenging task. Despite the growing number of verification tools, debugging parallel programs remains a significant challenge. |
| Hybrid quantum programming with PennyLane Lightning on HPC platforms | Ali Asadi, Amintor Dusko, Chae-Yeun Park, Vincent Michaud-Rioux, Isidor Schoch, Shuli Shu, Trevor Vincent, Lee James O'Riordan | 2024-03-04 | Download | We introduce PennyLane's Lightning suite, a collection of high-performance state-vector simulators targeting CPU, GPU, and HPC-native architectures and workloads. |
| A Survey on Federated Unlearning: Challenges and Opportunities | Hyejun Jeong, Shiqing Ma, Amir Houmansadr | 2024-03-04 | Download | Federated learning (FL), introduced in 2017, facilitates collaborative learning between non-trusting parties with no need for the parties to explicitly share their data among themselves. |
| Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve | Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Alexey Tumanov, Ramachandran Ramjee | 2024-03-04 | Download | Each LLM serving request goes through two phases. The first is prefill which processes the entire input prompt and produces the first output token and the second is decode which generates the rest of ... |
| Quantum Computing: Vision and Challenges | Sukhpal Singh Gill, Oktay Cetinkaya, Stefano Marrone, Daniel Claudino, David Haunschild, Leon Schlote, Huaming Wu, Carlo Ottaviani, Xiaoyuan Liu, Sree Pragna Machupalli, Kamalpreet Kaur, Priyansh Arora, Ji Liu, Ahmed Farouk, Houbing Herbert Song, Steve Uhlig, Kotagiri Ramamohanarao | 2024-03-04 | Download | The recent development of quantum computing, which uses entanglement, superposition, and other quantum fundamental concepts, can provide substantial processing advantages over traditional computing. |
| Demeter: Resource-Efficient Distributed Stream Processing under Dynamic Loads with Multi-Configuration Optimization | Morgan Geldenhuys, Dominik Scheinert, Odej Kao, Lauritz Thamsen | 2024-03-04 | Download | Distributed Stream Processing (DSP) focuses on the near real-time processing of large streams of unbounded data. To increase processing capacities, DSP systems are able to dynamically scale across a c... |
| Daedalus: Self-Adaptive Horizontal Autoscaling for Resource Efficiency of Distributed Stream Processing Systems | Benjamin J. J. Pfister, Dominik Scheinert, Morgan K. Geldenhuys, Odej Kao | 2024-03-04 | Download | Distributed Stream Processing (DSP) systems are capable of processing large streams of unbounded data, offering high throughput and low latencies. |
| Inference Acceleration for Large Language Models on CPUs | Ditto PS, Jithin VG, Adarsh MS | 2024-03-04 | Download | In recent years, large language models have demonstrated remarkable performance across various natural language processing (NLP) tasks. However, deploying these models for real-world applications ofte... |
| Online Locality Meets Distributed Quantum Computing | Amirreza Akbari, Xavier Coiteux-Roy, Francesco d'Amore, François Le Gall, Henrik Lievonen, Darya Melnyk, Augusto Modanese, Shreyas Pai, Marc-Olivier Renou, Václav Rozhoň, Jukka Suomela | 2024-03-04 | Download | We connect three distinct lines of research that have recently explored extensions of the classical LOCAL model of distributed computing: A. distributed quantum computing and non-signaling distributio... |
| DéjàVu: KV-cache Streaming for Fast, Fault-tolerant Generative LLM Serving | Foteini Strati, Sara Mcallister, Amar Phanishayee, Jakub Tarnawski, Ana Klimovic | 2024-03-04 | Download | Distributed LLM serving is costly and often underutilizes hardware accelerators due to three key challenges: bubbles in pipeline-parallel deployments caused by the bimodal latency of prompt and token ... |
| Graph neural network for in-network placement of real-time metaverse tasks in next-generation network | Sulaiman Muhammad Rashid, Ibrahim Aliyu, Il-Kwon Jeong, Tai-Won Um, Jinsul Kim | 2024-03-04 | Download | This study addresses the challenge of real-time metaverse applications by proposing an in-network placement and task-offloading solution for delay-constrained computing tasks in next-generation networ... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Magnetic Localization for In-Body Nano-Communication Medical Systems | Krzysztof Skos, Albert Diez Comas, Josep Miquel Jornet, Pawel Kulakowski | 2024-03-04 | Download | Nano-machines circulating inside the human body, collecting data on tissue conditions, represent a vital part of next-generation medical diagnostic systems. |
| Software-defined optical networking applications enabled by programmable integrated photonics | Zhenyun Xie, David Sánchez-Jácome, Luis Torrijos-Morán, Daniel Pérez-López | 2024-03-04 | Download | Data center networks are experiencing unprecedented exponential growth, mostly driven by the continuous computing demands in machine learning and artificial intelligence algorithms. |
| Probabilistic Fault-Tolerant Robust Traffic Grooming in OTN-over-DWDM Networks | Dimitrios Michael Manias, Joe Naoum-Sawaya, Abbas Javadtalab, Abdallah Shami | 2024-03-04 | Download | The development of next-generation networks is revolutionizing network operators' management and orchestration practices worldwide. The critical services supported by these networks require increasing... |
| Towards Intent-Based Network Management: Large Language Models for Intent Extraction in 5G Core Networks | Dimitrios Michael Manias, Ali Chouman, Abdallah Shami | 2024-03-04 | Download | The integration of Machine Learning and Artificial Intelligence (ML/AI) into fifth-generation (5G) networks has made evident the limitations of network intelligence with ever-increasing, strenuous req... |
| I DPID It My Way! A Covert Timing Channel in Software-Defined Networks | Robert Krösche, Kashyap Thimmaraju, Liron Schiff, Stefan Schmid | 2024-03-04 | Download | Software-defined networking is considered a promising new paradigm, enabling more reliable and formally verifiable communication networks. However, this paper shows that the separation of the control ... |
| MTS: Bringing Multi-Tenancy to Virtual Networking | Kashyap Thimmaraju, Saad Hermak, Gábor Rétvári, Stefan Schmid | 2024-03-04 | Download | Multi-tenant cloud computing provides great benefits in terms of resource sharing, elastic pricing, and scalability, however, it also changes the security landscape and introduces the need for strong ... |
| Towards Fair and Efficient Learning-based Congestion Control | Xudong Liao, Han Tian, Chaoliang Zeng, Xinchen Wan, Kai Chen | 2024-03-04 | Download | Recent years have witnessed a plethora of learning-based solutions for congestion control (CC) that demonstrate better performance over traditional TCP schemes. |
| Graph neural network for in-network placement of real-time metaverse tasks in next-generation network | Sulaiman Muhammad Rashid, Ibrahim Aliyu, Il-Kwon Jeong, Tai-Won Um, Jinsul Kim | 2024-03-04 | Download | This study addresses the challenge of real-time metaverse applications by proposing an in-network placement and task-offloading solution for delay-constrained computing tasks in next-generation networ... |
| Towards Memory-Efficient Traffic Policing in Time-Sensitive Networking | Xuyan Jiang, Xiangrui Yang, Tongqing Zhou, Wenwen Fu, Wei Quan, Yihao Jiao, Yinhan Sun, Zhigang Sun | 2024-03-04 | Download | Time-Sensitive Networking (TSN) is an emerging real-time Ethernet technology that provides deterministic communication for time-critical traffic. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| On Latency Predictors for Neural Architecture Search | Yash Akhauri, Mohamed S. Abdelfattah | 2024-03-04 | Download | Efficient deployment of neural networks (NN) requires the co-optimization of accuracy and latency. For example, hardware-aware neural architecture search has been used to automatically find NN archite... |