2024-01-23
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Full-Stack Optimization for CAM-Only DNN Inference | João Paulo C. de Lima, Asif Ali Khan, Luigi Carro, Jeronimo Castrillon | 2024-01-23 | Download | The accuracy of neural networks has greatly improved across various domains over the past years. Their ever-increasing complexity, however, leads to prohibitively high energy demands and latency in vo... |
| CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators | Songyun Qu, Shixin Zhao, Bing Li, Yintao He, Xuyi Cai, Lei Zhang, Ying Wang | 2024-01-23 | Download | In recent years, various computing-in-memory (CIM) processors have been presented, showing superior performance over traditional architectures. |
| Enhancing Reliability of Neural Networks at the Edge: Inverted Normalization with Stochastic Affine Transformations | Soyed Tuhin Ahmed, Kamal Danouchi, Guillaume Prenat, Lorena Anghel, Mehdi B. Tahoori | 2024-01-23 | Download | Bayesian Neural Networks (BayNNs) naturally provide uncertainty in their predictions, making them a suitable choice in safety-critical applications. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Automated Programmatic Performance Analysis of Parallel Programs | Onur Cankur, Aditya Tomar, Daniel Nichols, Connor Scully-Allison, Katherine E. Isaacs, Abhinav Bhatele | 2024-01-23 | Download | Developing efficient parallel applications is critical to advancing scientific development but requires significant performance analysis and optimization. |
| Deterministic Collision-Free Exploration of Unknown Anonymous Graphs | Subhash Bhagat, Andrzej Pelc | 2024-01-23 | Download | We consider the fundamental task of network exploration. A network is modeled as a simple connected undirected n-node graph with unlabeled nodes, and all ports at any node of degree d are arbitrarily ... |
| COREC: Concurrent Non-Blocking Single-Queue Receive Driver for Low Latency Networking | Marco Faltelli, Giacomo Belocchi, Francesco Quaglia, Giuseppe Bianchi | 2024-01-23 | Download | Existing network stacks tackle performance and scalability aspects by relying on multiple receive queues. However, at software level, each queue is processed by a single thread, which prevents simulta... |
| Towards Privacy-, Budget-, and Deadline-Aware Service Optimization for Large Medical Image Processing across Hybrid Clouds | Yuandou Wang, Neel Kanwal, Kjersti Engan, Chunming Rong, Paola Grosso, Zhiming Zhao | 2024-01-23 | Download | Efficiently processing medical images, such as whole slide images in digital pathology, is essential for timely diagnosing high-risk diseases. |
| Can Large Language Models Write Parallel Code? | Daniel Nichols, Joshua H. Davis, Zhaojun Xie, Arjun Rajaram, Abhinav Bhatele | 2024-01-23 | Download | Large language models are increasingly becoming a popular tool for software development. Their ability to model and generate source code has been demonstrated in a variety of contexts, including code ... |
| Utilizing Graph Sparsification for Pre-processing in Maxcut QUBO Solver | Vorapong Suppakitpaisarn, Jin-Kao Hao | 2024-01-23 | Download | We suggest employing graph sparsification as a pre-processing step for maxcut programs using the QUBO solver. Quantum(-inspired) algorithms are recognized for their potential efficiency in handling qu... |
| Secure Federated Learning Approaches to Diagnosing COVID-19 | Rittika Adhikari, Christopher Settles | 2024-01-23 | Download | The recent pandemic has underscored the importance of accurately diagnosing COVID-19 in hospital settings. A major challenge in this regard is differentiating COVID-19 from other respiratory illnesses... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Minimizing the Age of Two Heterogeneous Sources With Packet Drops Via Cyclic Schedulers | Sahan Liyanaarachchi, Sennur Ulukus, Nail Akar | 2024-01-23 | Download | In a communication setting where multiple sources share a single channel to provide status updates to a remote monitor, source transmissions need to be scheduled appropriately to maintain timely commu... |
| Eloquent: A More Robust Transmission Scheme for LLM Token Streaming | Hanchen Li, Yuhan Liu, Yihua Cheng, Siddhant Ray, Kuntai Du, Junchen Jiang | 2024-01-23 | Download | To render each generated token in real-time for users, the Large Language Model (LLM) server generates tokens one by one and streams each token (or group of a few tokens) through the network to the us... |
| Digital Twin-Based Network Management for Better QoE in Multicast Short Video Streaming | Xinyu Huang, Shisheng Hu, Haojun Yang, Xinghan Wang, Yingying Pei, Xuemin Shen | 2024-01-23 | Download | Multicast short video streaming can enhance bandwidth utilization by enabling simultaneous video transmission to multiple users over shared wireless channels. |
| COREC: Concurrent Non-Blocking Single-Queue Receive Driver for Low Latency Networking | Marco Faltelli, Giacomo Belocchi, Francesco Quaglia, Giuseppe Bianchi | 2024-01-23 | Download | Existing network stacks tackle performance and scalability aspects by relying on multiple receive queues. However, at software level, each queue is processed by a single thread, which prevents simulta... |
| Learning from the Best: Active Learning for Wireless Communications | Nasim Soltani, Jifan Zhang, Batool Salehi, Debashri Roy, Robert Nowak, Kaushik Chowdhury | 2024-01-23 | Download | Collecting an over-the-air wireless communications training dataset for deep learning-based communication tasks is relatively simple. However, labeling the dataset requires expert involvement and doma... |
| A lightweight decentralized service placement policy for performance optimization in fog computing | Carlos Guerrero, Isaac Lera, Carlos Juiz | 2024-01-23 | Download | A decentralized optimization policy for service placement in fog computing is presented. The optimization is addressed to place most popular services as closer to the users as possible. |
| Genetic Algorithm for Multi-Objective Optimization of Container Allocation in Cloud Architecture | Carlos Guerrero, Isaac Lera, Carlos Juiz | 2024-01-23 | Download | The use of containers in cloud architectures has become widespread because of advantages such as limited overhead, easier and faster deployment and higher portability. |
| Availability-aware Service Placement Policy in Fog Computing Based on Graph Partitions | Isaac Lera, Carlos Guerrero, Carlos Juiz | 2024-01-23 | Download | This paper presents a policy for service placement of fog applications inspired on complex networks and graph theory. We propose a twofold partition process based on communities for the partition of t... |
| Knowledge Distillation from Language-Oriented to Emergent Communication for Multi-Agent Remote Control | Yongjun Kim, Sejin Seo, Jihong Park, Mehdi Bennis, Seong-Lyun Kim, Junil Choi | 2024-01-23 | Download | In this work, we compare emergent communication (EC) built upon multi-agent deep reinforcement learning (MADRL) and language-oriented semantic communication (LSC) empowered by a pre-trained large lang... |
| Investigation of FlexAlgo for User-driven Path Control | Julia Kułacz, Martyna Pawlus, Leonardo Boldrini, Paola Grosso | 2024-01-23 | Download | This paper examines the Flexible Algorithm (FlexAlgo) for its potential to enable user-driven path control in intra-domain Segment Routing (SR) enabled networks. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Automated Programmatic Performance Analysis of Parallel Programs | Onur Cankur, Aditya Tomar, Daniel Nichols, Connor Scully-Allison, Katherine E. Isaacs, Abhinav Bhatele | 2024-01-23 | Download | Developing efficient parallel applications is critical to advancing scientific development but requires significant performance analysis and optimization. |