2024-01-23

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Full-Stack Optimization for CAM-Only DNN Inference	João Paulo C. de Lima, Asif Ali Khan, Luigi Carro, Jeronimo Castrillon	2024-01-23	下载	The accuracy of neural networks has greatly improved across various domains over the past years. Their ever-increasing complexity, however, leads to prohibitively high energy demands and latency in vo...
CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators	Songyun Qu, Shixin Zhao, Bing Li, Yintao He, Xuyi Cai, Lei Zhang, Ying Wang	2024-01-23	下载	In recent years, various computing-in-memory (CIM) processors have been presented, showing superior performance over traditional architectures.
Enhancing Reliability of Neural Networks at the Edge: Inverted Normalization with Stochastic Affine Transformations	Soyed Tuhin Ahmed, Kamal Danouchi, Guillaume Prenat, Lorena Anghel, Mehdi B. Tahoori	2024-01-23	下载	Bayesian Neural Networks (BayNNs) naturally provide uncertainty in their predictions, making them a suitable choice in safety-critical applications.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Automated Programmatic Performance Analysis of Parallel Programs	Onur Cankur, Aditya Tomar, Daniel Nichols, Connor Scully-Allison, Katherine E. Isaacs, Abhinav Bhatele	2024-01-23	下载	Developing efficient parallel applications is critical to advancing scientific development but requires significant performance analysis and optimization.
Deterministic Collision-Free Exploration of Unknown Anonymous Graphs	Subhash Bhagat, Andrzej Pelc	2024-01-23	下载	We consider the fundamental task of network exploration. A network is modeled as a simple connected undirected n-node graph with unlabeled nodes, and all ports at any node of degree d are arbitrarily ...
COREC: Concurrent Non-Blocking Single-Queue Receive Driver for Low Latency Networking	Marco Faltelli, Giacomo Belocchi, Francesco Quaglia, Giuseppe Bianchi	2024-01-23	下载	Existing network stacks tackle performance and scalability aspects by relying on multiple receive queues. However, at software level, each queue is processed by a single thread, which prevents simulta...
Towards Privacy-, Budget-, and Deadline-Aware Service Optimization for Large Medical Image Processing across Hybrid Clouds	Yuandou Wang, Neel Kanwal, Kjersti Engan, Chunming Rong, Paola Grosso, Zhiming Zhao	2024-01-23	下载	Efficiently processing medical images, such as whole slide images in digital pathology, is essential for timely diagnosing high-risk diseases.
Can Large Language Models Write Parallel Code?	Daniel Nichols, Joshua H. Davis, Zhaojun Xie, Arjun Rajaram, Abhinav Bhatele	2024-01-23	下载	Large language models are increasingly becoming a popular tool for software development. Their ability to model and generate source code has been demonstrated in a variety of contexts, including code ...
Utilizing Graph Sparsification for Pre-processing in Maxcut QUBO Solver	Vorapong Suppakitpaisarn, Jin-Kao Hao	2024-01-23	下载	We suggest employing graph sparsification as a pre-processing step for maxcut programs using the QUBO solver. Quantum(-inspired) algorithms are recognized for their potential efficiency in handling qu...
Secure Federated Learning Approaches to Diagnosing COVID-19	Rittika Adhikari, Christopher Settles	2024-01-23	下载	The recent pandemic has underscored the importance of accurately diagnosing COVID-19 in hospital settings. A major challenge in this regard is differentiating COVID-19 from other respiratory illnesses...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Minimizing the Age of Two Heterogeneous Sources With Packet Drops Via Cyclic Schedulers	Sahan Liyanaarachchi, Sennur Ulukus, Nail Akar	2024-01-23	下载	In a communication setting where multiple sources share a single channel to provide status updates to a remote monitor, source transmissions need to be scheduled appropriately to maintain timely commu...
Eloquent: A More Robust Transmission Scheme for LLM Token Streaming	Hanchen Li, Yuhan Liu, Yihua Cheng, Siddhant Ray, Kuntai Du, Junchen Jiang	2024-01-23	下载	To render each generated token in real-time for users, the Large Language Model (LLM) server generates tokens one by one and streams each token (or group of a few tokens) through the network to the us...
Digital Twin-Based Network Management for Better QoE in Multicast Short Video Streaming	Xinyu Huang, Shisheng Hu, Haojun Yang, Xinghan Wang, Yingying Pei, Xuemin Shen	2024-01-23	下载	Multicast short video streaming can enhance bandwidth utilization by enabling simultaneous video transmission to multiple users over shared wireless channels.
COREC: Concurrent Non-Blocking Single-Queue Receive Driver for Low Latency Networking	Marco Faltelli, Giacomo Belocchi, Francesco Quaglia, Giuseppe Bianchi	2024-01-23	下载	Existing network stacks tackle performance and scalability aspects by relying on multiple receive queues. However, at software level, each queue is processed by a single thread, which prevents simulta...
Learning from the Best: Active Learning for Wireless Communications	Nasim Soltani, Jifan Zhang, Batool Salehi, Debashri Roy, Robert Nowak, Kaushik Chowdhury	2024-01-23	下载	Collecting an over-the-air wireless communications training dataset for deep learning-based communication tasks is relatively simple. However, labeling the dataset requires expert involvement and doma...
A lightweight decentralized service placement policy for performance optimization in fog computing	Carlos Guerrero, Isaac Lera, Carlos Juiz	2024-01-23	下载	A decentralized optimization policy for service placement in fog computing is presented. The optimization is addressed to place most popular services as closer to the users as possible.
Genetic Algorithm for Multi-Objective Optimization of Container Allocation in Cloud Architecture	Carlos Guerrero, Isaac Lera, Carlos Juiz	2024-01-23	下载	The use of containers in cloud architectures has become widespread because of advantages such as limited overhead, easier and faster deployment and higher portability.
Availability-aware Service Placement Policy in Fog Computing Based on Graph Partitions	Isaac Lera, Carlos Guerrero, Carlos Juiz	2024-01-23	下载	This paper presents a policy for service placement of fog applications inspired on complex networks and graph theory. We propose a twofold partition process based on communities for the partition of t...
Knowledge Distillation from Language-Oriented to Emergent Communication for Multi-Agent Remote Control	Yongjun Kim, Sejin Seo, Jihong Park, Mehdi Bennis, Seong-Lyun Kim, Junil Choi	2024-01-23	下载	In this work, we compare emergent communication (EC) built upon multi-agent deep reinforcement learning (MADRL) and language-oriented semantic communication (LSC) empowered by a pre-trained large lang...
Investigation of FlexAlgo for User-driven Path Control	Julia Kułacz, Martyna Pawlus, Leonardo Boldrini, Paola Grosso	2024-01-23	下载	This paper examines the Flexible Algorithm (FlexAlgo) for its potential to enable user-driven path control in intra-domain Segment Routing (SR) enabled networks.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Automated Programmatic Performance Analysis of Parallel Programs	Onur Cankur, Aditya Tomar, Daniel Nichols, Connor Scully-Allison, Katherine E. Isaacs, Abhinav Bhatele	2024-01-23	下载	Developing efficient parallel applications is critical to advancing scientific development but requires significant performance analysis and optimization.