Skip to content

2024-01-23

cs.AR - Architecture

标题作者发布日期PDF摘要
Full-Stack Optimization for CAM-Only DNN InferenceJoão Paulo C. de Lima, Asif Ali Khan, Luigi Carro, Jeronimo Castrillon2024-01-23下载The accuracy of neural networks has greatly improved across various domains over the past years. Their ever-increasing complexity, however, leads to prohibitively high energy demands and latency in vo...
CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory AcceleratorsSongyun Qu, Shixin Zhao, Bing Li, Yintao He, Xuyi Cai, Lei Zhang, Ying Wang2024-01-23下载In recent years, various computing-in-memory (CIM) processors have been presented, showing superior performance over traditional architectures.
Enhancing Reliability of Neural Networks at the Edge: Inverted Normalization with Stochastic Affine TransformationsSoyed Tuhin Ahmed, Kamal Danouchi, Guillaume Prenat, Lorena Anghel, Mehdi B. Tahoori2024-01-23下载Bayesian Neural Networks (BayNNs) naturally provide uncertainty in their predictions, making them a suitable choice in safety-critical applications.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Automated Programmatic Performance Analysis of Parallel ProgramsOnur Cankur, Aditya Tomar, Daniel Nichols, Connor Scully-Allison, Katherine E. Isaacs, Abhinav Bhatele2024-01-23下载Developing efficient parallel applications is critical to advancing scientific development but requires significant performance analysis and optimization.
Deterministic Collision-Free Exploration of Unknown Anonymous GraphsSubhash Bhagat, Andrzej Pelc2024-01-23下载We consider the fundamental task of network exploration. A network is modeled as a simple connected undirected n-node graph with unlabeled nodes, and all ports at any node of degree d are arbitrarily ...
COREC: Concurrent Non-Blocking Single-Queue Receive Driver for Low Latency NetworkingMarco Faltelli, Giacomo Belocchi, Francesco Quaglia, Giuseppe Bianchi2024-01-23下载Existing network stacks tackle performance and scalability aspects by relying on multiple receive queues. However, at software level, each queue is processed by a single thread, which prevents simulta...
Towards Privacy-, Budget-, and Deadline-Aware Service Optimization for Large Medical Image Processing across Hybrid CloudsYuandou Wang, Neel Kanwal, Kjersti Engan, Chunming Rong, Paola Grosso, Zhiming Zhao2024-01-23下载Efficiently processing medical images, such as whole slide images in digital pathology, is essential for timely diagnosing high-risk diseases.
Can Large Language Models Write Parallel Code?Daniel Nichols, Joshua H. Davis, Zhaojun Xie, Arjun Rajaram, Abhinav Bhatele2024-01-23下载Large language models are increasingly becoming a popular tool for software development. Their ability to model and generate source code has been demonstrated in a variety of contexts, including code ...
Utilizing Graph Sparsification for Pre-processing in Maxcut QUBO SolverVorapong Suppakitpaisarn, Jin-Kao Hao2024-01-23下载We suggest employing graph sparsification as a pre-processing step for maxcut programs using the QUBO solver. Quantum(-inspired) algorithms are recognized for their potential efficiency in handling qu...
Secure Federated Learning Approaches to Diagnosing COVID-19Rittika Adhikari, Christopher Settles2024-01-23下载The recent pandemic has underscored the importance of accurately diagnosing COVID-19 in hospital settings. A major challenge in this regard is differentiating COVID-19 from other respiratory illnesses...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Minimizing the Age of Two Heterogeneous Sources With Packet Drops Via Cyclic SchedulersSahan Liyanaarachchi, Sennur Ulukus, Nail Akar2024-01-23下载In a communication setting where multiple sources share a single channel to provide status updates to a remote monitor, source transmissions need to be scheduled appropriately to maintain timely commu...
Eloquent: A More Robust Transmission Scheme for LLM Token StreamingHanchen Li, Yuhan Liu, Yihua Cheng, Siddhant Ray, Kuntai Du, Junchen Jiang2024-01-23下载To render each generated token in real-time for users, the Large Language Model (LLM) server generates tokens one by one and streams each token (or group of a few tokens) through the network to the us...
Digital Twin-Based Network Management for Better QoE in Multicast Short Video StreamingXinyu Huang, Shisheng Hu, Haojun Yang, Xinghan Wang, Yingying Pei, Xuemin Shen2024-01-23下载Multicast short video streaming can enhance bandwidth utilization by enabling simultaneous video transmission to multiple users over shared wireless channels.
COREC: Concurrent Non-Blocking Single-Queue Receive Driver for Low Latency NetworkingMarco Faltelli, Giacomo Belocchi, Francesco Quaglia, Giuseppe Bianchi2024-01-23下载Existing network stacks tackle performance and scalability aspects by relying on multiple receive queues. However, at software level, each queue is processed by a single thread, which prevents simulta...
Learning from the Best: Active Learning for Wireless CommunicationsNasim Soltani, Jifan Zhang, Batool Salehi, Debashri Roy, Robert Nowak, Kaushik Chowdhury2024-01-23下载Collecting an over-the-air wireless communications training dataset for deep learning-based communication tasks is relatively simple. However, labeling the dataset requires expert involvement and doma...
A lightweight decentralized service placement policy for performance optimization in fog computingCarlos Guerrero, Isaac Lera, Carlos Juiz2024-01-23下载A decentralized optimization policy for service placement in fog computing is presented. The optimization is addressed to place most popular services as closer to the users as possible.
Genetic Algorithm for Multi-Objective Optimization of Container Allocation in Cloud ArchitectureCarlos Guerrero, Isaac Lera, Carlos Juiz2024-01-23下载The use of containers in cloud architectures has become widespread because of advantages such as limited overhead, easier and faster deployment and higher portability.
Availability-aware Service Placement Policy in Fog Computing Based on Graph PartitionsIsaac Lera, Carlos Guerrero, Carlos Juiz2024-01-23下载This paper presents a policy for service placement of fog applications inspired on complex networks and graph theory. We propose a twofold partition process based on communities for the partition of t...
Knowledge Distillation from Language-Oriented to Emergent Communication for Multi-Agent Remote ControlYongjun Kim, Sejin Seo, Jihong Park, Mehdi Bennis, Seong-Lyun Kim, Junil Choi2024-01-23下载In this work, we compare emergent communication (EC) built upon multi-agent deep reinforcement learning (MADRL) and language-oriented semantic communication (LSC) empowered by a pre-trained large lang...
Investigation of FlexAlgo for User-driven Path ControlJulia Kułacz, Martyna Pawlus, Leonardo Boldrini, Paola Grosso2024-01-23下载This paper examines the Flexible Algorithm (FlexAlgo) for its potential to enable user-driven path control in intra-domain Segment Routing (SR) enabled networks.

cs.PF - Performance

标题作者发布日期PDF摘要
Automated Programmatic Performance Analysis of Parallel ProgramsOnur Cankur, Aditya Tomar, Daniel Nichols, Connor Scully-Allison, Katherine E. Isaacs, Abhinav Bhatele2024-01-23下载Developing efficient parallel applications is critical to advancing scientific development but requires significant performance analysis and optimization.

基于 VitePress 构建