Skip to content

2024-05-07

cs.AR - Architecture

标题作者发布日期PDF摘要
Insights from Basilisk: Are Open-Source EDA Tools Ready for a Multi-Million-Gate, Linux-Booting RV64 SoC Design?Philippe Sauter, Thomas Benz, Paul Scheffler, Frank K. Gürkaynak, Luca Benini2024-05-07下载Designing complex, multi-million-gate application-specific integrated circuits requires robust and mature electronic design automation (EDA) tools.
NOVA: NoC-based Vector Unit for Mapping Attention Layers on a CNN AcceleratorMohit Upadhyay, Rohan Juneja, Weng-Fai Wong, Li-Shiuan Peh2024-05-07下载Attention mechanisms are becoming increasingly popular, being used in neural network models in multiple domains such as natural language processing (NLP) and vision applications, especially at the edg...
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory SystemsKailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu, Guru Venkataramani2024-05-07下载Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets. However, RL training often faces memory limitations, leading to execution lat...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via Ensembling Normalization Layers with Shared WeightsSoyed Tuhin Ahmed, Michael Hefenbrock, Mehdi B. Tahoori2024-05-07下载The applications of artificial intelligence (AI) are rapidly evolving, and they are also commonly used in safety-critical domains, such as autonomous driving and medical diagnosis, where functional sa...
Robust Implementation of Retrieval-Augmented Generation on Edge-based Computing-in-Memory ArchitecturesRuiyang Qin, Zheyu Yan, Dewen Zeng, Zhenge Jia, Dancheng Liu, Jianbo Liu, Zhi Zheng, Ningyuan Cao, Kai Ni, Jinjun Xiong, Yiyu Shi2024-05-07下载Large Language Models (LLMs) deployed on edge devices learn through fine-tuning and updating a certain portion of their parameters. Although such learning methods can be optimized to reduce resource u...
Simpler and More General Distributed Coloring Based on Simple List Defective Coloring AlgorithmsMarc Fuchs, Fabian Kuhn2024-05-07下载In this paper, we give list coloring variants of simple iterative defective coloring algorithms. Formally, in a list defective coloring instance, each node vv of a graph is given a list LvL_v of colo...
Probabilistic Byzantine Fault Tolerance (Extended Version)Diogo Avelãs, Hasan Heydari, Eduardo Alchieri, Tobias Distler, Alysson Bessani2024-05-07下载Consensus is a fundamental building block for constructing reliable and fault-tolerant distributed services. Many Byzantine fault-tolerant consensus protocols designed for partially synchronous system...
PoW Security-Latency under Random Delays and the Effect of Transaction FeesMustafa Doger, Sennur Ulukus, Nail Akar2024-05-07下载Safety guarantees and security-latency problem of Nakamoto consensus have been extensively studied in the last decade with a bounded delay model.
Distributed Computation with Local AdviceAlkida Balliu, Sebastian Brandt, Fabian Kuhn, Krzysztof Nowicki, Dennis Olivetti, Eva Rotenberg, Jukka Suomela2024-05-07下载In this work we study local computation with advice: the goal is to solve a graph problem Π with a distributed algorithm in T(Δ) communication rounds, for some function TT that only depends on th...
Scalable Circuit Cutting and Scheduling in a Resource-constrained and Distributed Quantum SystemShuwen Kan, Zefan Du, Miguel Palma, Samuel A Stein, Chenxu Liu, Wenqi Wei, Juntao Chen, Ang Li, Ying Mao2024-05-07下载Despite quantum computing's rapid development, current systems remain limited in practical applications due to their limited qubit count and quality.
Fast Decentralized Gradient Tracking for Federated Minimax Optimization with Local UpdatesChris Junchi Li2024-05-07下载Federated learning (FL) for minimax optimization has emerged as a powerful paradigm for training models across distributed nodes/clients while preserving data privacy and model robustness on data hete...
Resource-Efficient and Self-Adaptive Quantum Search in a Quantum-Classical Hybrid SystemZihao Jiang, Zefan Du, Shaolun Ruan, Juntao Chen, Yong Wang, Long Cheng, Rajkumar Buyya, Ying Mao2024-05-07下载Over the past decade, the rapid advancement of deep learning and big data applications has been driven by vast datasets and high-performance computing systems.
Parallelized Multi-Agent Bayesian Optimization in LavaShay Snyder, Derek Gobin, Victoria Clerico, Sumedh R. Risbud, Maryam Parsa2024-05-07下载In parallel with the continuously increasing parameter space dimensionality, search and optimization algorithms should support distributed parameter evaluations to reduce cumulative runtime.
Self-Stabilizing MIS Computation in the Beeping ModelGeorge Giakkoupis, Volker Turau, Isabella Ziccardi2024-05-07下载We consider self-stabilizing algorithms to compute a Maximal Independent Set (MIS) in the extremely weak beeping communication model. The model consists of an anonymous network with synchronous rounds...
Federated Learning for Collaborative Inference Systems: The Case of Early Exit NetworksCaelin Kaplan, Angelo Rodio, Tareq Si Salem, Chuan Xu, Giovanni Neglia2024-05-07下载As Internet of Things (IoT) technology advances, end devices like sensors and smartphones are progressively equipped with AI models tailored to their local memory and computational constraints.
QR factorization of ill-conditioned tall-and-skinny matrices on distributed-memory systemsNenad Mijić, Abhiram Kaushik, Davor Davidović2024-05-07下载In this paper we present a novel algorithm developed for computing the QR factorisation of extremely ill-conditioned tall-and-skinny matrices on distributed memory systems.
pFedLVM: A Large Vision Model (LVM)-Driven and Latent Feature-Based Personalized Federated Learning Framework in Autonomous DrivingWei-Bin Kou, Qingfeng Lin, Ming Tang, Sheng Xu, Rongguang Ye, Yang Leng, Shuai Wang, Guofa Li, Zhenyu Chen, Guangxu Zhu, Yik-Chung Wu2024-05-07下载Deep learning-based Autonomous Driving (AD) models often exhibit poor generalization due to data heterogeneity in an ever domain-shifting environment.
Ranking-based Client Selection with Imitation Learning for Efficient Federated LearningChunlin Tian, Zhan Shi, Xinpeng Qin, Li Li, Chengzhong Xu2024-05-07下载Federated Learning (FL) enables multiple devices to collaboratively train a shared model while ensuring data privacy. The selection of participating devices in each training round critically affects b...
Federated Graph Condensation with Information Bottleneck PrinciplesBo Yan, Sihao He, Cheng Yang, Shang Liu, Yang Cao, Chuan Shi2024-05-07下载Graph condensation (GC), which reduces the size of a large-scale graph by synthesizing a small-scale condensed graph as its substitution, has benefited various graph learning tasks.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Pipe Routing with Topology Control for UAV NetworksShreyas Devaraju, Shivam Garg, Alexander Ihler, Sunil Kumar2024-05-07下载Routing protocols help in transmitting the sensed data from UAVs monitoring the targets (called target UAVs) to the BS. However, the highly dynamic nature of an autonomous, decentralized UAV network l...
Designing, Developing, and Validating Network Intelligence for Scaling in Service-Based Architectures based on Deep Reinforcement LearningPaola Soto, Miguel Camelo, Danny De Vleeschauwer, Yorick De Bock, Nina Slamnik-Kriještorac, Chia-Yu Chang, Natalia Gaviria, Erik Mannens, Juan F. Botero, Steven Latré2024-05-07下载Automating network processes without human intervention is crucial for the complex Sixth Generation (6G) environment. Thus, 6G networks must advance beyond basic automation, relying on Artificial Inte...
Designing the Network Intelligence Stratum for 6G NetworksPaola Soto, Miguel Camelo, Gines Garcia-Aviles, Esteban Municio, Marco Gramaglia, Evangelos Kosmatos, Nina Slamnik-Kriještorac, Danny De Vleeschauwer, Antonio Bazco-Nogueras, Lidia Fuentes, Joaquin Ballesteros, Andra Lutu, Luca Cominardi, Ivan Paez, Sergi Alcalá-Marín, Livia Elena Chatzieleftheriou, Andres Garcia-Saavedra, Marco Fiore2024-05-07下载As network complexity escalates, there is an increasing need for more sophisticated methods to manage and operate these networks, focusing on enhancing efficiency, reliability, and security.
Optimizing Information Freshness in IoT Systems with Update Rate Constraints: A Token-Based ApproachErfan Delfani, Nikolaos Pappas2024-05-07下载In Internet of Things (IoT) status update systems, where information is sampled and subsequently transmitted from a source to a destination node, the imperative necessity lies in maintaining the timel...
Utility-driven Optimization of TTL Cache Hierarchies under Network DelaysKarim S. Elsayed, Fabien Geyer, Amr Rizk2024-05-07下载We optimize hierarchies of Time-to-Live (TTL) caches under random network delays. A TTL cache assigns individual eviction timers to cached objects that are usually refreshed upon a hit where upon a mi...
PACIFISTA: Conflict Evaluation and Management in Open RANPietro Brach del Prever, Salvatore D'Oro, Leonardo Bonati, Michele Polese, Maria Tsampazi, Heiko Lehmann, Tommaso Melodia2024-05-07下载The O-RAN ALLIANCE is defining architectures, interfaces, operations, and security requirements for cellular networks based on Open Radio Access Network (RAN) principles.
One-Class Classification as GLRT for Jamming Detection in Private 5G NetworksMatteo Varotto, Stefan Valentin, Francesco Ardizzon, Samuele Marzotto, Stefano Tomasin2024-05-07下载5G mobile networks are vulnerable to jamming attacks that may jeopardize valuable applications such as industry automation. In this paper, we propose to analyze radio signals with a dedicated device t...
Detecting 5G Narrowband Jammers with CNN, k-nearest Neighbors, and Support Vector MachinesMatteo Varotto, Florian Heinrichs, Timo Schuerg, Stefano Tomasin, Stefan Valentin2024-05-07下载5G cellular networks are particularly vulnerable against narrowband jammers that target specific control sub-channels in the radio signal. One mitigation approach is to detect such jamming attacks wit...
GLIDS: A Global Latency Information Dissemination SystemCyrill Krähenbühl, Seyedali Tabaeiaghdaei, Simon Scherrer, Matthias Frei, Adrian Perrig2024-05-07下载A recent advance in networking is the deployment of path-aware multipath network architectures, where network endpoints are given multiple network paths to send their data on.
Energy-Efficient Deployment of Stateful FaaS Vertical Applications on Edge Data NetworksClaudio Cicconetti, Raffaele Bruno, Andrea Passarella2024-05-07下载5G and beyond support the deployment of vertical applications, which is particularly appealing in combination with network slicing and edge computing to create a logically isolated environment for exe...
Effect of Realistic Oscillator Phase Noise on the Performance of Cell-Free Massive MIMO SystemsIgor Zhilin, Evgenii Vinogradov, Ian Akyildiz2024-05-07下载As the demand for 6G technologies continues to grow, the radio access infrastructure is expected to become increasingly dense. Cell-free (CF) Massive MIMO systems provide remarkable flexibility by ena...
uTNT: Unikernels for Efficient and Flexible Internet ProbingMaxime Letemple, Gaulthier Gain, Sami Ben Mariem, Laurent Mathy, Benoit Donnet2024-05-07下载The last twenty years have seen the development and popularity of network measurement infrastructures. Internet measurement platforms have become common and have demonstrated their relevance in Intern...
TrimCaching: Parameter-sharing AI Model Caching in Wireless Edge NetworksGuanqiao Qu, Zheng Lin, Fangming Liu, Xianhao Chen, Kaibin Huang2024-05-07下载Next-generation mobile networks are expected to facilitate fast AI model downloading to end users. By caching models on edge servers, mobile networks can deliver models to end users with low latency, ...
Role of Sensing and Computer Vision in 6G Wireless CommunicationsSeungnyun Kim, Jihoon Moon, Jinhong Kim, Yongjun Ahn, Donghoon Kim, Sunwoo Kim, Kyuhong Shim, Byonghyo Shim2024-05-07下载Recently, we are witnessing the remarkable progress and widespread adoption of sensing technologies in autonomous driving, robotics, and metaverse.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
vAttention: Dynamic Memory Management for Serving LLMs without PagedAttentionRamya Prabhu, Ajay Nayak, Jayashree Mohan, Ramachandran Ramjee, Ashish Panwar2024-05-07下载PagedAttention is a popular approach for dynamic memory allocation in LLM serving systems. It enables on-demand allocation of GPU memory to mitigate KV cache fragmentation -- a phenomenon that cripple...
uTNT: Unikernels for Efficient and Flexible Internet ProbingMaxime Letemple, Gaulthier Gain, Sami Ben Mariem, Laurent Mathy, Benoit Donnet2024-05-07下载The last twenty years have seen the development and popularity of network measurement infrastructures. Internet measurement platforms have become common and have demonstrated their relevance in Intern...

cs.PF - Performance

标题作者发布日期PDF摘要
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM ServingYujun Lin, Haotian Tang, Shang Yang, Zhekai Zhang, Guangxuan Xiao, Chuang Gan, Song Han2024-05-07下载Quantization can accelerate large language model (LLM) inference. Going beyond INT8 quantization, the research community is actively exploring even lower precision, such as INT4.
QR factorization of ill-conditioned tall-and-skinny matrices on distributed-memory systemsNenad Mijić, Abhiram Kaushik, Davor Davidović2024-05-07下载In this paper we present a novel algorithm developed for computing the QR factorisation of extremely ill-conditioned tall-and-skinny matrices on distributed memory systems.
Analysis of Markovian Arrivals and Service with Applications to Intermittent OverloadIsaac Grosof, Yige Hong, Mor Harchol-Balter2024-05-07下载In many important real-world queueing settings, arrival and service rates fluctuate over time. We consider the MAMS system, where the arrival and service rates each vary according to an arbitrary fini...

基于 VitePress 构建