Appearance
2024-05-07
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Insights from Basilisk: Are Open-Source EDA Tools Ready for a Multi-Million-Gate, Linux-Booting RV64 SoC Design? | Philippe Sauter, Thomas Benz, Paul Scheffler, Frank K. Gürkaynak, Luca Benini | 2024-05-07 | 下载 | Designing complex, multi-million-gate application-specific integrated circuits requires robust and mature electronic design automation (EDA) tools. |
| NOVA: NoC-based Vector Unit for Mapping Attention Layers on a CNN Accelerator | Mohit Upadhyay, Rohan Juneja, Weng-Fai Wong, Li-Shiuan Peh | 2024-05-07 | 下载 | Attention mechanisms are becoming increasingly popular, being used in neural network models in multiple domains such as natural language processing (NLP) and vision applications, especially at the edg... |
| SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems | Kailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu, Guru Venkataramani | 2024-05-07 | 下载 | Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets. However, RL training often faces memory limitations, leading to execution lat... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via Ensembling Normalization Layers with Shared Weights | Soyed Tuhin Ahmed, Michael Hefenbrock, Mehdi B. Tahoori | 2024-05-07 | 下载 | The applications of artificial intelligence (AI) are rapidly evolving, and they are also commonly used in safety-critical domains, such as autonomous driving and medical diagnosis, where functional sa... |
| Robust Implementation of Retrieval-Augmented Generation on Edge-based Computing-in-Memory Architectures | Ruiyang Qin, Zheyu Yan, Dewen Zeng, Zhenge Jia, Dancheng Liu, Jianbo Liu, Zhi Zheng, Ningyuan Cao, Kai Ni, Jinjun Xiong, Yiyu Shi | 2024-05-07 | 下载 | Large Language Models (LLMs) deployed on edge devices learn through fine-tuning and updating a certain portion of their parameters. Although such learning methods can be optimized to reduce resource u... |
| Simpler and More General Distributed Coloring Based on Simple List Defective Coloring Algorithms | Marc Fuchs, Fabian Kuhn | 2024-05-07 | 下载 | In this paper, we give list coloring variants of simple iterative defective coloring algorithms. Formally, in a list defective coloring instance, each node of a graph is given a list of colo... |
| Probabilistic Byzantine Fault Tolerance (Extended Version) | Diogo Avelãs, Hasan Heydari, Eduardo Alchieri, Tobias Distler, Alysson Bessani | 2024-05-07 | 下载 | Consensus is a fundamental building block for constructing reliable and fault-tolerant distributed services. Many Byzantine fault-tolerant consensus protocols designed for partially synchronous system... |
| PoW Security-Latency under Random Delays and the Effect of Transaction Fees | Mustafa Doger, Sennur Ulukus, Nail Akar | 2024-05-07 | 下载 | Safety guarantees and security-latency problem of Nakamoto consensus have been extensively studied in the last decade with a bounded delay model. |
| Distributed Computation with Local Advice | Alkida Balliu, Sebastian Brandt, Fabian Kuhn, Krzysztof Nowicki, Dennis Olivetti, Eva Rotenberg, Jukka Suomela | 2024-05-07 | 下载 | In this work we study local computation with advice: the goal is to solve a graph problem Π with a distributed algorithm in T(Δ) communication rounds, for some function that only depends on th... |
| Scalable Circuit Cutting and Scheduling in a Resource-constrained and Distributed Quantum System | Shuwen Kan, Zefan Du, Miguel Palma, Samuel A Stein, Chenxu Liu, Wenqi Wei, Juntao Chen, Ang Li, Ying Mao | 2024-05-07 | 下载 | Despite quantum computing's rapid development, current systems remain limited in practical applications due to their limited qubit count and quality. |
| Fast Decentralized Gradient Tracking for Federated Minimax Optimization with Local Updates | Chris Junchi Li | 2024-05-07 | 下载 | Federated learning (FL) for minimax optimization has emerged as a powerful paradigm for training models across distributed nodes/clients while preserving data privacy and model robustness on data hete... |
| Resource-Efficient and Self-Adaptive Quantum Search in a Quantum-Classical Hybrid System | Zihao Jiang, Zefan Du, Shaolun Ruan, Juntao Chen, Yong Wang, Long Cheng, Rajkumar Buyya, Ying Mao | 2024-05-07 | 下载 | Over the past decade, the rapid advancement of deep learning and big data applications has been driven by vast datasets and high-performance computing systems. |
| Parallelized Multi-Agent Bayesian Optimization in Lava | Shay Snyder, Derek Gobin, Victoria Clerico, Sumedh R. Risbud, Maryam Parsa | 2024-05-07 | 下载 | In parallel with the continuously increasing parameter space dimensionality, search and optimization algorithms should support distributed parameter evaluations to reduce cumulative runtime. |
| Self-Stabilizing MIS Computation in the Beeping Model | George Giakkoupis, Volker Turau, Isabella Ziccardi | 2024-05-07 | 下载 | We consider self-stabilizing algorithms to compute a Maximal Independent Set (MIS) in the extremely weak beeping communication model. The model consists of an anonymous network with synchronous rounds... |
| Federated Learning for Collaborative Inference Systems: The Case of Early Exit Networks | Caelin Kaplan, Angelo Rodio, Tareq Si Salem, Chuan Xu, Giovanni Neglia | 2024-05-07 | 下载 | As Internet of Things (IoT) technology advances, end devices like sensors and smartphones are progressively equipped with AI models tailored to their local memory and computational constraints. |
| QR factorization of ill-conditioned tall-and-skinny matrices on distributed-memory systems | Nenad Mijić, Abhiram Kaushik, Davor Davidović | 2024-05-07 | 下载 | In this paper we present a novel algorithm developed for computing the QR factorisation of extremely ill-conditioned tall-and-skinny matrices on distributed memory systems. |
| pFedLVM: A Large Vision Model (LVM)-Driven and Latent Feature-Based Personalized Federated Learning Framework in Autonomous Driving | Wei-Bin Kou, Qingfeng Lin, Ming Tang, Sheng Xu, Rongguang Ye, Yang Leng, Shuai Wang, Guofa Li, Zhenyu Chen, Guangxu Zhu, Yik-Chung Wu | 2024-05-07 | 下载 | Deep learning-based Autonomous Driving (AD) models often exhibit poor generalization due to data heterogeneity in an ever domain-shifting environment. |
| Ranking-based Client Selection with Imitation Learning for Efficient Federated Learning | Chunlin Tian, Zhan Shi, Xinpeng Qin, Li Li, Chengzhong Xu | 2024-05-07 | 下载 | Federated Learning (FL) enables multiple devices to collaboratively train a shared model while ensuring data privacy. The selection of participating devices in each training round critically affects b... |
| Federated Graph Condensation with Information Bottleneck Principles | Bo Yan, Sihao He, Cheng Yang, Shang Liu, Yang Cao, Chuan Shi | 2024-05-07 | 下载 | Graph condensation (GC), which reduces the size of a large-scale graph by synthesizing a small-scale condensed graph as its substitution, has benefited various graph learning tasks. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Pipe Routing with Topology Control for UAV Networks | Shreyas Devaraju, Shivam Garg, Alexander Ihler, Sunil Kumar | 2024-05-07 | 下载 | Routing protocols help in transmitting the sensed data from UAVs monitoring the targets (called target UAVs) to the BS. However, the highly dynamic nature of an autonomous, decentralized UAV network l... |
| Designing, Developing, and Validating Network Intelligence for Scaling in Service-Based Architectures based on Deep Reinforcement Learning | Paola Soto, Miguel Camelo, Danny De Vleeschauwer, Yorick De Bock, Nina Slamnik-Kriještorac, Chia-Yu Chang, Natalia Gaviria, Erik Mannens, Juan F. Botero, Steven Latré | 2024-05-07 | 下载 | Automating network processes without human intervention is crucial for the complex Sixth Generation (6G) environment. Thus, 6G networks must advance beyond basic automation, relying on Artificial Inte... |
| Designing the Network Intelligence Stratum for 6G Networks | Paola Soto, Miguel Camelo, Gines Garcia-Aviles, Esteban Municio, Marco Gramaglia, Evangelos Kosmatos, Nina Slamnik-Kriještorac, Danny De Vleeschauwer, Antonio Bazco-Nogueras, Lidia Fuentes, Joaquin Ballesteros, Andra Lutu, Luca Cominardi, Ivan Paez, Sergi Alcalá-Marín, Livia Elena Chatzieleftheriou, Andres Garcia-Saavedra, Marco Fiore | 2024-05-07 | 下载 | As network complexity escalates, there is an increasing need for more sophisticated methods to manage and operate these networks, focusing on enhancing efficiency, reliability, and security. |
| Optimizing Information Freshness in IoT Systems with Update Rate Constraints: A Token-Based Approach | Erfan Delfani, Nikolaos Pappas | 2024-05-07 | 下载 | In Internet of Things (IoT) status update systems, where information is sampled and subsequently transmitted from a source to a destination node, the imperative necessity lies in maintaining the timel... |
| Utility-driven Optimization of TTL Cache Hierarchies under Network Delays | Karim S. Elsayed, Fabien Geyer, Amr Rizk | 2024-05-07 | 下载 | We optimize hierarchies of Time-to-Live (TTL) caches under random network delays. A TTL cache assigns individual eviction timers to cached objects that are usually refreshed upon a hit where upon a mi... |
| PACIFISTA: Conflict Evaluation and Management in Open RAN | Pietro Brach del Prever, Salvatore D'Oro, Leonardo Bonati, Michele Polese, Maria Tsampazi, Heiko Lehmann, Tommaso Melodia | 2024-05-07 | 下载 | The O-RAN ALLIANCE is defining architectures, interfaces, operations, and security requirements for cellular networks based on Open Radio Access Network (RAN) principles. |
| One-Class Classification as GLRT for Jamming Detection in Private 5G Networks | Matteo Varotto, Stefan Valentin, Francesco Ardizzon, Samuele Marzotto, Stefano Tomasin | 2024-05-07 | 下载 | 5G mobile networks are vulnerable to jamming attacks that may jeopardize valuable applications such as industry automation. In this paper, we propose to analyze radio signals with a dedicated device t... |
| Detecting 5G Narrowband Jammers with CNN, k-nearest Neighbors, and Support Vector Machines | Matteo Varotto, Florian Heinrichs, Timo Schuerg, Stefano Tomasin, Stefan Valentin | 2024-05-07 | 下载 | 5G cellular networks are particularly vulnerable against narrowband jammers that target specific control sub-channels in the radio signal. One mitigation approach is to detect such jamming attacks wit... |
| GLIDS: A Global Latency Information Dissemination System | Cyrill Krähenbühl, Seyedali Tabaeiaghdaei, Simon Scherrer, Matthias Frei, Adrian Perrig | 2024-05-07 | 下载 | A recent advance in networking is the deployment of path-aware multipath network architectures, where network endpoints are given multiple network paths to send their data on. |
| Energy-Efficient Deployment of Stateful FaaS Vertical Applications on Edge Data Networks | Claudio Cicconetti, Raffaele Bruno, Andrea Passarella | 2024-05-07 | 下载 | 5G and beyond support the deployment of vertical applications, which is particularly appealing in combination with network slicing and edge computing to create a logically isolated environment for exe... |
| Effect of Realistic Oscillator Phase Noise on the Performance of Cell-Free Massive MIMO Systems | Igor Zhilin, Evgenii Vinogradov, Ian Akyildiz | 2024-05-07 | 下载 | As the demand for 6G technologies continues to grow, the radio access infrastructure is expected to become increasingly dense. Cell-free (CF) Massive MIMO systems provide remarkable flexibility by ena... |
| uTNT: Unikernels for Efficient and Flexible Internet Probing | Maxime Letemple, Gaulthier Gain, Sami Ben Mariem, Laurent Mathy, Benoit Donnet | 2024-05-07 | 下载 | The last twenty years have seen the development and popularity of network measurement infrastructures. Internet measurement platforms have become common and have demonstrated their relevance in Intern... |
| TrimCaching: Parameter-sharing AI Model Caching in Wireless Edge Networks | Guanqiao Qu, Zheng Lin, Fangming Liu, Xianhao Chen, Kaibin Huang | 2024-05-07 | 下载 | Next-generation mobile networks are expected to facilitate fast AI model downloading to end users. By caching models on edge servers, mobile networks can deliver models to end users with low latency, ... |
| Role of Sensing and Computer Vision in 6G Wireless Communications | Seungnyun Kim, Jihoon Moon, Jinhong Kim, Yongjun Ahn, Donghoon Kim, Sunwoo Kim, Kyuhong Shim, Byonghyo Shim | 2024-05-07 | 下载 | Recently, we are witnessing the remarkable progress and widespread adoption of sensing technologies in autonomous driving, robotics, and metaverse. |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| vAttention: Dynamic Memory Management for Serving LLMs without PagedAttention | Ramya Prabhu, Ajay Nayak, Jayashree Mohan, Ramachandran Ramjee, Ashish Panwar | 2024-05-07 | 下载 | PagedAttention is a popular approach for dynamic memory allocation in LLM serving systems. It enables on-demand allocation of GPU memory to mitigate KV cache fragmentation -- a phenomenon that cripple... |
| uTNT: Unikernels for Efficient and Flexible Internet Probing | Maxime Letemple, Gaulthier Gain, Sami Ben Mariem, Laurent Mathy, Benoit Donnet | 2024-05-07 | 下载 | The last twenty years have seen the development and popularity of network measurement infrastructures. Internet measurement platforms have become common and have demonstrated their relevance in Intern... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving | Yujun Lin, Haotian Tang, Shang Yang, Zhekai Zhang, Guangxuan Xiao, Chuang Gan, Song Han | 2024-05-07 | 下载 | Quantization can accelerate large language model (LLM) inference. Going beyond INT8 quantization, the research community is actively exploring even lower precision, such as INT4. |
| QR factorization of ill-conditioned tall-and-skinny matrices on distributed-memory systems | Nenad Mijić, Abhiram Kaushik, Davor Davidović | 2024-05-07 | 下载 | In this paper we present a novel algorithm developed for computing the QR factorisation of extremely ill-conditioned tall-and-skinny matrices on distributed memory systems. |
| Analysis of Markovian Arrivals and Service with Applications to Intermittent Overload | Isaac Grosof, Yige Hong, Mor Harchol-Balter | 2024-05-07 | 下载 | In many important real-world queueing settings, arrival and service rates fluctuate over time. We consider the MAMS system, where the arrival and service rates each vary according to an arbitrary fini... |