Appearance
2024-09-02
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| VLSI Hypergraph Partitioning with Deep Learning | Muhammad Hadir Khan, Bugra Onal, Eren Dogan, Matthew R. Guthaus | 2024-09-02 | 下载 | Partitioning is a known problem in computer science and is critical in chip design workflows, as advancements in this area can significantly influence design quality and efficiency. |
| Duplex: A Device for Large Language Models with Mixture of Experts, Grouped Query Attention, and Continuous Batching | Sungmin Yun, Kwanhee Kyung, Juhwan Cho, Jaewan Choi, Jongmin Kim, Byeongho Kim, Sukhan Lee, Kyomin Sohn, Jung Ho Ahn | 2024-09-02 | 下载 | Large language models (LLMs) have emerged due to their capability to generate high-quality content across diverse contexts. To reduce their explosively increasing demands for computing resources, a mi... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Achieving Byzantine-Resilient Federated Learning via Layer-Adaptive Sparsified Model Aggregation | Jiahao Xu, Zikai Zhang, Rui Hu | 2024-09-02 | 下载 | Federated Learning (FL) enables multiple clients to collaboratively train a model without sharing their local data. Yet the FL system is vulnerable to well-designed Byzantine attacks, which aim to dis... |
| VLSI Hypergraph Partitioning with Deep Learning | Muhammad Hadir Khan, Bugra Onal, Eren Dogan, Matthew R. Guthaus | 2024-09-02 | 下载 | Partitioning is a known problem in computer science and is critical in chip design workflows, as advancements in this area can significantly influence design quality and efficiency. |
| Reward Augmentation in Reinforcement Learning for Testing Distributed Systems | Andrea Borgarelli, Constantin Enea, Rupak Majumdar, Srinidhi Nagendra | 2024-09-02 | 下载 | Bugs in popular distributed protocol implementations have been the source of many downtimes in popular internet services. We describe a randomized testing approach for distributed protocol implementat... |
| How local constraints influence network diameter and applications to LCL generalizations | Nicolas Bousquet, Laurent Feuilloley, Théo Pierron | 2024-09-02 | 下载 | In this paper, we investigate how local rules enforced at every node can influence the topology of a network. More precisely, we establish several results on the diameter of trees as a function of the... |
| GAS: Generative Activation-Aided Asynchronous Split Federated Learning | Jiarong Yang, Yuan Liu | 2024-09-02 | 下载 | Split Federated Learning (SFL) splits and collaboratively trains a shared model between clients and server, where clients transmit activations and client-side models to server for updates. |
| Eliminating Timing Anomalies in Scheduling Periodic Segmented Self-Suspending Tasks with Release Jitter | Ching-Chi Lin, Mario Günzel, Junjie Shi, Tristan Taylan Seidl, Kuan-Hsun Chen, Jian-Jia Chen | 2024-09-02 | 下载 | Ensuring timing guarantees for every individual tasks is critical in real-time systems. Even for periodic tasks, providing timing guarantees for tasks with segmented self-suspending behavior is challe... |
| HexiScale: Accommodating Large Language Model Training over Heterogeneous Environment | Ran Yan, Youhe Jiang, Xiaonan Nie, Fangcheng Fu, Bin Cui, Binhang Yuan | 2024-09-02 | 下载 | Training large language model (LLM) is a computationally intensive task, which is typically conducted in data centers with homogeneous high-performance GPUs. |
| CARIn: Constraint-Aware and Responsive Inference on Heterogeneous Devices for Single- and Multi-DNN Workloads | Ioannis Panopoulos, Stylianos I. Venieris, Iakovos S. Venieris | 2024-09-02 | 下载 | The relentless expansion of deep learning applications in recent years has prompted a pivotal shift toward on-device execution, driven by the urgent need for real-time processing, heightened privacy c... |
| Vortex: Efficient Sample-Free Dynamic Tensor Program Optimization via Hardware-aware Strategy Space Hierarchization | Yangjie Zhou, Honglin Zhu, Qian Qiu, Weihao Cui, Zihan Liu, Cong Guo, Siyuan Feng, Jintao Meng, Haidong Lan, Jingwen Leng, Wenxi Zhu, Minwen Deng | 2024-09-02 | 下载 | Dynamic-shape deep neural networks (DNNs) are rapidly evolving, attracting attention for their ability to handle variable input sizes in real-time applications. |
| LuWu: An End-to-End In-Network Out-of-Core Optimizer for 100B-Scale Model-in-Network Data-Parallel Training on Distributed GPUs | Mo Sun, Zihan Yang, Changyue Liao, Yingtao Li, Fei Wu, Zeke Wang | 2024-09-02 | 下载 | The recent progress made in large language models (LLMs) has brought tremendous application prospects to the world. The growing model size demands LLM training on multiple GPUs, while data parallelism... |
| Rapid GPU-Based Pangenome Graph Layout | Jiajie Li, Jan-Niklas Schmelzle, Yixiao Du, Simon Heumos, Andrea Guarracino, Giulia Guidi, Pjotr Prins, Erik Garrison, Zhiru Zhang | 2024-09-02 | 下载 | Computational Pangenomics is an emerging field that studies genetic variation using a graph structure encompassing multiple genomes. Visualizing pangenome graphs is vital for understanding genome dive... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Online Convex Optimization for On-Board Routing in High-Throughput Satellites | Olivier Bélanger, Jean-Luc Lupien, Olfa Ben Yahia, Stéphane Martel, Antoine Lesage-Landry, Gunes Karabulut Kurt | 2024-09-02 | 下载 | The rise in low Earth orbit (LEO) satellite Internet services has led to increasing demand, often exceeding available data rates and compromising the quality of service. |
| Generating Packet-Level Header Traces Using GNN-powered GAN | Zhen Xu | 2024-09-02 | 下载 | This study presents a novel method combining Graph Neural Networks (GNNs) and Generative Adversarial Networks (GANs) for generating packet-level header traces. |
| Non-local redundancy: Erasure coding and dispersed replicas for robust retrieval in the Swarm peer-to-peer network | Viktor Trón, Viktor Tóth, Callum Toner, Dan Nickless, Dániel A. Nagy, Áron Fischer, György Barabás | 2024-09-02 | 下载 | This paper describes in detail how erasure codes are implemented in the Swarm system. First, in Section 1, we introduce erasure codes, and show how to apply them to files in Swarm (Section 2). |
| DTRAN: A Special Use Case of RAN Optimization using Digital Twin | Caglar Tunc, Kubra Duran, Buse Bilgin, Gokhan Kalem, Berk Canberk | 2024-09-02 | 下载 | The emergence of beyond 5G (B5G) and 6G networks underscores the critical role of advanced computer-aided tools, such as network digital twins (DTs), in fostering autonomous networks and ubiquitous in... |
| Poster: Developing an O-RAN Security Test Lab | Sotiris Michaelides, David Rupprecht, Katharina Kohls | 2024-09-02 | 下载 | Open Radio Access Networks (ORAN) is a new architectural approach, having been proposed only a few years ago, and it is an expansion of the current Next Generation Radio Access Networks (NG-RAN) of 5G... |
| Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach | Wenshuai Liu, Yaru Fu, Yongna Guo, Fu Lee Wang, Wen Sun, Yan Zhang | 2024-09-02 | 下载 | Digital twins (DTs) have emerged as a promising enabler for representing the real-time states of physical worlds and realizing self-sustaining systems. |
| Federated Deep Reinforcement Learning-Based Intelligent Channel Access in Dense Wi-Fi Deployments | Xinyang Du, Xuming Fang, Rong He, Li Yan, Liuming Lu, Chaoming Luo | 2024-09-02 | 下载 | The IEEE 802.11 MAC layer utilizes the Carrier Sense Multiple Access with Collision Avoidance (CSMA/CA) mechanism for channel contention, but dense Wi-Fi deployments often cause high collision rates. |
| Clutter Suppression, Time-Frequency Synchronization, and Sensing Parameter Association in Asynchronous Perceptive Vehicular Networks | Xiao-Yang Wang, Shaoshi Yang, Jianhua Zhang, Christos Masouros, Ping Zhang | 2024-09-02 | 下载 | Significant challenges remain for realizing precise positioning and velocity estimation in perceptive vehicular networks (PVN) enabled by the emerging integrated sensing and communication technology. |
| Windowing Optimization for Fingerprint-Spectrum-Based Passive Sensing in Perceptive Mobile Networks | Xiao-Yang Wang, Shaoshi Yang, Hou-Yu Zhai, Christos Masouros, J. Andrew Zhang | 2024-09-02 | 下载 | Perceptive mobile networks (PMN) have been widely recognized as a pivotal pillar for the sixth generation (6G) mobile communication systems. However, the asynchronicity between transmitters and receiv... |
| Infiltrating the Sky: Data Delay and Overflow Attacks in Earth Observation Constellations | Xiaojian Wang, Ruozhou Yu, Dejun Yang, Guoliang Xue | 2024-09-02 | 下载 | Low Earth Orbit (LEO) Earth Observation (EO) satellites have changed the way we monitor Earth. Acting like moving cameras, EO satellites are formed in constellations with different missions and priori... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| CyberCortex.AI: An AI-based Operating System for Autonomous Robotics and Complex Automation | Sorin Grigorescu, Mihai Zaha | 2024-09-02 | 下载 | The underlying framework for controlling autonomous robots and complex automation applications are Operating Systems (OS) capable of scheduling perception-and-control tasks, as well as providing real-... |