Skip to content

2024-09-02

cs.AR - Architecture

标题作者发布日期PDF摘要
VLSI Hypergraph Partitioning with Deep LearningMuhammad Hadir Khan, Bugra Onal, Eren Dogan, Matthew R. Guthaus2024-09-02下载Partitioning is a known problem in computer science and is critical in chip design workflows, as advancements in this area can significantly influence design quality and efficiency.
Duplex: A Device for Large Language Models with Mixture of Experts, Grouped Query Attention, and Continuous BatchingSungmin Yun, Kwanhee Kyung, Juhwan Cho, Jaewan Choi, Jongmin Kim, Byeongho Kim, Sukhan Lee, Kyomin Sohn, Jung Ho Ahn2024-09-02下载Large language models (LLMs) have emerged due to their capability to generate high-quality content across diverse contexts. To reduce their explosively increasing demands for computing resources, a mi...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Achieving Byzantine-Resilient Federated Learning via Layer-Adaptive Sparsified Model AggregationJiahao Xu, Zikai Zhang, Rui Hu2024-09-02下载Federated Learning (FL) enables multiple clients to collaboratively train a model without sharing their local data. Yet the FL system is vulnerable to well-designed Byzantine attacks, which aim to dis...
VLSI Hypergraph Partitioning with Deep LearningMuhammad Hadir Khan, Bugra Onal, Eren Dogan, Matthew R. Guthaus2024-09-02下载Partitioning is a known problem in computer science and is critical in chip design workflows, as advancements in this area can significantly influence design quality and efficiency.
Reward Augmentation in Reinforcement Learning for Testing Distributed SystemsAndrea Borgarelli, Constantin Enea, Rupak Majumdar, Srinidhi Nagendra2024-09-02下载Bugs in popular distributed protocol implementations have been the source of many downtimes in popular internet services. We describe a randomized testing approach for distributed protocol implementat...
How local constraints influence network diameter and applications to LCL generalizationsNicolas Bousquet, Laurent Feuilloley, Théo Pierron2024-09-02下载In this paper, we investigate how local rules enforced at every node can influence the topology of a network. More precisely, we establish several results on the diameter of trees as a function of the...
GAS: Generative Activation-Aided Asynchronous Split Federated LearningJiarong Yang, Yuan Liu2024-09-02下载Split Federated Learning (SFL) splits and collaboratively trains a shared model between clients and server, where clients transmit activations and client-side models to server for updates.
Eliminating Timing Anomalies in Scheduling Periodic Segmented Self-Suspending Tasks with Release JitterChing-Chi Lin, Mario Günzel, Junjie Shi, Tristan Taylan Seidl, Kuan-Hsun Chen, Jian-Jia Chen2024-09-02下载Ensuring timing guarantees for every individual tasks is critical in real-time systems. Even for periodic tasks, providing timing guarantees for tasks with segmented self-suspending behavior is challe...
HexiScale: Accommodating Large Language Model Training over Heterogeneous EnvironmentRan Yan, Youhe Jiang, Xiaonan Nie, Fangcheng Fu, Bin Cui, Binhang Yuan2024-09-02下载Training large language model (LLM) is a computationally intensive task, which is typically conducted in data centers with homogeneous high-performance GPUs.
CARIn: Constraint-Aware and Responsive Inference on Heterogeneous Devices for Single- and Multi-DNN WorkloadsIoannis Panopoulos, Stylianos I. Venieris, Iakovos S. Venieris2024-09-02下载The relentless expansion of deep learning applications in recent years has prompted a pivotal shift toward on-device execution, driven by the urgent need for real-time processing, heightened privacy c...
Vortex: Efficient Sample-Free Dynamic Tensor Program Optimization via Hardware-aware Strategy Space HierarchizationYangjie Zhou, Honglin Zhu, Qian Qiu, Weihao Cui, Zihan Liu, Cong Guo, Siyuan Feng, Jintao Meng, Haidong Lan, Jingwen Leng, Wenxi Zhu, Minwen Deng2024-09-02下载Dynamic-shape deep neural networks (DNNs) are rapidly evolving, attracting attention for their ability to handle variable input sizes in real-time applications.
LuWu: An End-to-End In-Network Out-of-Core Optimizer for 100B-Scale Model-in-Network Data-Parallel Training on Distributed GPUsMo Sun, Zihan Yang, Changyue Liao, Yingtao Li, Fei Wu, Zeke Wang2024-09-02下载The recent progress made in large language models (LLMs) has brought tremendous application prospects to the world. The growing model size demands LLM training on multiple GPUs, while data parallelism...
Rapid GPU-Based Pangenome Graph LayoutJiajie Li, Jan-Niklas Schmelzle, Yixiao Du, Simon Heumos, Andrea Guarracino, Giulia Guidi, Pjotr Prins, Erik Garrison, Zhiru Zhang2024-09-02下载Computational Pangenomics is an emerging field that studies genetic variation using a graph structure encompassing multiple genomes. Visualizing pangenome graphs is vital for understanding genome dive...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Online Convex Optimization for On-Board Routing in High-Throughput SatellitesOlivier Bélanger, Jean-Luc Lupien, Olfa Ben Yahia, Stéphane Martel, Antoine Lesage-Landry, Gunes Karabulut Kurt2024-09-02下载The rise in low Earth orbit (LEO) satellite Internet services has led to increasing demand, often exceeding available data rates and compromising the quality of service.
Generating Packet-Level Header Traces Using GNN-powered GANZhen Xu2024-09-02下载This study presents a novel method combining Graph Neural Networks (GNNs) and Generative Adversarial Networks (GANs) for generating packet-level header traces.
Non-local redundancy: Erasure coding and dispersed replicas for robust retrieval in the Swarm peer-to-peer networkViktor Trón, Viktor Tóth, Callum Toner, Dan Nickless, Dániel A. Nagy, Áron Fischer, György Barabás2024-09-02下载This paper describes in detail how erasure codes are implemented in the Swarm system. First, in Section 1, we introduce erasure codes, and show how to apply them to files in Swarm (Section 2).
DTRAN: A Special Use Case of RAN Optimization using Digital TwinCaglar Tunc, Kubra Duran, Buse Bilgin, Gokhan Kalem, Berk Canberk2024-09-02下载The emergence of beyond 5G (B5G) and 6G networks underscores the critical role of advanced computer-aided tools, such as network digital twins (DTs), in fostering autonomous networks and ubiquitous in...
Poster: Developing an O-RAN Security Test LabSotiris Michaelides, David Rupprecht, Katharina Kohls2024-09-02下载Open Radio Access Networks (ORAN) is a new architectural approach, having been proposed only a few years ago, and it is an expansion of the current Next Generation Radio Access Networks (NG-RAN) of 5G...
Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning ApproachWenshuai Liu, Yaru Fu, Yongna Guo, Fu Lee Wang, Wen Sun, Yan Zhang2024-09-02下载Digital twins (DTs) have emerged as a promising enabler for representing the real-time states of physical worlds and realizing self-sustaining systems.
Federated Deep Reinforcement Learning-Based Intelligent Channel Access in Dense Wi-Fi DeploymentsXinyang Du, Xuming Fang, Rong He, Li Yan, Liuming Lu, Chaoming Luo2024-09-02下载The IEEE 802.11 MAC layer utilizes the Carrier Sense Multiple Access with Collision Avoidance (CSMA/CA) mechanism for channel contention, but dense Wi-Fi deployments often cause high collision rates.
Clutter Suppression, Time-Frequency Synchronization, and Sensing Parameter Association in Asynchronous Perceptive Vehicular NetworksXiao-Yang Wang, Shaoshi Yang, Jianhua Zhang, Christos Masouros, Ping Zhang2024-09-02下载Significant challenges remain for realizing precise positioning and velocity estimation in perceptive vehicular networks (PVN) enabled by the emerging integrated sensing and communication technology.
Windowing Optimization for Fingerprint-Spectrum-Based Passive Sensing in Perceptive Mobile NetworksXiao-Yang Wang, Shaoshi Yang, Hou-Yu Zhai, Christos Masouros, J. Andrew Zhang2024-09-02下载Perceptive mobile networks (PMN) have been widely recognized as a pivotal pillar for the sixth generation (6G) mobile communication systems. However, the asynchronicity between transmitters and receiv...
Infiltrating the Sky: Data Delay and Overflow Attacks in Earth Observation ConstellationsXiaojian Wang, Ruozhou Yu, Dejun Yang, Guoliang Xue2024-09-02下载Low Earth Orbit (LEO) Earth Observation (EO) satellites have changed the way we monitor Earth. Acting like moving cameras, EO satellites are formed in constellations with different missions and priori...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
CyberCortex.AI: An AI-based Operating System for Autonomous Robotics and Complex AutomationSorin Grigorescu, Mihai Zaha2024-09-02下载The underlying framework for controlling autonomous robots and complex automation applications are Operating Systems (OS) capable of scheduling perception-and-control tasks, as well as providing real-...

基于 VitePress 构建