Skip to content

2024-04-09

cs.AR - Architecture

标题作者发布日期PDF摘要
Modeling Analog-Digital-Converter Energy and Area for Compute-In-Memory Accelerator DesignTanner Andrulis, Ruicong Chen, Hae-Seung Lee, Joel S. Emer, Vivienne Sze2024-04-09下载Analog Compute-in-Memory (CiM) accelerators use analog-digital converters (ADCs) to read the analog values that they compute. ADCs can consume significant energy and area, so architecture-level ADC de...
WaSP: Warp Scheduling to Mimic Prefetching in Graphics WorkloadsDiya Joseph, Juan Luis Aragón, Joan-Manuel Parcerisa, Antonio Gonzalez2024-04-09下载Contemporary GPUs are designed to handle long-latency operations effectively; however, challenges such as core occupancy (number of warps in a core) and pipeline width can impede their latency managem...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
NotNets: Accelerating Microservices by Bypassing the NetworkPeter Alvaro, Matthew Adiletta, Adrian Cockroft, Frank Hady, Ramesh Illikkal, Esteban Ramos, James Tsai, Robert Soulé2024-04-09下载Remote procedure calls are the workhorse of distributed systems. However, as software engineering trends, such as micro-services and serverless computing, push applications towards ever finer-grained ...
Scaling to 32 GPUs on a Novel Composable System ArchitectureJohn Ihnotic2024-04-09下载The development of composable systems architecture marks a significant shift in resource allocation and utilization within data centers. This paper presents a composable architecture scaling up to 32 ...
Analysis of Distributed Algorithms for Big-dataRajendra Purohit, K R Chowdhary, S D Purohit2024-04-09下载The parallel and distributed processing are becoming de facto industry standard, and a large part of the current research is targeted on how to make computing scalable and distributed, dynamically, wi...
Software-based Security Framework for Edge and Mobile IoTJosé Cecílio, Alan Oliveira de Sá, André Souto2024-04-09下载With the proliferation of Internet of Things (IoT) devices, ensuring secure communications has become imperative. Due to their low cost and embedded nature, many of these devices operate with computat...
Aggressive or Imperceptible, or Both: Network Pruning Assisted Hybrid Byzantines in Federated LearningEmre Ozfatura, Kerem Ozfatura, Baturalp Buyukates, Mert Coskuner, Alptekin Kupcu, Deniz Gunduz2024-04-09下载In federated learning (FL), profiling and verifying each client is inherently difficult, which introduces a significant security vulnerability: malicious clients, commonly referred to as Byzantines, c...
A Comprehensive Benchmarking Analysis of Fault Recovery in Stream Processing FrameworksAdriano Vogel, Sören Henning, Esteban Perez-Wohlfeil, Otmar Ertl, Rick Rabiser2024-04-09下载Nowadays, several software systems rely on stream processing architectures to deliver scalable performance and handle large volumes of data in near real-time.
Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive SurveyFeng Liang, Zhen Zhang, Haifeng Lu, Victor C. M. Leung, Yanyi Guo, Xiping Hu2024-04-09下载With the rapid growth in the volume of data sets, models, and devices in the domain of deep learning, there is increasing attention on large-scale distributed deep learning.
A Systematic Literature Survey of Sparse Matrix-Vector MultiplicationJianhua Gao, Bingjie Liu, Weixing Ji, Hua Huang2024-04-09下载Sparse matrix-vector multiplication (SpMV) is a crucial computing kernel with widespread applications in iterative algorithms. Over the past decades, research on SpMV optimization has made remarkable ...
A Survey of Distributed Graph Algorithms on Massive GraphsLingkai Meng, Yu Shao, Long Yuan, Longbin Lai, Peng Cheng, Xue Li, Wenyuan Yu, Wenjie Zhang, Xuemin Lin, Jingren Zhou2024-04-09下载Distributed processing of large-scale graph data has many practical applications and has been widely studied. In recent years, a lot of distributed graph processing frameworks and algorithms have been...
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence AnalysisGuangchen Lan, Dong-Jun Han, Abolfazl Hashemi, Vaneet Aggarwal, Christopher G. Brinton2024-04-09下载To improve the efficiency of reinforcement learning (RL), we propose a novel asynchronous federated reinforcement learning (FedRL) framework termed AFedPG, which constructs a global model through coll...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Resource Management in RIS-Assisted Rate Splitting Multiple Access for Next Generation (xG) Wireless Communications: Models, State-of-the-Art, and Future DirectionsIbrahim Aboumahmoud, Ekram Hossain, \Amine Mezghani2024-04-09下载Next generation wireless networks require more stringent performance levels. New technologies such as Reconfigurable intelligent surfaces (RISs) and rate-splitting multiple access (RSMA) are candida...
Deterministic and Probabilistic P4-Enabled Lightweight In-Band Network TelemetryKonstantinos Papadopoulos, Panagiotis Papadimitriou, Chrysa Papagianni2024-04-09下载In-band network telemetry (INT), empowered by programmable dataplanes such as P4, comprises a viable approach to network monitoring and telemetry analysis.
Dynamic D2D-Assisted Federated Learning over O-RAN: Performance Analysis, MAC Scheduler, and Asymmetric User SelectionPayam Abdisarabshali, Kwang Taik Kim, Michael Langberg, Weifeng Su, Seyyedali Hosseinalipour2024-04-09下载Existing studies on federated learning (FL) are mostly focused on system orchestration for static snapshots of the network and making static control decisions (e.g., spectrum allocation).
Integration of Computer Networks and Artificial Neural Networks for an AI-based Network OperatorBinbin Wu, Jingyu Xu, Yifan Zhang, Bo Liu, Yulu Gong, Jiaxin Huang2024-04-09下载This paper proposes an integrated approach combining computer networks and artificial neural networks to construct an intelligent network operator, functioning as an AI model.
DDPG-E2E: A Novel Policy Gradient Approach for End-to-End Communication SystemsBolun Zhang, Nguyen Van Huynh, Dinh Thai Hoang, Diep N. Nguyen, Quoc-Viet Pham2024-04-09下载The End-to-end (E2E) learning-based approach has great potential to reshape the existing communication systems by replacing the transceivers with deep neural networks.
SUPPLY: Sustainable multi-UAV Performance-aware Placement Algorithm for Flying NetworksPedro Ribeiro, André Coelho, Rui Campos2024-04-09下载Unmanned Aerial Vehicles (UAVs) are used for a wide range of applications. Due to characteristics such as the ability to hover and carry cargo on-board, rotary-wing UAVs have been considered suitable ...
Streamlined Transmission: A Semantic-Aware XR Deployment Framework Enhanced by Generative AIWanting Yang, Zehui Xiong, Tony Q. S. Quek, Xuemin Shen2024-04-09下载In the era of 6G, featuring compelling visions of digital twins and metaverses, Extended Reality (XR) has emerged as a vital conduit connecting the digital and physical realms, garnering widespread in...
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence AnalysisGuangchen Lan, Dong-Jun Han, Abolfazl Hashemi, Vaneet Aggarwal, Christopher G. Brinton2024-04-09下载To improve the efficiency of reinforcement learning (RL), we propose a novel asynchronous federated reinforcement learning (FedRL) framework termed AFedPG, which constructs a global model through coll...

基于 VitePress 构建