Skip to content

2025-01-12

cs.AR - Architecture

标题作者发布日期PDF摘要
On Optimizing Locality of Graph Transposition on Modern ArchitecturesMohsen Koohi Esfahani, Hans Vandierendonck2025-01-12下载This paper investigates the shared-memory Graph Transposition (GT) problem, a fundamental graph algorithm that is widely used in graph analytics and scientific computing.
COMPASS: A Compiler Framework for Resource-Constrained Crossbar-Array Based In-Memory Deep Learning AcceleratorsJihoon Park, Jeongin Choe, Dohyun Kim, Jae-Joon Kim2025-01-12下载Recently, crossbar array based in-memory accelerators have been gaining interest due to their high throughput and energy efficiency. While software and compiler support for the in-memory accelerators ...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
On Optimizing Locality of Graph Transposition on Modern ArchitecturesMohsen Koohi Esfahani, Hans Vandierendonck2025-01-12下载This paper investigates the shared-memory Graph Transposition (GT) problem, a fundamental graph algorithm that is widely used in graph analytics and scientific computing.
CoCoI: Distributed Coded Inference System for Straggler MitigationXing Liu, Chao Huang, Ming Tang2025-01-12下载Convolutional neural networks (CNNs) are widely applied in real-time applications on resource-constrained devices. To accelerate CNN inference, prior works proposed to distribute the inference workloa...
COMPASS: A Compiler Framework for Resource-Constrained Crossbar-Array Based In-Memory Deep Learning AcceleratorsJihoon Park, Jeongin Choe, Dohyun Kim, Jae-Joon Kim2025-01-12下载Recently, crossbar array based in-memory accelerators have been gaining interest due to their high throughput and energy efficiency. While software and compiler support for the in-memory accelerators ...
Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache ManagementLiu Qianli, Hong Zicong, Chen Fahao, Li Peng, Guo Song2025-01-12下载Serving large language models (LLMs) for massive users is challenged by the significant memory footprint of the transient state, known as the key-value (KV) cache, which scales with sequence length an...
AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous CloudsYinfang Chen, Manish Shetty, Gagan Somashekar, Minghua Ma, Yogesh Simmhan, Jonathan Mace, Chetan Bansal, Rujia Wang, Saravan Rajmohan2025-01-12下载AI for IT Operations (AIOps) aims to automate complex operational tasks, such as fault localization and root cause analysis, to reduce human workload and minimize customer impact.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
AdaSlicing: Adaptive Online Network Slicing under Continual Network Dynamics in Open Radio Access NetworksMing Zhao, Yuru Zhang, Qiang Liu, Ahan Kak, Nakjung Choi2025-01-12下载Open radio access networks (e.g., O-RAN) facilitate fine-grained control (e.g., near-RT RIC) in next-generation networks, necessitating advanced AI/ML techniques in handling online resource orchestrat...
Real-Time Neural-Enhancement for Online Cloud GamingShan Jiang, Zhenhua Han, Haisheng Tan, Xinyang Jiang, Yifan Yang, Xiaoxi Zhang, Hongqiu Ni, Yuqing Yang, Xiang-Yang Li2025-01-12下载Online Cloud gaming demands real-time, high-quality video transmission across variable wide-area networks (WANs). Neural-enhanced video transmission algorithms employing super-resolution (SR) for vide...
Average Reward Reinforcement Learning for Wireless Radio Resource ManagementKun Yang, Jing Yang, Cong Shen2025-01-12下载In this paper, we address a crucial but often overlooked issue in applying reinforcement learning (RL) to radio resource management (RRM) in wireless communications: the mismatch between the discounte...
Optimizing Age of Information without Knowing the Age of InformationZhuoyi Zhao, Igor Kadota2025-01-12下载Consider a network where a wireless base station (BS) connects multiple source-destination pairs. Packets from each source are generated according to a renewal process and are enqueued in a single-pac...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Symbol Resolution MatRs: Make it Fast and Observable with Stable LinkingFarid Zakaria, Andrew Quinn, Thomas R. W. Scogland2025-01-12下载Dynamic linking is the standard mechanism for using external dependencies since it enables code reuse, streamlines software updates, and reduces disk/network use.

cs.PF - Performance

标题作者发布日期PDF摘要
On Optimizing Locality of Graph Transposition on Modern ArchitecturesMohsen Koohi Esfahani, Hans Vandierendonck2025-01-12下载This paper investigates the shared-memory Graph Transposition (GT) problem, a fundamental graph algorithm that is widely used in graph analytics and scientific computing.

基于 VitePress 构建