2025-03-22

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Architectural and System Implications of CXL-enabled Tiered Memory | Yujie Yang, Lingfeng Xiang, Peiran Du, Zhen Lin, Weishu Deng, Ren Wang, Andrey Kudryavtsev, Louis Ko, Hui Lu, Jia Rao | 2025-03-22 | Download | Memory disaggregation is an emerging technology that decouples memory from traditional memory buses, enabling independent scaling of compute and memory. |
| Multiport Support for Vortex OpenGPU Memory Hierarchy | Injae Shin, Blaise Tine | 2025-03-22 | Download | Modern day applications have grown in size and require more computational power. The rise of machine learning and AI increased the need for parallel computation, which has increased the need for GPGPU... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| CRDT-Based Game State Synchronization in Peer-to-Peer VR | Abel Dantas, Carlos Baquero | 2025-03-22 | Download | Virtual presence demands ultra-low latency, a factor that centralized architectures, by their nature, cannot minimize. Local peer-to-peer architectures offer a compelling alternative, but also pose un... |
| Neutron particle transport 3D method of characteristic Multi GPU platform Parallel Computing | Faguo Zhou, Shunde Li, Rong Xue, Lingkun Bu, Ningming Nie, Peng Shi, Jue Wang, Yun Hu, Zongguo Wang, Yangang Wang, Qinmeng Yang, Miao Yu | 2025-03-22 | Download | Three-dimensional neutron transport calculations using the Method of Characteristics (MOC) are highly regarded for their exceptional computational efficiency, precision, and stability. |
| PipeBoost: Resilient Pipelined Architecture for Fast Serverless LLM Scaling | Chongpeng Liu, Xiaojian Liao, Hancheng Liu, Limin Xiao, Jianxin Li | 2025-03-22 | Download | This paper presents PipeBoost, a low-latency LLM serving system for multi-GPU (serverless) clusters, which can rapidly launch inference services in response to bursty requests without preemptively ove... |
| Sense4FL: Vehicular Crowdsensing Enhanced Federated Learning for Object Detection in Autonomous Driving | Yanan Ma, Senkang Hu, Zhengru Fang, Yun Ji, Yiqin Deng, Yuguang Fang | 2025-03-22 | Download | To accommodate constantly changing road conditions, real-time vision model training is essential for autonomous driving (AD). Federated learning (FL) serves as a promising paradigm to enable autonomou... |
| Using a Market Economy to Provision Compute Resources Across Planet-wide Clusters | Murray Stokely, Jim Winget, Ed Keyes, Carrie Grimes, Benjamin Yolken | 2025-03-22 | Download | We present a practical, market-based solution to the resource provisioning problem in a set of heterogeneous resource clusters. We focus on provisioning rather than immediate scheduling decisions to a... |
| THAPI: Tracing Heterogeneous APIs | Solomon Bekele, Aurelio Vivas, Thomas Applencourt, Servesh Muralidharan, Bryce Allen, Kazutomo Yoshii, Swann Perarnau, Brice Videau | 2025-03-22 | Download | As we reach exascale, production High Performance Computing (HPC) systems are increasing in complexity. These systems now comprise multiple heterogeneous computing components (CPUs and GPUs) utilized ... |
| Time- and Space-Optimal Silent Self-Stabilizing Exact Majority in Population Protocols | Haruki Kanaya, Ryota Eguchi, Taisho Sasada, Fukuhito Ooshita, Michiko Inoue | 2025-03-22 | Download | We address the self-stabilizing exact majority problem in the population protocol model, introduced by Angluin, Aspnes, Diamadi, Fischer, and Peralta (2004). |
| A Generative Caching System for Large Language Models | Arun Iyengar, Ashish Kundu, Ramana Kompella, Sai Nandan Mamidi | 2025-03-22 | Download | Caching has the potential to be of significant benefit for accessing large language models (LLMs) due to their high latencies which typically range from a small number of seconds to well over a minute... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Detecting and Mitigating DDoS Attacks with AI: A Survey | Alexandru Apostu, Silviu Gheorghe, Andrei Hîji, Nicolae Cleju, Andrei Pătraşcu, Cristian Rusu, Radu Ionescu, Paul Irofti | 2025-03-22 | Download | Distributed Denial of Service attacks represent an active cybersecurity research problem. Recent research shifted from static rule-based defenses towards AI-based detection and mitigation. |
| CP-AgentNet: Autonomous and Explainable Communication Protocol Design Using Generative Agents | Dae Cheol Kwon, Xinyu Zhang | 2025-03-22 | Download | Although DRL (deep reinforcement learning) has emerged as a powerful tool for making better decisions than existing hand-crafted communication protocols, it faces significant limitations: 1) Selecting... |
| Revisiting Outage for Edge Inference Systems | Zhanwei Wang, Qunsong Zeng, Haotian Zheng, Kaibin Huang | 2025-03-22 | Download | One of the key missions of sixth-generation (6G) mobile networks is to deploy large-scale artificial intelligence (AI) models at the network edge to provide remote-inference services for edge devices. |
| RAISE: Optimizing RIS Placement to Maximize Task Throughput in Multi-Server Vehicular Edge Computing | Yanan Ma, Zhengru Fang, Longzhi Yuan, Yiqin Deng, Xianhao Chen, Yuguang Fang | 2025-03-22 | Download | Given the limited computing capabilities on autonomous vehicles, onboard processing of large volumes of latency-sensitive tasks presents significant challenges. |
| Conditional Diffusion Model with OOD Mitigation as High-Dimensional Offline Resource Allocation Planner in Clustered Ad Hoc Networks | Kechen Meng, Sinuo Zhang, Rongpeng Li, Chan Wang, Ming Lei, Zhifeng Zhao | 2025-03-22 | Download | Due to network delays and scalability limitations, clustered ad hoc networks widely adopt Reinforcement Learning (RL) for on-demand resource allocation. |
| A Generative Caching System for Large Language Models | Arun Iyengar, Ashish Kundu, Ramana Kompella, Sai Nandan Mamidi | 2025-03-22 | Download | Caching has the potential to be of significant benefit for accessing large language models (LLMs) due to their high latencies which typically range from a small number of seconds to well over a minute... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| LEMIX: Enabling Testing of Embedded Applications as Linux Applications (Extended Report) | Sai Ritvik Tanksalkar, Siddharth Muralee, Srihari Danduri, Paschal Amusuo, Antonio Bianchi, James C Davis, Aravind Kumar Machiry | 2025-03-22 | Download | Dynamic analysis, through rehosting, is an important capability for security assessment in embedded systems software. Existing rehosting techniques aim to provide high-fidelity execution by accurately... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Energy-Aware LLMs: A step towards sustainable AI for downstream applications | Nguyen Phuc Tran, Brigitte Jaumard, Oscar Delgado | 2025-03-22 | Download | Advanced Large Language Models (LLMs) have revolutionized various fields, including communication networks, sparking an innovation wave that has led to new applications and services, and significantly... |
| THAPI: Tracing Heterogeneous APIs | Solomon Bekele, Aurelio Vivas, Thomas Applencourt, Servesh Muralidharan, Bryce Allen, Kazutomo Yoshii, Swann Perarnau, Brice Videau | 2025-03-22 | Download | As we reach exascale, production High Performance Computing (HPC) systems are increasing in complexity. These systems now comprise multiple heterogeneous computing components (CPUs and GPUs) utilized ... |
