2025-03-22

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Architectural and System Implications of CXL-enabled Tiered Memory	Yujie Yang, Lingfeng Xiang, Peiran Du, Zhen Lin, Weishu Deng, Ren Wang, Andrey Kudryavtsev, Louis Ko, Hui Lu, Jia Rao	2025-03-22	下载	Memory disaggregation is an emerging technology that decouples memory from traditional memory buses, enabling independent scaling of compute and memory.
Multiport Support for Vortex OpenGPU Memory Hierarchy	Injae Shin, Blaise Tine	2025-03-22	下载	Modern day applications have grown in size and require more computational power. The rise of machine learning and AI increased the need for parallel computation, which has increased the need for GPGPU...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
CRDT-Based Game State Synchronization in Peer-to-Peer VR	Abel Dantas, Carlos Baquero	2025-03-22	下载	Virtual presence demands ultra-low latency, a factor that centralized architectures, by their nature, cannot minimize. Local peer-to-peer architectures offer a compelling alternative, but also pose un...
Neutron particle transport 3D method of characteristic Multi GPU platform Parallel Computing	Faguo Zhou, Shunde Li, Rong Xue, Lingkun Bu, Ningming Nie, Peng Shi, Jue Wang, Yun Hu, Zongguo Wang, Yangang Wang, Qinmeng Yang, Miao Yu	2025-03-22	下载	Three-dimensional neutron transport calculations using the Method of Characteristics (MOC) are highly regarded for their exceptional computational efficiency, precision, and stability.
PipeBoost: Resilient Pipelined Architecture for Fast Serverless LLM Scaling	Chongpeng Liu, Xiaojian Liao, Hancheng Liu, Limin Xiao, Jianxin Li	2025-03-22	下载	This paper presents PipeBoost, a low-latency LLM serving system for multi-GPU (serverless) clusters, which can rapidly launch inference services in response to bursty requests without preemptively ove...
Sense4FL: Vehicular Crowdsensing Enhanced Federated Learning for Object Detection in Autonomous Driving	Yanan Ma, Senkang Hu, Zhengru Fang, Yun Ji, Yiqin Deng, Yuguang Fang	2025-03-22	下载	To accommodate constantly changing road conditions, real-time vision model training is essential for autonomous driving (AD). Federated learning (FL) serves as a promising paradigm to enable autonomou...
Using a Market Economy to Provision Compute Resources Across Planet-wide Clusters	Murray Stokely, Jim Winget, Ed Keyes, Carrie Grimes, Benjamin Yolken	2025-03-22	下载	We present a practical, market-based solution to the resource provisioning problem in a set of heterogeneous resource clusters. We focus on provisioning rather than immediate scheduling decisions to a...
THAPI: Tracing Heterogeneous APIs	Solomon Bekele, Aurelio Vivas, Thomas Applencourt, Servesh Muralidharan, Bryce Allen, Kazutomo Yoshiiinst, Swann Perarnau, Brice Videau	2025-03-22	下载	As we reach exascale, production High Performance Computing (HPC) systems are increasing in complexity. These systems now comprise multiple heterogeneous computing components (CPUs and GPUs) utilized ...
Time- and Space-Optimal Silent Self-Stabilizing Exact Majority in Population Protocols	Haruki Kanaya, Ryota Eguchi, Taisho Sasada, Fukuhito Ooshita, Michiko Inoue	2025-03-22	下载	We address the self-stabilizing exact majority problem in the population protocol model, introduced by Angluin, Aspnes, Diamadi, Fischer, and Peralta (2004).
A Generative Caching System for Large Language Models	Arun Iyengar, Ashish Kundu, Ramana Kompella, Sai Nandan Mamidi	2025-03-22	下载	Caching has the potential to be of significant benefit for accessing large language models (LLMs) due to their high latencies which typically range from a small number of seconds to well over a minute...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Detecting and Mitigating DDoS Attacks with AI: A Survey	Alexandru Apostu, Silviu Gheorghe, Andrei Hîji, Nicolae Cleju, Andrei Pătraşcu, Cristian Rusu, Radu Ionescu, Paul Irofti	2025-03-22	下载	Distributed Denial of Service attacks represent an active cybersecurity research problem. Recent research shifted from static rule-based defenses towards AI-based detection and mitigation.
CP-AgentNet: Autonomous and Explainable Communication Protocol Design Using Generative Agents	Dae Cheol Kwon, Xinyu Zhang	2025-03-22	下载	Although DRL (deep reinforcement learning) has emerged as a powerful tool for making better decisions than existing hand-crafted communication protocols, it faces significant limitations: 1) Selecting...
Revisiting Outage for Edge Inference Systems	Zhanwei Wang, Qunsong Zeng, Haotian Zheng, Kaibin Huang	2025-03-22	下载	One of the key missions of sixth-generation (6G) mobile networks is to deploy large-scale artificial intelligence (AI) models at the network edge to provide remote-inference services for edge devices.
RAISE: Optimizing RIS Placement to Maximize Task Throughput in Multi-Server Vehicular Edge Computing	Yanan Ma, Zhengru Fang, Longzhi Yuan, Yiqin Deng, Xianhao Chen, Yuguang Fang	2025-03-22	下载	Given the limited computing capabilities on autonomous vehicles, onboard processing of large volumes of latency-sensitive tasks presents significant challenges.
Conditional Diffusion Model with OOD Mitigation as High-Dimensional Offline Resource Allocation Planner in Clustered Ad Hoc Networks	Kechen Meng, Sinuo Zhang, Rongpeng Li, Chan Wang, Ming Lei, Zhifeng Zhao	2025-03-22	下载	Due to network delays and scalability limitations, clustered ad hoc networks widely adopt Reinforcement Learning (RL) for on-demand resource allocation.
A Generative Caching System for Large Language Models	Arun Iyengar, Ashish Kundu, Ramana Kompella, Sai Nandan Mamidi	2025-03-22	下载	Caching has the potential to be of significant benefit for accessing large language models (LLMs) due to their high latencies which typically range from a small number of seconds to well over a minute...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
LEMIX: Enabling Testing of Embedded Applications as Linux Applications (Extended Report)	Sai Ritvik Tanksalkar, Siddharth Muralee, Srihari Danduri, Paschal Amusuo, Antonio Bianchi, James C Davis, Aravind Kumar Machiry	2025-03-22	下载	Dynamic analysis, through rehosting, is an important capability for security assessment in embedded systems software. Existing rehosting techniques aim to provide high-fidelity execution by accurately...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Energy-Aware LLMs: A step towards sustainable AI for downstream applications	Nguyen Phuc Tran, Brigitte Jaumard, Oscar Delgado	2025-03-22	下载	Advanced Large Language Models (LLMs) have revolutionized various fields, including communication networks, sparking an innovation wave that has led to new applications and services, and significantly...
THAPI: Tracing Heterogeneous APIs	Solomon Bekele, Aurelio Vivas, Thomas Applencourt, Servesh Muralidharan, Bryce Allen, Kazutomo Yoshiiinst, Swann Perarnau, Brice Videau	2025-03-22	下载	As we reach exascale, production High Performance Computing (HPC) systems are increasing in complexity. These systems now comprise multiple heterogeneous computing components (CPUs and GPUs) utilized ...