Appearance
2025-03-22
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Architectural and System Implications of CXL-enabled Tiered Memory | Yujie Yang, Lingfeng Xiang, Peiran Du, Zhen Lin, Weishu Deng, Ren Wang, Andrey Kudryavtsev, Louis Ko, Hui Lu, Jia Rao | 2025-03-22 | 下载 | Memory disaggregation is an emerging technology that decouples memory from traditional memory buses, enabling independent scaling of compute and memory. |
| Multiport Support for Vortex OpenGPU Memory Hierarchy | Injae Shin, Blaise Tine | 2025-03-22 | 下载 | Modern day applications have grown in size and require more computational power. The rise of machine learning and AI increased the need for parallel computation, which has increased the need for GPGPU... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| CRDT-Based Game State Synchronization in Peer-to-Peer VR | Abel Dantas, Carlos Baquero | 2025-03-22 | 下载 | Virtual presence demands ultra-low latency, a factor that centralized architectures, by their nature, cannot minimize. Local peer-to-peer architectures offer a compelling alternative, but also pose un... |
| Neutron particle transport 3D method of characteristic Multi GPU platform Parallel Computing | Faguo Zhou, Shunde Li, Rong Xue, Lingkun Bu, Ningming Nie, Peng Shi, Jue Wang, Yun Hu, Zongguo Wang, Yangang Wang, Qinmeng Yang, Miao Yu | 2025-03-22 | 下载 | Three-dimensional neutron transport calculations using the Method of Characteristics (MOC) are highly regarded for their exceptional computational efficiency, precision, and stability. |
| PipeBoost: Resilient Pipelined Architecture for Fast Serverless LLM Scaling | Chongpeng Liu, Xiaojian Liao, Hancheng Liu, Limin Xiao, Jianxin Li | 2025-03-22 | 下载 | This paper presents PipeBoost, a low-latency LLM serving system for multi-GPU (serverless) clusters, which can rapidly launch inference services in response to bursty requests without preemptively ove... |
| Sense4FL: Vehicular Crowdsensing Enhanced Federated Learning for Object Detection in Autonomous Driving | Yanan Ma, Senkang Hu, Zhengru Fang, Yun Ji, Yiqin Deng, Yuguang Fang | 2025-03-22 | 下载 | To accommodate constantly changing road conditions, real-time vision model training is essential for autonomous driving (AD). Federated learning (FL) serves as a promising paradigm to enable autonomou... |
| Using a Market Economy to Provision Compute Resources Across Planet-wide Clusters | Murray Stokely, Jim Winget, Ed Keyes, Carrie Grimes, Benjamin Yolken | 2025-03-22 | 下载 | We present a practical, market-based solution to the resource provisioning problem in a set of heterogeneous resource clusters. We focus on provisioning rather than immediate scheduling decisions to a... |
| THAPI: Tracing Heterogeneous APIs | Solomon Bekele, Aurelio Vivas, Thomas Applencourt, Servesh Muralidharan, Bryce Allen, Kazutomo Yoshiiinst, Swann Perarnau, Brice Videau | 2025-03-22 | 下载 | As we reach exascale, production High Performance Computing (HPC) systems are increasing in complexity. These systems now comprise multiple heterogeneous computing components (CPUs and GPUs) utilized ... |
| Time- and Space-Optimal Silent Self-Stabilizing Exact Majority in Population Protocols | Haruki Kanaya, Ryota Eguchi, Taisho Sasada, Fukuhito Ooshita, Michiko Inoue | 2025-03-22 | 下载 | We address the self-stabilizing exact majority problem in the population protocol model, introduced by Angluin, Aspnes, Diamadi, Fischer, and Peralta (2004). |
| A Generative Caching System for Large Language Models | Arun Iyengar, Ashish Kundu, Ramana Kompella, Sai Nandan Mamidi | 2025-03-22 | 下载 | Caching has the potential to be of significant benefit for accessing large language models (LLMs) due to their high latencies which typically range from a small number of seconds to well over a minute... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Detecting and Mitigating DDoS Attacks with AI: A Survey | Alexandru Apostu, Silviu Gheorghe, Andrei Hîji, Nicolae Cleju, Andrei Pătraşcu, Cristian Rusu, Radu Ionescu, Paul Irofti | 2025-03-22 | 下载 | Distributed Denial of Service attacks represent an active cybersecurity research problem. Recent research shifted from static rule-based defenses towards AI-based detection and mitigation. |
| CP-AgentNet: Autonomous and Explainable Communication Protocol Design Using Generative Agents | Dae Cheol Kwon, Xinyu Zhang | 2025-03-22 | 下载 | Although DRL (deep reinforcement learning) has emerged as a powerful tool for making better decisions than existing hand-crafted communication protocols, it faces significant limitations: 1) Selecting... |
| Revisiting Outage for Edge Inference Systems | Zhanwei Wang, Qunsong Zeng, Haotian Zheng, Kaibin Huang | 2025-03-22 | 下载 | One of the key missions of sixth-generation (6G) mobile networks is to deploy large-scale artificial intelligence (AI) models at the network edge to provide remote-inference services for edge devices. |
| RAISE: Optimizing RIS Placement to Maximize Task Throughput in Multi-Server Vehicular Edge Computing | Yanan Ma, Zhengru Fang, Longzhi Yuan, Yiqin Deng, Xianhao Chen, Yuguang Fang | 2025-03-22 | 下载 | Given the limited computing capabilities on autonomous vehicles, onboard processing of large volumes of latency-sensitive tasks presents significant challenges. |
| Conditional Diffusion Model with OOD Mitigation as High-Dimensional Offline Resource Allocation Planner in Clustered Ad Hoc Networks | Kechen Meng, Sinuo Zhang, Rongpeng Li, Chan Wang, Ming Lei, Zhifeng Zhao | 2025-03-22 | 下载 | Due to network delays and scalability limitations, clustered ad hoc networks widely adopt Reinforcement Learning (RL) for on-demand resource allocation. |
| A Generative Caching System for Large Language Models | Arun Iyengar, Ashish Kundu, Ramana Kompella, Sai Nandan Mamidi | 2025-03-22 | 下载 | Caching has the potential to be of significant benefit for accessing large language models (LLMs) due to their high latencies which typically range from a small number of seconds to well over a minute... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| LEMIX: Enabling Testing of Embedded Applications as Linux Applications (Extended Report) | Sai Ritvik Tanksalkar, Siddharth Muralee, Srihari Danduri, Paschal Amusuo, Antonio Bianchi, James C Davis, Aravind Kumar Machiry | 2025-03-22 | 下载 | Dynamic analysis, through rehosting, is an important capability for security assessment in embedded systems software. Existing rehosting techniques aim to provide high-fidelity execution by accurately... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Energy-Aware LLMs: A step towards sustainable AI for downstream applications | Nguyen Phuc Tran, Brigitte Jaumard, Oscar Delgado | 2025-03-22 | 下载 | Advanced Large Language Models (LLMs) have revolutionized various fields, including communication networks, sparking an innovation wave that has led to new applications and services, and significantly... |
| THAPI: Tracing Heterogeneous APIs | Solomon Bekele, Aurelio Vivas, Thomas Applencourt, Servesh Muralidharan, Bryce Allen, Kazutomo Yoshiiinst, Swann Perarnau, Brice Videau | 2025-03-22 | 下载 | As we reach exascale, production High Performance Computing (HPC) systems are increasing in complexity. These systems now comprise multiple heterogeneous computing components (CPUs and GPUs) utilized ... |