2025-08-24

cs.AR - Hardware Architecture

Title	Authors	Published	PDF	Abstract
Random-phase Wave Splatting of Translucent Primitives for Computer-generated Holography	Brian Chao, Jacqueline Yang, Suyeon Choi, Manu Gopakumar, Ryota Koiso, Gordon Wetzstein	2025-08-24	Download	Holographic near-eye displays offer ultra-compact form factors for VR/AR systems but rely on advanced computer-generated holography (CGH) algorithms to convert 3D scenes into interference patterns on ...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Easy Acceleration with Distributed Arrays	Jeremy Kepner, Chansup Byun, LaToya Anderson, William Arcand, David Bestor, William Bergeron, Alex Bonn, Daniel Burrill, Vijay Gadepally, Ryan Haney, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Piotr Luszczek, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Charles Yee, Peter Michaleas	2025-08-24	Download	High level programming languages and GPU accelerators are powerful enablers for a wide range of applications. Achieving scalable vertical (within a compute node), horizontal (across compute nodes), an...
MetaFed: Advancing Privacy, Performance, and Sustainability in Federated Metaverse Systems	Muhammet Anil Yagiz, Zeynep Sude Cengiz, Polat Goktas	2025-08-24	Download	The rapid expansion of immersive Metaverse applications introduces complex challenges at the intersection of performance, privacy, and environmental sustainability.
Bine Trees: Enhancing Collective Operations by Optimizing Communication Locality	Daniele De Sensi, Saverio Pasqualoni, Lorenzo Piarulli, Tommaso Bonato, Seydou Ba, Matteo Turisini, Jens Domke, Torsten Hoefler	2025-08-24	Download	Communication locality plays a key role in the performance of collective operations on large HPC systems, especially on oversubscribed networks where groups of nodes are fully connected internally but...
TokenLake: A Unified Segment-level Prefix Cache Pool for Fine-grained Elastic Long-Context LLM Serving	Bingyang Wu, Zili Zhang, Yinmin Zhong, Guanzhe Huang, Yibo Zhu, Xuanzhe Liu, Xin Jin	2025-08-24	Download	Prefix caching is crucial to accelerate multi-turn interactions and requests with shared prefixes. At the cluster level, existing prefix caching systems are tightly coupled with request scheduling to ...
Memory-Efficient Federated Fine-Tuning of Large Language Models via Layer Pruning	Yebo Wu, Jingguang Li, Chunlin Tian, Zhijiang Guo, Li Li	2025-08-24	Download	Federated fine-tuning enables privacy-preserving Large Language Model (LLM) adaptation, but its high memory cost limits participation from resource-constrained devices.

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Comparison of FTN-NOFDM and PCS-OFDM for Long-Haul Coherent Optical Communications	Haide Wang, Ji Zhou, Yongcheng Li, Weiping Liu, Changyuan Yu, Xiangjun Xin, Liangchuan Li	2025-08-24	Download	Single-wavelength 400G coherent optical communications have become a critical solution to meet the explosive traffic demands. However, the single-carrier modulation using low-order modulation formats ...

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Evaluating Compiler Optimization Impacts on zkVM Performance	Thomas Gassmann, Stefanos Chaliasos, Thodoris Sotiropoulos, Zhendong Su	2025-08-24	Download	Zero-knowledge proofs (ZKPs) are the cornerstone of programmable cryptography. They enable (1) privacy-preserving and verifiable computation across blockchains, and (2) an expanding range of off-chain...
Easy Acceleration with Distributed Arrays	Jeremy Kepner, Chansup Byun, LaToya Anderson, William Arcand, David Bestor, William Bergeron, Alex Bonn, Daniel Burrill, Vijay Gadepally, Ryan Haney, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Piotr Luszczek, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Charles Yee, Peter Michaleas	2025-08-24	Download	High level programming languages and GPU accelerators are powerful enablers for a wide range of applications. Achieving scalable vertical (within a compute node), horizontal (across compute nodes), an...
MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models	Krishna Teja Chitty-Venkata, Sylvia Howland, Golara Azar, Daria Soboleva, Natalia Vassilieva, Siddhisanket Raskar, Murali Emani, Venkatram Vishwanath	2025-08-24	Download	Mixture of Experts (MoE) models have enabled the scaling of Large Language Models (LLMs) and Vision Language Models (VLMs) by achieving massive parameter counts while maintaining computational efficie...
The Unwritten Contract of Cloud-based Elastic Solid-State Drives	Yingjia Wang, Ming-Chang Yang	2025-08-24	Download	Elastic block storage (EBS) with the storage-compute disaggregated architecture stands as a pivotal piece in today's cloud. EBS furnishes users with storage capabilities through the elastic solid-stat...
Who Wins the Race? (R Vs Python) - An Exploratory Study on Energy Consumption of Machine Learning Algorithms	Rajrupa Chattaraj, Sridhar Chimalakonda, Vibhu Saujanya Sharma, Vikrant Kaulgud	2025-08-24	Download	The utilization of Machine Learning (ML) in contemporary software systems is extensive and continually expanding. However, its usage is energy-intensive, contributing to increased carbon emissions and...
Bine Trees: Enhancing Collective Operations by Optimizing Communication Locality	Daniele De Sensi, Saverio Pasqualoni, Lorenzo Piarulli, Tommaso Bonato, Seydou Ba, Matteo Turisini, Jens Domke, Torsten Hoefler	2025-08-24	Download	Communication locality plays a key role in the performance of collective operations on large HPC systems, especially on oversubscribed networks where groups of nodes are fully connected internally but...
Performance is not All You Need: Sustainability Considerations for Algorithms	Xiang Li, Chong Zhang, Hongpeng Wang, Shreyank Narayana Gowda, Yushi Li, Xiaobo Jin	2025-08-24	Download	This work focuses on the high carbon emissions generated by deep learning model training, specifically addressing the core challenge of balancing algorithm performance and energy consumption.