
2025-08-24

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Random-phase Wave Splatting of Translucent Primitives for Computer-generated Holography | Brian Chao, Jacqueline Yang, Suyeon Choi, Manu Gopakumar, Ryota Koiso, Gordon Wetzstein | 2025-08-24 | Download | Holographic near-eye displays offer ultra-compact form factors for VR/AR systems but rely on advanced computer-generated holography (CGH) algorithms to convert 3D scenes into interference patterns on ... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Easy Acceleration with Distributed Arrays | Jeremy Kepner, Chansup Byun, LaToya Anderson, William Arcand, David Bestor, William Bergeron, Alex Bonn, Daniel Burrill, Vijay Gadepally, Ryan Haney, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Piotr Luszczek, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Charles Yee, Peter Michaleas | 2025-08-24 | Download | High level programming languages and GPU accelerators are powerful enablers for a wide range of applications. Achieving scalable vertical (within a compute node), horizontal (across compute nodes), an... |
| MetaFed: Advancing Privacy, Performance, and Sustainability in Federated Metaverse Systems | Muhammet Anil Yagiz, Zeynep Sude Cengiz, Polat Goktas | 2025-08-24 | Download | The rapid expansion of immersive Metaverse applications introduces complex challenges at the intersection of performance, privacy, and environmental sustainability. |
| Bine Trees: Enhancing Collective Operations by Optimizing Communication Locality | Daniele De Sensi, Saverio Pasqualoni, Lorenzo Piarulli, Tommaso Bonato, Seydou Ba, Matteo Turisini, Jens Domke, Torsten Hoefler | 2025-08-24 | Download | Communication locality plays a key role in the performance of collective operations on large HPC systems, especially on oversubscribed networks where groups of nodes are fully connected internally but... |
| TokenLake: A Unified Segment-level Prefix Cache Pool for Fine-grained Elastic Long-Context LLM Serving | Bingyang Wu, Zili Zhang, Yinmin Zhong, Guanzhe Huang, Yibo Zhu, Xuanzhe Liu, Xin Jin | 2025-08-24 | Download | Prefix caching is crucial to accelerate multi-turn interactions and requests with shared prefixes. At the cluster level, existing prefix caching systems are tightly coupled with request scheduling to ... |
| Memory-Efficient Federated Fine-Tuning of Large Language Models via Layer Pruning | Yebo Wu, Jingguang Li, Chunlin Tian, Zhijiang Guo, Li Li | 2025-08-24 | Download | Federated fine-tuning enables privacy-preserving Large Language Model (LLM) adaptation, but its high memory cost limits participation from resource-constrained devices. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Comparison of FTN-NOFDM and PCS-OFDM for Long-Haul Coherent Optical Communications | Haide Wang, Ji Zhou, Yongcheng Li, Weiping Liu, Changyuan Yu, Xiangjun Xin, Liangchuan Li | 2025-08-24 | Download | Single-wavelength 400G coherent optical communications have become a critical solution to meet the explosive traffic demands. However, the single-carrier modulation using low-order modulation formats ... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Evaluating Compiler Optimization Impacts on zkVM Performance | Thomas Gassmann, Stefanos Chaliasos, Thodoris Sotiropoulos, Zhendong Su | 2025-08-24 | Download | Zero-knowledge proofs (ZKPs) are the cornerstone of programmable cryptography. They enable (1) privacy-preserving and verifiable computation across blockchains, and (2) an expanding range of off-chain... |
| Easy Acceleration with Distributed Arrays | Jeremy Kepner, Chansup Byun, LaToya Anderson, William Arcand, David Bestor, William Bergeron, Alex Bonn, Daniel Burrill, Vijay Gadepally, Ryan Haney, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Piotr Luszczek, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Charles Yee, Peter Michaleas | 2025-08-24 | Download | High level programming languages and GPU accelerators are powerful enablers for a wide range of applications. Achieving scalable vertical (within a compute node), horizontal (across compute nodes), an... |
| MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models | Krishna Teja Chitty-Venkata, Sylvia Howland, Golara Azar, Daria Soboleva, Natalia Vassilieva, Siddhisanket Raskar, Murali Emani, Venkatram Vishwanath | 2025-08-24 | Download | Mixture of Experts (MoE) models have enabled the scaling of Large Language Models (LLMs) and Vision Language Models (VLMs) by achieving massive parameter counts while maintaining computational efficie... |
| The Unwritten Contract of Cloud-based Elastic Solid-State Drives | Yingjia Wang, Ming-Chang Yang | 2025-08-24 | Download | Elastic block storage (EBS) with the storage-compute disaggregated architecture stands as a pivotal piece in today's cloud. EBS furnishes users with storage capabilities through the elastic solid-stat... |
| Who Wins the Race? (R Vs Python) - An Exploratory Study on Energy Consumption of Machine Learning Algorithms | Rajrupa Chattaraj, Sridhar Chimalakonda, Vibhu Saujanya Sharma, Vikrant Kaulgud | 2025-08-24 | Download | The utilization of Machine Learning (ML) in contemporary software systems is extensive and continually expanding. However, its usage is energy-intensive, contributing to increased carbon emissions and... |
| Bine Trees: Enhancing Collective Operations by Optimizing Communication Locality | Daniele De Sensi, Saverio Pasqualoni, Lorenzo Piarulli, Tommaso Bonato, Seydou Ba, Matteo Turisini, Jens Domke, Torsten Hoefler | 2025-08-24 | Download | Communication locality plays a key role in the performance of collective operations on large HPC systems, especially on oversubscribed networks where groups of nodes are fully connected internally but... |
| Performance is not All You Need: Sustainability Considerations for Algorithms | Xiang Li, Chong Zhang, Hongpeng Wang, Shreyank Narayana Gowda, Yushi Li, Xiaobo Jin | 2025-08-24 | Download | This work focuses on the high carbon emissions generated by deep learning model training, specifically addressing the core challenge of balancing algorithm performance and energy consumption. |
