Appearance
2025-08-24
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Random-phase Wave Splatting of Translucent Primitives for Computer-generated Holography | Brian Chao, Jacqueline Yang, Suyeon Choi, Manu Gopakumar, Ryota Koiso, Gordon Wetzstein | 2025-08-24 | 下载 | Holographic near-eye displays offer ultra-compact form factors for VR/AR systems but rely on advanced computer-generated holography (CGH) algorithms to convert 3D scenes into interference patterns on ... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Easy Acceleration with Distributed Arrays | Jeremy Kepner, Chansup Byun, LaToya Anderson, William Arcand, David Bestor, William Bergeron, Alex Bonn, Daniel Burrill, Vijay Gadepally, Ryan Haney, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Piotr Luszczek, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Charles Yee, Peter Michaleas | 2025-08-24 | 下载 | High level programming languages and GPU accelerators are powerful enablers for a wide range of applications. Achieving scalable vertical (within a compute node), horizontal (across compute nodes), an... |
| MetaFed: Advancing Privacy, Performance, and Sustainability in Federated Metaverse Systems | Muhammet Anil Yagiz, Zeynep Sude Cengiz, Polat Goktas | 2025-08-24 | 下载 | The rapid expansion of immersive Metaverse applications introduces complex challenges at the intersection of performance, privacy, and environmental sustainability. |
| Bine Trees: Enhancing Collective Operations by Optimizing Communication Locality | Daniele De Sensi, Saverio Pasqualoni, Lorenzo Piarulli, Tommaso Bonato, Seydou Ba, Matteo Turisini, Jens Domke, Torsten Hoefler | 2025-08-24 | 下载 | Communication locality plays a key role in the performance of collective operations on large HPC systems, especially on oversubscribed networks where groups of nodes are fully connected internally but... |
| TokenLake: A Unified Segment-level Prefix Cache Pool for Fine-grained Elastic Long-Context LLM Serving | Bingyang Wu, Zili Zhang, Yinmin Zhong, Guanzhe Huang, Yibo Zhu, Xuanzhe Liu, Xin Jin | 2025-08-24 | 下载 | Prefix caching is crucial to accelerate multi-turn interactions and requests with shared prefixes. At the cluster level, existing prefix caching systems are tightly coupled with request scheduling to ... |
| Memory-Efficient Federated Fine-Tuning of Large Language Models via Layer Pruning | Yebo Wu, Jingguang Li, Chunlin Tian, Zhijiang Guo, Li Li | 2025-08-24 | 下载 | Federated fine-tuning enables privacy-preserving Large Language Model (LLM) adaptation, but its high memory cost limits participation from resource-constrained devices. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Comparison of FTN-NOFDM and PCS-OFDM for Long-Haul Coherent Optical Communications | Haide Wang, Ji Zhou, Yongcheng Li, Weiping Liu, Changyuan Yu, Xiangjun Xin, Liangchuan Li | 2025-08-24 | 下载 | Single-wavelength 400G coherent optical communications have become a critical solution to meet the explosive traffic demands. However, the single-carrier modulation using low-order modulation formats ... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Evaluating Compiler Optimization Impacts on zkVM Performance | Thomas Gassmann, Stefanos Chaliasos, Thodoris Sotiropoulos, Zhendong Su | 2025-08-24 | 下载 | Zero-knowledge proofs (ZKPs) are the cornerstone of programmable cryptography. They enable (1) privacy-preserving and verifiable computation across blockchains, and (2) an expanding range of off-chain... |
| Easy Acceleration with Distributed Arrays | Jeremy Kepner, Chansup Byun, LaToya Anderson, William Arcand, David Bestor, William Bergeron, Alex Bonn, Daniel Burrill, Vijay Gadepally, Ryan Haney, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Piotr Luszczek, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Charles Yee, Peter Michaleas | 2025-08-24 | 下载 | High level programming languages and GPU accelerators are powerful enablers for a wide range of applications. Achieving scalable vertical (within a compute node), horizontal (across compute nodes), an... |
| MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models | Krishna Teja Chitty-Venkata, Sylvia Howland, Golara Azar, Daria Soboleva, Natalia Vassilieva, Siddhisanket Raskar, Murali Emani, Venkatram Vishwanath | 2025-08-24 | 下载 | Mixture of Experts (MoE) models have enabled the scaling of Large Language Models (LLMs) and Vision Language Models (VLMs) by achieving massive parameter counts while maintaining computational efficie... |
| The Unwritten Contract of Cloud-based Elastic Solid-State Drives | Yingjia Wang, Ming-Chang Yang | 2025-08-24 | 下载 | Elastic block storage (EBS) with the storage-compute disaggregated architecture stands as a pivotal piece in today's cloud. EBS furnishes users with storage capabilities through the elastic solid-stat... |
| Who Wins the Race? (R Vs Python) - An Exploratory Study on Energy Consumption of Machine Learning Algorithms | Rajrupa Chattaraj, Sridhar Chimalakonda, Vibhu Saujanya Sharma, Vikrant Kaulgud | 2025-08-24 | 下载 | The utilization of Machine Learning (ML) in contemporary software systems is extensive and continually expanding. However, its usage is energy-intensive, contributing to increased carbon emissions and... |
| Bine Trees: Enhancing Collective Operations by Optimizing Communication Locality | Daniele De Sensi, Saverio Pasqualoni, Lorenzo Piarulli, Tommaso Bonato, Seydou Ba, Matteo Turisini, Jens Domke, Torsten Hoefler | 2025-08-24 | 下载 | Communication locality plays a key role in the performance of collective operations on large HPC systems, especially on oversubscribed networks where groups of nodes are fully connected internally but... |
| Performance is not All You Need: Sustainability Considerations for Algorithms | Xiang Li, Chong Zhang, Hongpeng Wang, Shreyank Narayana Gowda, Yushi Li, Xiaobo Jin | 2025-08-24 | 下载 | This work focuses on the high carbon emissions generated by deep learning model training, specifically addressing the core challenge of balancing algorithm performance and energy consumption. |