2024-06-29

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
A Quality-Aware Voltage Overscaling Framework to Improve the Energy Efficiency and Lifetime of TPUs based on Statistical Error Modeling	Alireza Senobari, Jafar Vafaei, Omid Akbari, Christian Hochberger, Muhammad Shafique	2024-06-29	下载	Deep neural networks (DNNs) are a type of artificial intelligence models that are inspired by the structure and function of the human brain, designed to process and learn from large amounts of data, m...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Graph Neural Networks Gone Hogwild	Olga Solodova, Nick Richardson, Deniz Oktay, Ryan P. Adams	2024-06-29	下载	Graph neural networks (GNNs) appear to be powerful tools to learn state representations for agents in distributed, decentralized multi-agent systems, but generate catastrophically incorrect prediction...
VcLLM: Video Codecs are Secretly Tensor Codecs	Ceyu Xu, Yongji Wu, Xinyu Yang, Beidi Chen, Matthew Lentz, Danyang Zhuo, Lisa Wu Wills	2024-06-29	下载	As the parameter size of large language models (LLMs) continues to expand, the need for a large memory footprint and high communication bandwidth have become significant bottlenecks for the training a...
Understanding Large-Scale Plasma Simulation Challenges for Fusion Energy on Supercomputers	Jeremy J. Williams, Ashish Bhole, Dylan Kierans, Matthias Hoelzl, Ihor Holod, Weikang Tang, David Tskhakaya, Stefan Costea, Leon Kos, Ales Podolnik, Jakub Hromadka, JOREK Team, Erwin Laure, Stefano Markidis	2024-06-29	下载	Understanding plasma instabilities is essential for achieving sustainable fusion energy, with large-scale plasma simulations playing a crucial role in both the design and development of next-generatio...
Teola: Towards End-to-End Optimization of LLM-based Applications	Xin Tan, Yimin Jiang, Yitao Yang, Hong Xu	2024-06-29	下载	Large language model (LLM)-based applications consist of both LLM and non-LLM components, each contributing to the end-to-end latency. Despite great efforts to optimize LLM inference, end-to-end workf...
FastMig: Leveraging FastFreeze to Establish Robust Service Liquidity in Cloud 2.0	Sorawit Manatura, Thanawat Chanikaphon, Chantana Chantrapornchai, Mohsen Amini Salehi	2024-06-29	下载	Service liquidity across edge-to-cloud or multi-cloud will serve as the cornerstone of the next generation of cloud computing systems (Cloud 2.0).

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Dynamic Optimization of Video Streaming Quality Using Network Digital Twin Technology	Zurh Farus, Betty Searcy, Tina Nassisid, Kevin Muhammad	2024-06-29	下载	This paper introduces a novel dynamic optimization framework for video streaming that leverages Network Digital Twin (NDT) technology to address the challenges posed by fluctuating wireless network co...
To Switch or Not to Switch to TCP Prague? Incentives for Adoption in a Partial L4S Deployment	Fatih Berkay Sarpkaya, Ashutosh Srivastava, Fraida Fund, Shivendra Panwar	2024-06-29	下载	The Low Latency, Low Loss, Scalable Throughput (L4S) architecture has the potential to reduce queuing delay when it is deployed at endpoints and routers throughout the Internet.
C-MASS: Combinatorial Mobility-Aware Sensor Scheduling for Collaborative Perception with Second-Order Topology Approximation	Yukuan Jia, Yuxuan Sun, Ruiqing Mao, Zhaojun Nan, Sheng Zhou, Zhisheng Niu	2024-06-29	下载	Collaborative Perception (CP) has been a promising solution to address occlusions in the traffic environment by sharing sensor data among collaborative vehicles (CoV) via vehicle-to-everything (V2X) n...
Teola: Towards End-to-End Optimization of LLM-based Applications	Xin Tan, Yimin Jiang, Yitao Yang, Hong Xu	2024-06-29	下载	Large language model (LLM)-based applications consist of both LLM and non-LLM components, each contributing to the end-to-end latency. Despite great efforts to optimize LLM inference, end-to-end workf...
Digital Twin-Assisted Data-Driven Optimization for Reliable Edge Caching in Wireless Networks	Zifan Zhang, Yuchen Liu, Zhiyuan Peng, Mingzhe Chen, Dongkuan Xu, Shuguang Cui	2024-06-29	下载	Optimizing edge caching is crucial for the advancement of next-generation (nextG) wireless networks, ensuring high-speed and low-latency services for mobile users.
Science-Informed Design of Deep Learning With Applications to Wireless Systems: A Tutorial	Atefeh Termehchi, Ekram Hossain, Angelo Vera-Rivera, Muhammad Ibrahim, Isaac Woungang	2024-06-29	下载	Recent advances in computational infrastructure and large-scale data processing have accelerated the adoption of data-driven inference methods, particularly deep learning (DL), to solve problems in ma...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
FastMig: Leveraging FastFreeze to Establish Robust Service Liquidity in Cloud 2.0	Sorawit Manatura, Thanawat Chanikaphon, Chantana Chantrapornchai, Mohsen Amini Salehi	2024-06-29	下载	Service liquidity across edge-to-cloud or multi-cloud will serve as the cornerstone of the next generation of cloud computing systems (Cloud 2.0).

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Understanding Large-Scale Plasma Simulation Challenges for Fusion Energy on Supercomputers	Jeremy J. Williams, Ashish Bhole, Dylan Kierans, Matthias Hoelzl, Ihor Holod, Weikang Tang, David Tskhakaya, Stefan Costea, Leon Kos, Ales Podolnik, Jakub Hromadka, JOREK Team, Erwin Laure, Stefano Markidis	2024-06-29	下载	Understanding plasma instabilities is essential for achieving sustainable fusion energy, with large-scale plasma simulations playing a crucial role in both the design and development of next-generatio...