2024-06-29

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A Quality-Aware Voltage Overscaling Framework to Improve the Energy Efficiency and Lifetime of TPUs based on Statistical Error Modeling | Alireza Senobari, Jafar Vafaei, Omid Akbari, Christian Hochberger, Muhammad Shafique | 2024-06-29 | Download | Deep neural networks (DNNs) are a type of artificial intelligence models that are inspired by the structure and function of the human brain, designed to process and learn from large amounts of data, m... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Graph Neural Networks Gone Hogwild | Olga Solodova, Nick Richardson, Deniz Oktay, Ryan P. Adams | 2024-06-29 | Download | Graph neural networks (GNNs) appear to be powerful tools to learn state representations for agents in distributed, decentralized multi-agent systems, but generate catastrophically incorrect prediction... |
| VcLLM: Video Codecs are Secretly Tensor Codecs | Ceyu Xu, Yongji Wu, Xinyu Yang, Beidi Chen, Matthew Lentz, Danyang Zhuo, Lisa Wu Wills | 2024-06-29 | Download | As the parameter size of large language models (LLMs) continues to expand, the need for a large memory footprint and high communication bandwidth have become significant bottlenecks for the training a... |
| Understanding Large-Scale Plasma Simulation Challenges for Fusion Energy on Supercomputers | Jeremy J. Williams, Ashish Bhole, Dylan Kierans, Matthias Hoelzl, Ihor Holod, Weikang Tang, David Tskhakaya, Stefan Costea, Leon Kos, Ales Podolnik, Jakub Hromadka, JOREK Team, Erwin Laure, Stefano Markidis | 2024-06-29 | Download | Understanding plasma instabilities is essential for achieving sustainable fusion energy, with large-scale plasma simulations playing a crucial role in both the design and development of next-generatio... |
| Teola: Towards End-to-End Optimization of LLM-based Applications | Xin Tan, Yimin Jiang, Yitao Yang, Hong Xu | 2024-06-29 | Download | Large language model (LLM)-based applications consist of both LLM and non-LLM components, each contributing to the end-to-end latency. Despite great efforts to optimize LLM inference, end-to-end workf... |
| FastMig: Leveraging FastFreeze to Establish Robust Service Liquidity in Cloud 2.0 | Sorawit Manatura, Thanawat Chanikaphon, Chantana Chantrapornchai, Mohsen Amini Salehi | 2024-06-29 | Download | Service liquidity across edge-to-cloud or multi-cloud will serve as the cornerstone of the next generation of cloud computing systems (Cloud 2.0). |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Dynamic Optimization of Video Streaming Quality Using Network Digital Twin Technology | Zurh Farus, Betty Searcy, Tina Nassisid, Kevin Muhammad | 2024-06-29 | Download | This paper introduces a novel dynamic optimization framework for video streaming that leverages Network Digital Twin (NDT) technology to address the challenges posed by fluctuating wireless network co... |
| To Switch or Not to Switch to TCP Prague? Incentives for Adoption in a Partial L4S Deployment | Fatih Berkay Sarpkaya, Ashutosh Srivastava, Fraida Fund, Shivendra Panwar | 2024-06-29 | Download | The Low Latency, Low Loss, Scalable Throughput (L4S) architecture has the potential to reduce queuing delay when it is deployed at endpoints and routers throughout the Internet. |
| C-MASS: Combinatorial Mobility-Aware Sensor Scheduling for Collaborative Perception with Second-Order Topology Approximation | Yukuan Jia, Yuxuan Sun, Ruiqing Mao, Zhaojun Nan, Sheng Zhou, Zhisheng Niu | 2024-06-29 | Download | Collaborative Perception (CP) has been a promising solution to address occlusions in the traffic environment by sharing sensor data among collaborative vehicles (CoV) via vehicle-to-everything (V2X) n... |
| Teola: Towards End-to-End Optimization of LLM-based Applications | Xin Tan, Yimin Jiang, Yitao Yang, Hong Xu | 2024-06-29 | Download | Large language model (LLM)-based applications consist of both LLM and non-LLM components, each contributing to the end-to-end latency. Despite great efforts to optimize LLM inference, end-to-end workf... |
| Digital Twin-Assisted Data-Driven Optimization for Reliable Edge Caching in Wireless Networks | Zifan Zhang, Yuchen Liu, Zhiyuan Peng, Mingzhe Chen, Dongkuan Xu, Shuguang Cui | 2024-06-29 | Download | Optimizing edge caching is crucial for the advancement of next-generation (nextG) wireless networks, ensuring high-speed and low-latency services for mobile users. |
| Science-Informed Design of Deep Learning With Applications to Wireless Systems: A Tutorial | Atefeh Termehchi, Ekram Hossain, Angelo Vera-Rivera, Muhammad Ibrahim, Isaac Woungang | 2024-06-29 | Download | Recent advances in computational infrastructure and large-scale data processing have accelerated the adoption of data-driven inference methods, particularly deep learning (DL), to solve problems in ma... |

cs.OS - Operating Systems

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| FastMig: Leveraging FastFreeze to Establish Robust Service Liquidity in Cloud 2.0 | Sorawit Manatura, Thanawat Chanikaphon, Chantana Chantrapornchai, Mohsen Amini Salehi | 2024-06-29 | Download | Service liquidity across edge-to-cloud or multi-cloud will serve as the cornerstone of the next generation of cloud computing systems (Cloud 2.0). |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Understanding Large-Scale Plasma Simulation Challenges for Fusion Energy on Supercomputers | Jeremy J. Williams, Ashish Bhole, Dylan Kierans, Matthias Hoelzl, Ihor Holod, Weikang Tang, David Tskhakaya, Stefan Costea, Leon Kos, Ales Podolnik, Jakub Hromadka, JOREK Team, Erwin Laure, Stefano Markidis | 2024-06-29 | Download | Understanding plasma instabilities is essential for achieving sustainable fusion energy, with large-scale plasma simulations playing a crucial role in both the design and development of next-generatio... |
