Skip to content

2024-03-24

cs.AR - Architecture

标题作者发布日期PDF摘要
Thermal Analysis for NVIDIA GTX480 Fermi GPU ArchitectureSavinay Nagendra2024-03-24下载In this project, we design a four-layer (Silicon|TIM|Silicon|TIM), 3D floor plan for NVIDIA GTX480 Fermi GPU architecture and compare heat dissipation and power trends for matrix multiplication and...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Q-adaptive: A Multi-Agent Reinforcement Learning Based Routing on Dragonfly NetworkYao Kang, Xin Wang, Zhiling Lan2024-03-24下载High-radix interconnects such as Dragonfly and its variants rely on adaptive routing to balance network traffic for optimum performance. Ideally, adaptive routing attempts to forward packets between m...
MRSch: Multi-Resource Scheduling for HPCBoyang Li, Yuping Fan, Matthew Dearing, Zhiling Lan, Paul Richy, William Allcocky, Michael Papka2024-03-24下载Emerging workloads in high-performance computing (HPC) are embracing significant changes, such as having diverse resource requirements instead of being CPU-centric.
Interpretable Modeling of Deep Reinforcement Learning Driven SchedulingBoyang Li, Zhiling Lan, Michael E. Papka2024-03-24下载In the field of high-performance computing (HPC), there has been recent exploration into the use of deep reinforcement learning for cluster scheduling (DRL scheduling), which has demonstrated promisin...
Study of Workload Interference with Intelligent Routing on DragonflyYao Kang, Xin Wang, Zhiling Lan2024-03-24下载Dragonfly interconnect is a crucial network technology for supercomputers. To support exascale systems, network resources are shared such that links and routers are not dedicated to any node pair.
Arena: Efficiently Training Large Models via Dynamic Scheduling and Adaptive Parallelism Co-DesignChunyu Xue, Weihao Cui, Quan Chen, Chen Chen, Han Zhao, Shulai Zhang, Linmei Wang, Yan Li, Limin Xiao, Weifeng Zhang, Jing Yang, Bingsheng He, Minyi Guo2024-03-24下载Efficiently training large-scale models (LMs) in GPU clusters involves two separate avenues: inter-job dynamic scheduling and intra-job adaptive parallelism (AP).

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Q-adaptive: A Multi-Agent Reinforcement Learning Based Routing on Dragonfly NetworkYao Kang, Xin Wang, Zhiling Lan2024-03-24下载High-radix interconnects such as Dragonfly and its variants rely on adaptive routing to balance network traffic for optimum performance. Ideally, adaptive routing attempts to forward packets between m...
Study of Workload Interference with Intelligent Routing on DragonflyYao Kang, Xin Wang, Zhiling Lan2024-03-24下载Dragonfly interconnect is a crucial network technology for supercomputers. To support exascale systems, network resources are shared such that links and routers are not dedicated to any node pair.
Evaluation of Greedy and CBF for ETSI non-area GeoNetworking: The impact of DCCOscar Amador, Maria Calderon, Manuel Urueña, Ignacio Soto2024-03-24下载This paper evaluates the performance of the two ETSI non-area forwarding algorithms in the GeoNetworking specification: Greedy Forwarding and Non-Area Contention-Based Forwarding (CBF).
Interference Management for Integrated Sensing and Communication Systems: A SurveyYangyang Niu, Zhiqing Wei, Lin Wang, Huici Wu, Zhiyong Feng2024-03-24下载Emerging applications such as autonomous driving and Internet of things (IoT) services put forward the demand for simutaneous sensing and communication functions in the same system.
Digital Twin Assisted Intelligent Network Management for Vehicular ApplicationsKaige Qu, Weihua Zhuang2024-03-24下载The emerging data-driven methods based on artificial intelligence (AI) have paved the way for intelligent, flexible, and adaptive network management in vehicular applications.
Prioritized Multi-Tenant Traffic Engineering for Dynamic QoS Provisioning in Autonomous SDN-OpenFlow Edge NetworksMohammad Sajid Shahriar, Faisal Ahmed, Genshe Chen, Khanh D. Pham, Suresh Subramaniam, Motoharu Matsuura, Hiroshi Hasegawa, Shih-Chun Lin2024-03-24下载This letter indicates the critical need for prioritized multi-tenant quality-of-service (QoS) management by emerging mobile edge systems, particularly for high-throughput beyond fifth-generation netwo...

cs.PF - Performance

标题作者发布日期PDF摘要
Explainable Port Mapping Inference with Sparse Performance Counters for AMD's Zen ArchitecturesFabian Ritter, Sebastian Hack2024-03-24下载Performance models are instrumental for optimizing performance-sensitive code. When modeling the use of functional units of out-of-order x86-64 CPUs, data availability varies by the manufacturer: Inst...
Performance evaluation of accelerated complex multiple-precision LU decompositionTomonori Kouya2024-03-24下载The direct method is one of the most important algorithms for solving linear systems of equations, with LU decomposition comprising a significant portion of its computation time.
MicroHD: An Accuracy-Driven Optimization of Hyperdimensional Computing Algorithms for TinyML systemsFlavio Ponzina, Tajana Rosing2024-03-24下载Hyperdimensional computing (HDC) is emerging as a promising AI approach that can effectively target TinyML applications thanks to its lightweight computing and memory requirements.

基于 VitePress 构建