Publication:
Asynchronous AMR on multi-GPUs

Placeholder

School / College / Institute

Organizational Unit

Program

KU Authors

Co-Authors

Tan Nguyen
Zhang, Weiqun
Almgren, Ann S.
Shalf, John

Publication Date

Language

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Adaptive Mesh Refinement (AMR) is a computational and memory efficient technique for solving partial differential equations. As many of the supercomputers employ GPUs in their systems, AMR frameworks have to be evolved to adapt to large-scale heterogeneous systems. However, it is challenging to employ multiple GPUs and achieve good scalability in AMR because of its complex communication pattern. In this paper, we present our asynchronous AMR runtime system that simultaneously schedules tasks on both CPUs and GPUs and coordinates data movement between different processing units. Our runtime is adaptive to various machine configurations and uses a host resident data model. It helps facilitate using streams to overlap CPU-GPU data transfers with computation and increase device occupancy. We perform strong and weak scaling studies using an Advection solver on Piz Daint supercomputer and achieve high performance.

Source

Publisher

Springer International Publishing Ag

Subject

Computer science, Theory methods

Citation

Has Part

Source

High Performance Computing: Isc High Performance 2019 International Workshops

Book Series Title

Edition

DOI

10.1007/978-3-030-34356-9_11

item.page.datauri

Link

Rights

Copyrights Note

Endorsement

Review

Supplemented By

Referenced By

0

Views

0

Downloads

View PlumX Details