Publication: Tiling-based programming model for structured grids on GPU clusters
dc.contributor.department | N/A | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Bastem, Burak | |
dc.contributor.kuauthor | Erten, Didem Unat | |
dc.contributor.kuprofile | Master Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 219274 | |
dc.date.accessioned | 2024-11-09T22:52:05Z | |
dc.date.issued | 2020 | |
dc.description.abstract | Currently, more than 25% of supercomputers employ GPUs due to their massively parallel and power-efficient architectures. However, programming GPUs efficiently in a large-scale system is a demanding task not only for computational scientists but also for programming experts, as multi-GPU programming requires managing distinct address spaces, generating GPU-specific code and handling inter-device communication. To ease the programming effort, we propose a tiling-based high-level GPU programming model for structured grid problems. The model abstracts data decomposition, memory management and generation of GPU-specific code, and hides all types of data transfer overheads. We demonstrate the effectiveness of the programming model on a heat simulation and a real-life cardiac modeling application on a single GPU, on a single node with multiple GPUs and on multiple nodes with multiple GPUs. We also present performance comparisons under different hardware and software configurations. The results show that the programming model successfully overlaps communication and provides good speedup on 192 GPUs. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | NO | |
dc.description.publisherscope | International | |
dc.description.sponsorship | Turkish Science and Technology Research Centre [215E185]. We thank Lawrence Berkeley National Laboratory (LBNL), Oak Ridge National Laboratory (ORNL) and Swiss National Supercomputing Center (CSCS) for providing resources for our research. We developed the programming model at a workstation by LBNL and Piz Daint by CSCS, and evaluated its performance at Summitdev by ORNL. We especially thank their staff for always helping us with our needs. We also thank Yapi Kredi Teknoloji for their conference and travel support. Finally, the authors are supported by the Turkish Science and Technology Research Centre Grant No: 215E185. | |
dc.identifier.doi | 10.1145/3368474.3368485 | |
dc.identifier.isbn | 978-1-4503-7236-7 | |
dc.identifier.scopus | 2-s2.0-85094832813 | |
dc.identifier.uri | http://dx.doi.org/10.1145/3368474.3368485 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/6963 | |
dc.identifier.wos | 555299700005 | |
dc.keywords | GPU programming | |
dc.keywords | GPU cluster | |
dc.keywords | Multi-GPU | |
dc.keywords | Tiling | |
dc.keywords | Communication overlap | |
dc.keywords | GPU streams aware MPI | |
dc.language | English | |
dc.publisher | Assoc Computing Machinery | |
dc.source | Proceedings of International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia 2020) | |
dc.subject | Computer science | |
dc.subject | Information systems | |
dc.subject | Engineering | |
dc.subject | Software engineering | |
dc.subject | Theory methods | |
dc.title | Tiling-based programming model for structured grids on GPU clusters | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | N/A | |
local.contributor.authorid | 0000-0002-2351-0770 | |
local.contributor.kuauthor | Bastem, Burak | |
local.contributor.kuauthor | Erten, Didem Unat | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |