Publication: Nonintrusive AMR asynchrony for communication optimization
dc.contributor.coauthor | Nguyen, Tan | |
dc.contributor.coauthor | Zhang, Weiqun | |
dc.contributor.coauthor | Almgren, Ann | |
dc.contributor.coauthor | Shalf, John | |
dc.contributor.department | N/A | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Farooqi, Muhammad Nufail | |
dc.contributor.kuauthor | Erten, Didem Unat | |
dc.contributor.kuprofile | PhD Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 219274 | |
dc.date.accessioned | 2024-11-10T00:08:16Z | |
dc.date.issued | 2017 | |
dc.description.abstract | Adaptive Mesh Refinement (AMR) is a well-known method for efficiently solving partial differential equations. A straightforward AMR algorithm typically exhibits many synchronization points even during a single time step, where costly communication often degrades the performance. This problem will be even more pronounced on future supercomputers containing billion-way parallelism, which will raise the communication cost further. Re-designing AMR algorithms to avoid synchronization is not a viable solution due to the large code size and complex control structures. We present a nonintrusive asynchronous approach to hiding the effects of communication in an AMR application. Specifically, our approach reasons about data dependencies automatically using domain knowledge about AMR applications, allowing asynchrony to be discovered with only a modest amount of code modification. Using this approach, we optimize the synchronous AMR algorithm in the BoxLib software framework without severely affecting the productivity of the application programmer. We observe around 27-31% performance improvement for an advection solver on the Hazel Hen supercomputer using 12288 cores. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | NO | |
dc.description.publisherscope | International | |
dc.description.sponsorship | Turkish Science and Technology Research Centre [215E185] | |
dc.description.sponsorship | European Commission [655965] | |
dc.description.sponsorship | Office of Advanced Scientific Computing Research in the Department of Energy Office of Science [DE-AC02-05CH11231] Authors from Koc University are supported by the Turkish Science and Technology Research Centre Grant No: 215E185. Dr. Unat is supported by the Marie Sklodowska Curie Reintegration Grant 655965 by the European Commission. We acknowledge PRACE for awarding us access to the Hazel Hen supercomputer in Germany. Authors from Lawrence Berkeley National Laboratory were supported by the Office of Advanced Scientific Computing Research in the Department of Energy Office of Science under contract number DE-AC02-05CH11231. | |
dc.description.volume | 10417 | |
dc.identifier.doi | 10.1007/978-3-319-64203-1_49 | |
dc.identifier.eissn | 1611-3349 | |
dc.identifier.isbn | 978-3-319-64203-1 | |
dc.identifier.isbn | 978-3-319-64202-4 | |
dc.identifier.issn | 0302-9743 | |
dc.identifier.quartile | Q4 | |
dc.identifier.scopus | 2-s2.0-85028700106 | |
dc.identifier.uri | http://dx.doi.org/10.1007/978-3-319-64203-1_49 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/16929 | |
dc.identifier.wos | 851032800049 | |
dc.keywords | Asynchronous execution | |
dc.keywords | Adaptive mesh refinement | |
dc.keywords | AMR algorithm | |
dc.keywords | Communication hiding | |
dc.language | English | |
dc.publisher | Springer International Publishing AG | |
dc.source | Euro-Par 2017: Parallel Processing | |
dc.subject | Computer science | |
dc.subject | Theory methods | |
dc.title | Nonintrusive AMR asynchrony for communication optimization | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0002-1609-5847 | |
local.contributor.authorid | 0000-0002-2351-0770 | |
local.contributor.kuauthor | Farooqi, Muhammad Nufail | |
local.contributor.kuauthor | Erten, Didem Unat | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |