Publication:
Nonintrusive AMR asynchrony for communication optimization

dc.contributor.coauthorNguyen, Tan
dc.contributor.coauthorZhang, Weiqun
dc.contributor.coauthorAlmgren, Ann
dc.contributor.coauthorShalf, John
dc.contributor.departmentN/A
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorFarooqi, Muhammad Nufail
dc.contributor.kuauthorErten, Didem Unat
dc.contributor.kuprofilePhD Student
dc.contributor.kuprofileFaculty Member
dc.contributor.otherDepartment of Computer Engineering
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.yokidN/A
dc.contributor.yokid219274
dc.date.accessioned2024-11-10T00:08:16Z
dc.date.issued2017
dc.description.abstractAdaptive Mesh Refinement (AMR) is a well known method for efficiently solving partial differential equations. A straightforward AMR algorithm typically exhibits many synchronization points even during a single time step, where costly communication often degrades the performance. This problem will be even more pronounced on future supercomputers containing billion way parallelism, which will raise the communication cost further. Re-designing AMR algorithms to avoid synchronization is not a viable solution due to the large code size and complex control structures. We present a nonintrusive asynchronous approach to hiding the effects of communication in an AMR application. Specifically, our approach reasons about data dependencies automatically using domain knowledge about AMR applications, allowing asynchrony to be discovered with only a modest amount of code modification. Using this approach, we optimize the synchronous AMR algorithm in the BoxLib software framework without severely affecting the productivity of the application programmer We observe around 27-31% performance improvement for an advection solver on the Hazel Hen supercomputer using 12288 cores.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.openaccessNO
dc.description.publisherscopeInternational
dc.description.sponsorshipTurkish Science and Technology Research Centre [215E185]
dc.description.sponsorshipEuropean Commission [655965]
dc.description.sponsorshipOffice of Advanced Scientific Computing Research in the Department of Energy Office of Science [DE-AC02-05CH11231] Authors from Koc University are supported by the Turkish Science and Technology Research Centre Grant No: 215E185. Dr. Unat is supported by the Marie SklodoWSKa Curie Reintegration Grant 655965 by the European Commission. We acknowledge PRACE for awarding us access to the Hazel Hen supercomputer in Germany. Authors from Lawrence Berkeley National Laboratory were supported by the Office of Advanced Scientific Computing Research in the Department of Energy Office of Science under contract number DE-AC02-05CH11231.
dc.description.volume10417
dc.identifier.doi10.1007/978-3-319-64203-1_49
dc.identifier.eissn1611-3349
dc.identifier.isbn978-3-319-64203-1
dc.identifier.isbn978-3-319-64202-4
dc.identifier.issn0302-9743
dc.identifier.quartileQ4
dc.identifier.scopus2-s2.0-85028700106
dc.identifier.urihttp://dx.doi.org/10.1007/978-3-319-64203-1_49
dc.identifier.urihttps://hdl.handle.net/20.500.14288/16929
dc.identifier.wos851032800049
dc.keywordsAsynchronous execution
dc.keywordsAdaptive mesh refinement
dc.keywordsAMR algorithm
dc.keywordsCommunication hiding
dc.languageEnglish
dc.publisherSpringer International Publishing Ag
dc.sourceEuro-Par 2017: Parallel Processing
dc.subjectComputer science
dc.subjectTheory methods
dc.titleNonintrusive AMR asynchrony for communication optimization
dc.typeConference proceeding
dspace.entity.typePublication
local.contributor.authorid0000-0002-1609-5847
local.contributor.authorid0000-0002-2351-0770
local.contributor.kuauthorFarooqi, Muhammad Nufail
local.contributor.kuauthorErten, Didem Unat
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae

Files