Publication:
ComScribe: identifying intra-node GPU communication

dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.kuauthorAkhtar, Palwisha
dc.contributor.kuauthorErten, Didem Unat
dc.contributor.kuauthorQararyah, Fareed Mohammad
dc.contributor.kuauthorTezcan, Erhan
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned2024-11-10T00:10:58Z
dc.date.issued2021
dc.description.abstractGPU communication plays a critical role in performance and scalability of multi-GPU accelerated applications. With the ever increasing methods and types of communication, it is often hard for the programmer to know the exact amount and type of communication taking place in an application. Though there are prior works that detect communication in distributed systems for MPI and multi-threaded applications on shared memory systems, to our knowledge, none of these works identify intra-node GPU communication. We propose a tool, ComScribe that identifies and categorizes types of communication among all GPU-GPU and CPU-GPU pairs in a node. Built on top of NVIDIA’s profiler nvprof, ComScribe visualizes data movement as a communication matrix or bar-chart for explicit communication primitives, Unified Memory operations, and Zero-copy Memory transfers. To validate our tool on 16 GPUs, we present communication patterns of 8 micro- and 3 macro-benchmarks from NVIDIA, Comm|Scope, and MGBench benchmark suites. To demonstrate tool’s capabilities in real-life applications, we also present insightful communication matrices of two deep neural network models. All in all, ComScribe can guide the programmer in identifying which groups of GPUs communicate in what volume by using which primitives. This offers avenues to detect performance bottlenecks and more importantly communication bugs in an application. © 2021, Springer Nature Switzerland AG.
dc.description.indexedbyScopus
dc.description.openaccessYES
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuN/A
dc.description.volume12614 LNCS
dc.identifier.doi10.1007/978-3-030-71058-3_10
dc.identifier.isbn9783-0307-1057-6
dc.identifier.issn0302-9743
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85103280105
dc.identifier.urihttps://hdl.handle.net/20.500.14288/17392
dc.keywordsInter-GPU communication
dc.keywordsMulti-GPUs
dc.keywordsProfiling
dc.language.isoeng
dc.publisherSpringer Science and Business Media Deutschland GmbH
dc.relation.ispartofLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.subjectHardware
dc.subjectParallel applications
dc.subjectCommunication patterns
dc.titleComScribe: identifying intra-node GPU communication
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorAkhtar, Palwisha
local.contributor.kuauthorTezcan, Erhan
local.contributor.kuauthorQararyah, Fareed Mohammad
local.contributor.kuauthorErten, Didem Unat
local.publication.orgunit1GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1College of Engineering
local.publication.orgunit2Department of Computer Engineering
local.publication.orgunit2Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files