Publication:
GPU-initiated resource allocation for irregular workloads

dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorErten, Didem Unat
dc.contributor.kuauthorSasongko, Muhammad Aditya
dc.contributor.kuauthorTurimbetov, İlyas
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.date.accessioned2024-12-29T09:39:30Z
dc.date.issued2024
dc.description.abstractGPU kernels may suffer from resource underutilization in multi-GPU systems due to insufficient workload to saturate devices when incorporated within an irregular application. To better utilize the resources in multi-GPU systems, we propose a GPU-sided resource allocation method that can increase or decrease the number of GPUs in use as the workload changes over time. Our method employs GPU-to-CPU callbacks to allowGPU device(s) to request additional devices while the kernel execution is in flight. We implemented and tested multiple callback methods required for GPU-initiated workload offloading to other devices and measured their overheads on Nvidia and AMD platforms. To showcase the usage of callbacks in irregular applications, we implemented Breadth-First Search (BFS) that uses device-initiated workload offloading. Apart from allowing dynamic device allocation in persistently running kernels, it reduces time to solution on average by 15.7% at the cost of callback overheads with a minimum of 6.50 microseconds on AMD and 4.83 microseconds on Nvidia, depending on the chosen callback mechanism. Moreover, the proposed model can reduce the total device usage by up to 35%, which is associated with higher energy efficiency.
dc.description.indexedbyWOS
dc.description.indexedbyScopus
dc.description.openaccessHybrid Gold Open Access
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuN/A
dc.description.sponsorshipThis work was supported in part by the Royal Society-Newton Advanced Fellowship and by the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (grant agreement No 949587).
dc.identifier.doi10.1145/3642961.3643799
dc.identifier.isbn979-8-4007-0537-3
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85191662098
dc.identifier.urihttps://doi.org/10.1145/3642961.3643799
dc.identifier.urihttps://hdl.handle.net/20.500.14288/23012
dc.identifier.wos1209671600001
dc.keywordsEnergy efficiency
dc.keywordsProgram processors
dc.keywordsResource allocation
dc.language.isoeng
dc.publisherAssoc Computing Machinery
dc.relation.ispartofProceedings of 2024 3rd International Workshop on Extreme Heterogeneity Solutions, Exhet 2024
dc.subjectComputer science
dc.subjectHardware and architecture
dc.subjectSoftware engineering
dc.subjectTheory and methods
dc.titleGPU-initiated resource allocation for irregular workloads
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorTurimbetov, İlyas
local.contributor.kuauthorSasongko, Muhammad Aditya
local.contributor.kuauthorErten, Didem Unat
local.publication.orgunit1College of Engineering
local.publication.orgunit2Department of Computer Engineering
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
IR04604.pdf
Size:
1.97 MB
Format:
Adobe Portable Document Format