GPU-initiated resource allocation for irregular workloads

Publication:
GPU-initiated resource allocation for irregular workloads

dc.contributor.department	Department of Computer Engineering
dc.contributor.kuauthor	Faculty Member, Erten, Didem Unat
dc.contributor.kuauthor	Researcher, Sasongko, Muhammad Aditya
dc.contributor.kuauthor	PhD Student, Turimbetov, İlyas
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.date.accessioned	2024-12-29T09:39:30Z
dc.date.issued	2024
dc.description.abstract	GPU kernels may suffer from resource underutilization in multi-GPU systems due to insufficient workload to saturate devices when incorporated within an irregular application. To better utilize the resources in multi-GPU systems, we propose a GPU-sided resource allocation method that can increase or decrease the number of GPUs in use as the workload changes over time. Our method employs GPU-to-CPU callbacks to allowGPU device(s) to request additional devices while the kernel execution is in flight. We implemented and tested multiple callback methods required for GPU-initiated workload offloading to other devices and measured their overheads on Nvidia and AMD platforms. To showcase the usage of callbacks in irregular applications, we implemented Breadth-First Search (BFS) that uses device-initiated workload offloading. Apart from allowing dynamic device allocation in persistently running kernels, it reduces time to solution on average by 15.7% at the cost of callback overheads with a minimum of 6.50 microseconds on AMD and 4.83 microseconds on Nvidia, depending on the chosen callback mechanism. Moreover, the proposed model can reduce the total device usage by up to 35%, which is associated with higher energy efficiency.
dc.description.indexedby	WOS
dc.description.indexedby	Scopus
dc.description.openaccess	Hybrid Gold Open Access
dc.description.publisherscope	International
dc.description.sponsoredbyTubitakEu	N/A
dc.description.sponsorship	This work was supported in part by the Royal Society-Newton Advanced Fellowship and by the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (grant agreement No 949587).
dc.identifier.doi	10.1145/3642961.3643799
dc.identifier.isbn	979-8-4007-0537-3
dc.identifier.quartile	N/A
dc.identifier.scopus	2-s2.0-85191662098
dc.identifier.uri	https://doi.org/10.1145/3642961.3643799
dc.identifier.uri	https://hdl.handle.net/20.500.14288/23012
dc.identifier.wos	1209671600001
dc.keywords	Energy efficiency
dc.keywords	Program processors
dc.keywords	Resource allocation
dc.language.iso	eng
dc.publisher	Assoc Computing Machinery
dc.relation.ispartof	Proceedings of 2024 3rd International Workshop on Extreme Heterogeneity Solutions, Exhet 2024
dc.subject	Computer science
dc.subject	Hardware and architecture
dc.subject	Software engineering
dc.subject	Theory and methods
dc.title	GPU-initiated resource allocation for irregular workloads
dc.type	Conference Proceeding
dspace.entity.type	Publication
local.contributor.kuauthor	Turimbetov, İlyas
local.contributor.kuauthor	Sasongko, Muhammad Aditya
local.contributor.kuauthor	Erten, Didem Unat
local.publication.orgunit1	College of Engineering
local.publication.orgunit2	Department of Computer Engineering
relation.isOrgUnitOfPublication	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Now showing 1 - 1 of 1

Name:: IR04604.pdf
Size:: 1.97 MB
Format:: Adobe Portable Document Format

Download

Collections

Publications with Fulltext

Publication: GPU-initiated resource allocation for irregular workloads

Files

Original bundle

Collections

Publication:
GPU-initiated resource allocation for irregular workloads