GPU-initiated resource allocation for irregular workloads

Publication:
GPU-initiated resource allocation for irregular workloads

Files

Primary IR04604.pdf (1.97 MB)

Departments

Organizational Unit

Department of Computer Engineering

School / College / Institute

Organizational Unit

College of Engineering

KU-Authors

Erten, Didem Unat

Sasongko, Muhammad Aditya

Turimbetov, İlyas

Publication Date

2024

Type

Conference Proceeding

Abstract

GPU kernels may suffer from resource underutilization in multi-GPU systems due to insufficient workload to saturate devices when incorporated within an irregular application. To better utilize the resources in multi-GPU systems, we propose a GPU-sided resource allocation method that can increase or decrease the number of GPUs in use as the workload changes over time. Our method employs GPU-to-CPU callbacks to allowGPU device(s) to request additional devices while the kernel execution is in flight. We implemented and tested multiple callback methods required for GPU-initiated workload offloading to other devices and measured their overheads on Nvidia and AMD platforms. To showcase the usage of callbacks in irregular applications, we implemented Breadth-First Search (BFS) that uses device-initiated workload offloading. Apart from allowing dynamic device allocation in persistently running kernels, it reduces time to solution on average by 15.7% at the cost of callback overheads with a minimum of 6.50 microseconds on AMD and 4.83 microseconds on Nvidia, depending on the chosen callback mechanism. Moreover, the proposed model can reduce the total device usage by up to 35%, which is associated with higher energy efficiency.

Publisher

Assoc Computing Machinery

Subject

Computer science, Hardware and architecture, Software engineering, Theory and methods

Source

Proceedings of 2024 3rd International Workshop on Extreme Heterogeneity Solutions, Exhet 2024

DOI

10.1145/3642961.3643799

URI

https://doi.org/10.1145/3642961.3643799
https://hdl.handle.net/20.500.14288/23012

Publication:
GPU-initiated resource allocation for irregular workloads

Files

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Publication Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

2

Views

3

Downloads

Publication: GPU-initiated resource allocation for irregular workloads

Files

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Publication Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

2

Views

3

Downloads

Publication:
GPU-initiated resource allocation for irregular workloads