Data:
Artifacts for "CPU- and GPU-initiated Communication Strategies for Conjugate Gradient Methods on Large GPU Clusters"

dc.contributor.authorTrotter, James D
dc.contributor.authorEkmekçibaşı, Sinan
dc.contributor.authorLangguth, Johannes
dc.contributor.authorSağbili, Doğan
dc.contributor.authorCai, Xing
dc.contributor.authorUnat, Didem
dc.contributor.orcid0000-0003-4498-020x
dc.contributor.orcid0009-0003-5377-6339
dc.contributor.orcid0000-0003-4200-511x
dc.contributor.orcid0000-0002-9603-2466
dc.contributor.orcid0000-0003-3706-4414
dc.contributor.orcid0000-0002-2351-0770
dc.date.accessioned2025-10-24T11:44:04Z
dc.date.issued2025-04-27
dc.description.abstractThis dataset contains computational artifacts related to the paper: “CPU- and GPU-initiated Communication Strategies for Conjugate Gradient Methods on Large GPU Clusters” The paper describes computational experiments that were conducted to evaluate the performance of multi-GPU iterative linear solvers based on the conjugate gradient (CG) method. The computational artifacts are located in several subdirectories: 'aCG-1.0.0/' contains the source code for aCG (version 1.0.0), which implements of the various multi-GPU CG solvers that are used for the performance benchmarks presented in the paper. 'partitions/' contains input files related to partitioning and distributing matrices that were used in the experiments. Partitions were computed using METIS (Karypis and Kumar, 1998), a multilevel graph partitioner.  From the SuiteSparse Collection (Davis and Hu, 2011), six matrices were selected: Bump_2911, Cube_Coup_dt6, Flan_1565, Queen_4147, Serena and audikw_1.  For each matrix, partitions are provided for 2, 4, 8, 16 and 32 parts. 'scripts/' contains job scripts for submitting jobs on three clusters: LUMI, MareNostrum 5 and Wisteria/BDEC-01 (Aquarius). These scripts carry out performance measurements for the multi-GPU CG solvers in aCG and PETSc, and were used to collect the results presented in the paper. 'results/' contains results from the performance benchmarks presented in the paper as tables in a plain-text format. References Davis, T. A. and Y. Hu. 2011. “The University of Florida Sparse Matrix Collection”. ACM Transactions on Mathematical Software 38, 1, Article 1 (December 2011), 25 pages. DOI: https://doi.org/10.1145/2049662.2049663 Karypis, G., and V. Kumar. 1998. “A fast and high quality multilevel scheme for partitioning irregular graphs”. SIAM Journal on scientific Computing 20, 1, pp. 359–392. DOI: https://doi.org/10.1137/S1064827595287997
dc.description.urihttps://dx.doi.org/10.5281/zenodo.15849599
dc.description.urihttps://dx.doi.org/10.5281/zenodo.15884684
dc.description.urihttps://dx.doi.org/10.5281/zenodo.15291897
dc.description.urihttps://dx.doi.org/10.5281/zenodo.15291898
dc.description.urihttps://dx.doi.org/10.5281/zenodo.16941071
dc.identifier.doi10.5281/zenodo.15849599
dc.identifier.openairedoi_dedup___::5a7dff94745f60fd0edadfbd0711242e
dc.identifier.urihttps://hdl.handle.net/20.500.14288/31279
dc.publisherZenodo
dc.titleArtifacts for "CPU- and GPU-initiated Communication Strategies for Conjugate Gradient Methods on Large GPU Clusters"
dc.typeDataset
dspace.entity.typeData
local.import.sourceOpenAire

Files

Collections