Publication:
Data decomposition for parallel K-means clustering

dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorGürsoy, Attila
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.date.accessioned2024-11-09T23:39:08Z
dc.date.issued2004
dc.description.abstractDeveloping fast algorithms for clustering has been an important area of research in data mining and other fields. K-means is one of the widely used clustering algorithms. In this work, we have developed and evaluated parallelization of k-means method for low-dimensional data on message passing computers. Three different data decomposition schemes and their impact on the pruning of distance calculations in tree-based k-means algorithm have been studied. Random pattern decomposition has good load balancing but fails to prune distance calculations effectively. Compact spatial decomposition of patterns based on space filling curves outperforms random pattern decomposition even though it has load imbalance problem. In both cases, parallel tree-based k-means clustering runs significantly faster than the traditional parallel k-means.
dc.description.indexedbyWOS
dc.description.indexedbyScopus
dc.description.openaccessNO
dc.description.sponsoredbyTubitakEuN/A
dc.description.volume3019
dc.identifier.doiN/A
dc.identifier.eissn1611-3349
dc.identifier.isbn3-540-21946-3
dc.identifier.issn0302-9743
dc.identifier.scopus2-s2.0-35048897012
dc.identifier.urihttps://hdl.handle.net/20.500.14288/13065
dc.identifier.wos221559200031
dc.language.isoeng
dc.publisherSpringer-Verlag Berlin
dc.relation.ispartofParallel Processing and Applied Mathematics
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.subjectTheory methods
dc.subjectMathematics
dc.subjectApplied mathematics
dc.titleData decomposition for parallel K-means clustering
dc.typeBook Chapter
dspace.entity.typePublication
local.contributor.kuauthorGürsoy, Attila
local.publication.orgunit1College of Engineering
local.publication.orgunit2Department of Computer Engineering
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files