Publication:
Hierarchical compact clustering attention (COCA) for unsupervised object-centric learning

dc.conference.dateJUN 10-17, 2025
dc.conference.locationNashville, TN
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentKUIS AI (Koç University & İş Bank Artificial Intelligence Center)
dc.contributor.kuauthorYemez, Yücel
dc.contributor.kuauthorKüçüksözen, Can
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteResearch Center
dc.date.accessioned2026-07-02T07:03:04Z
dc.date.available2026-03-27
dc.date.issued2025
dc.description.abstractWe propose the Compact Clustering Attention (COCA) layer, an effective building block that introduces a hierarchical strategy for object-centric representation learning, while solving the unsupervised object discovery task on single images. COCA is an attention-based clustering module capable of extracting object-centric representations from multi-object scenes, when cascaded into a bottom-up hierarchical network architecture, referred to as COCA-Net. At its core, COCA utilizes a novel clustering algorithm that leverages the physical concept of compactness, to highlight distinct object centroids in a scene, providing a spatial inductive bias. Thanks to this strategy, COCA-Net generates high-quality segmentation masks on both the decoder side and, notably, the encoder side of its pipeline. Additionally, COCA-Net is not bound by a predetermined number of object masks that it generates and handles the segmentation of background elements better than its competitors. We demonstrate COCA-Net's segmentation performance on six widely adopted datasets, achieving superior or competitive results against the state-of-the-art models across nine different evaluation metrics.
dc.description.fulltextNo
dc.description.harvestedfromManual
dc.description.indexedbyWOS
dc.description.publisherscopeInternational
dc.description.readpublishN/A
dc.description.sponsoredbyTubitakEuN/A
dc.description.versionPublished Version
dc.identifier.WoSQuartileN/A
dc.identifier.doi10.1109/CVPR52734.2025.02364
dc.identifier.embargoNo
dc.identifier.endpage25398
dc.identifier.isbn9798331543655
dc.identifier.isbn9798331543648
dc.identifier.issn1063-6919
dc.identifier.startpage25388
dc.identifier.urihttp://dx.doi.org/10.1109/CVPR52734.2025.02364
dc.identifier.urihttps://hdl.handle.net/20.500.14288/32830
dc.identifier.wos001601181100128
dc.keywordsObject-centric learning
dc.keywordsAttention mechanisms
dc.keywordsUnsupervised segmentation
dc.languageeng
dc.publisherIEEE
dc.relation.affiliationKoç University
dc.relation.collectionKoç University Institutional Repository
dc.relation.ispartof2025 IEEE CVF Conference on Computer Vision and Pattern Recognition (CVPR)
dc.relation.openaccessN/A
dc.rightsN/A
dc.rights.uriN/A
dc.subjectComputer science
dc.titleHierarchical compact clustering attention (COCA) for unsupervised object-centric learning
dc.typeConference Proceeding
dspace.entity.typePublication
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication77d67233-829b-4c3a-a28f-bd97ab5c12c7
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublicationd437580f-9309-4ecb-864a-4af58309d287
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files