Publication: Hierarchical compact clustering attention (COCA) for unsupervised object-centric learning
| dc.conference.date | JUN 10-17, 2025 | |
| dc.conference.location | Nashville, TN | |
| dc.contributor.department | Department of Computer Engineering | |
| dc.contributor.department | KUIS AI (Koç University & İş Bank Artificial Intelligence Center) | |
| dc.contributor.kuauthor | Yemez, Yücel | |
| dc.contributor.kuauthor | Küçüksözen, Can | |
| dc.contributor.schoolcollegeinstitute | College of Engineering | |
| dc.contributor.schoolcollegeinstitute | Research Center | |
| dc.date.accessioned | 2026-07-02T07:03:04Z | |
| dc.date.available | 2026-03-27 | |
| dc.date.issued | 2025 | |
| dc.description.abstract | We propose the Compact Clustering Attention (COCA) layer, an effective building block that introduces a hierarchical strategy for object-centric representation learning, while solving the unsupervised object discovery task on single images. COCA is an attention-based clustering module capable of extracting object-centric representations from multi-object scenes, when cascaded into a bottom-up hierarchical network architecture, referred to as COCA-Net. At its core, COCA utilizes a novel clustering algorithm that leverages the physical concept of compactness, to highlight distinct object centroids in a scene, providing a spatial inductive bias. Thanks to this strategy, COCA-Net generates high-quality segmentation masks on both the decoder side and, notably, the encoder side of its pipeline. Additionally, COCA-Net is not bound by a predetermined number of object masks that it generates and handles the segmentation of background elements better than its competitors. We demonstrate COCA-Net's segmentation performance on six widely adopted datasets, achieving superior or competitive results against the state-of-the-art models across nine different evaluation metrics. | |
| dc.description.fulltext | No | |
| dc.description.harvestedfrom | Manual | |
| dc.description.indexedby | WOS | |
| dc.description.publisherscope | International | |
| dc.description.readpublish | N/A | |
| dc.description.sponsoredbyTubitakEu | N/A | |
| dc.description.version | Published Version | |
| dc.identifier.WoSQuartile | N/A | |
| dc.identifier.doi | 10.1109/CVPR52734.2025.02364 | |
| dc.identifier.embargo | No | |
| dc.identifier.endpage | 25398 | |
| dc.identifier.isbn | 9798331543655 | |
| dc.identifier.isbn | 9798331543648 | |
| dc.identifier.issn | 1063-6919 | |
| dc.identifier.startpage | 25388 | |
| dc.identifier.uri | http://dx.doi.org/10.1109/CVPR52734.2025.02364 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14288/32830 | |
| dc.identifier.wos | 001601181100128 | |
| dc.keywords | Object-centric learning | |
| dc.keywords | Attention mechanisms | |
| dc.keywords | Unsupervised segmentation | |
| dc.language | eng | |
| dc.publisher | IEEE | |
| dc.relation.affiliation | Koç University | |
| dc.relation.collection | Koç University Institutional Repository | |
| dc.relation.ispartof | 2025 IEEE CVF Conference on Computer Vision and Pattern Recognition (CVPR) | |
| dc.relation.openaccess | N/A | |
| dc.rights | N/A | |
| dc.rights.uri | N/A | |
| dc.subject | Computer science | |
| dc.title | Hierarchical compact clustering attention (COCA) for unsupervised object-centric learning | |
| dc.type | Conference Proceeding | |
| dspace.entity.type | Publication | |
| relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
| relation.isOrgUnitOfPublication | 77d67233-829b-4c3a-a28f-bd97ab5c12c7 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
| relation.isParentOrgUnitOfPublication | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 | |
| relation.isParentOrgUnitOfPublication | d437580f-9309-4ecb-864a-4af58309d287 | |
| relation.isParentOrgUnitOfPublication.latestForDiscovery | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 |
