Publication:
HiSEG: Human assisted instance segmentation

dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorSezgin, Tevfik Metin
dc.contributor.otherDepartment of Computer Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.date.accessioned2024-12-29T09:41:23Z
dc.date.issued2024
dc.description.abstractInstance segmentation is a form of image detection which has a range of applications, such as object refinement, medical image analysis, and image/video editing, all of which demand a high degree of accuracy. However, this precision is often beyond the reach of what even state-of-the-art, fully automated instance segmentation algorithms can deliver. The performance gap becomes particularly prohibitive for small and complex objects. Practitioners typically resort to fully manual annotation, which can be a laborious process. In order to overcome this problem, we propose a novel approach to enable more precise predictions and generate higher-quality segmentation masks for high-curvature, complex and small-scale objects. Our human-assisted segmentation method, HiSEG, augments the existing Strong Mask R-CNN network to incorporate human-specified partial boundaries. We also present a dataset of hand-drawn partial object boundaries, which we refer to as “human attention maps”. In addition, the Partial Sketch Object Boundaries (PSOB) dataset contains hand-drawn partial object boundaries which represent curvatures of an object's ground truth mask with several pixels. Through extensive evaluation using the PSOB dataset, we show that HiSEG outperforms state-of-the art methods such as Mask R-CNN, Strong Mask R-CNN, Mask2Former, and Segment Anything, achieving respective increases of +42.0, +34.9, +29.9, and +13.4 points in APMask metrics for these four models. We hope that our novel approach will set a baseline for future human-aided deep learning models by combining fully automated and interactive instance segmentation architectures.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.publisherscopeInternational
dc.description.volume124
dc.identifier.doi10.1016/j.cag.2024.104061
dc.identifier.eissn1873-7684
dc.identifier.issn0097-8493
dc.identifier.quartileQ2
dc.identifier.scopus2-s2.0-85202354327
dc.identifier.urihttps://doi.org/10.1016/j.cag.2024.104061
dc.identifier.urihttps://hdl.handle.net/20.500.14288/23620
dc.identifier.wos1305644400001
dc.keywordsData fusion
dc.keywordsHuman-aided computer vision
dc.keywordsInstance segmentation
dc.keywordsInteractive segmentation
dc.keywordsPen-based partial annotation
dc.languageen
dc.publisherElsevier Ltd
dc.sourceComputers and Graphics
dc.subjectComputer engineering
dc.titleHiSEG: Human assisted instance segmentation
dc.typeJournal article
dspace.entity.typePublication
local.contributor.kuauthorSezgin, Tevfik Metin
local.contributor.kuauthorKorkmaz, Muhammed
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae

Files