Publication:
Semi-supervised learning with induced word senses for state of the art word sense disambiguation

dc.contributor.coauthorJurgens, David
dc.contributor.departmentN/A
dc.contributor.kuauthorBaşkaya, Osman
dc.contributor.kuprofileMaster Student
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.contributor.yokidN/A
dc.date.accessioned2024-11-09T23:44:03Z
dc.date.issued2016
dc.description.abstractWord Sense Disambiguation (WSD) aims to determine the meaning of a word in context, and successful approaches are known to bene fit many applications in Natural Language Processing. Although supervised learning has been shown to provide superior WSD performance, current sense-annotated corpora do not contain a sufficient number of instances per word type to train supervised systems for all words. While unsupervised techniques have been proposed to overcome this data sparsity problem, such techniques have not outperformed supervised methods. In this paper, we propose a new approach to building semi-supervised WSD systems that combines a small amount of sense-annotated data with information from Word Sense Induction, a fully-unsupervised technique that automatically learns the different senses of a word based on how it is used. In three experiments, we show how sense induction models may be effectively combined to ultimately produce high-performance semi-supervised WSD systems that exceed the performance of state-of-the-art supervised WSD techniques trained on the same sense-annotated data. We anticipate that our results and released software will also bene fit evaluation practices for sense induction systems and those working in low-resource languages by demonstrating how to quickly produce accurate WSD systems with minimal annotation effort.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.openaccessYES
dc.description.publisherscopeInternational
dc.description.volume55
dc.identifier.doi10.1613/jair.4917
dc.identifier.eissn1943-5037
dc.identifier.issn1076-9757
dc.identifier.quartileQ2
dc.identifier.scopus2-s2.0-84964596699
dc.identifier.urihttp://dx.doi.org/10.1613/jair.4917
dc.identifier.urihttps://hdl.handle.net/20.500.14288/13587
dc.identifier.wos375392800001
dc.keywordsFramework
dc.languageEnglish
dc.publisherAI Access Foundation
dc.sourceJournal of Artificial Intelligence Research
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.titleSemi-supervised learning with induced word senses for state of the art word sense disambiguation
dc.typeJournal Article
dspace.entity.typePublication
local.contributor.authoridN/A
local.contributor.kuauthorBaşkaya, Osman

Files