Semi-supervised learning with induced word senses for state of the art word sense disambiguation

Publication:
Semi-supervised learning with induced word senses for state of the art word sense disambiguation

Departments

Organizational Unit

Graduate School of Sciences and Engineering

School / College / Institute

Organizational Unit

GRADUATE SCHOOL OF SCIENCES AND ENGINEERING

Upper Org Unit

KU-Authors

Başkaya, Osman

Co-Authors

Jurgens, David

Date

2016

Type

Journal Article

Embargo Status

N/A

Abstract

Word Sense Disambiguation (WSD) aims to determine the meaning of a word in context, and successful approaches are known to bene fit many applications in Natural Language Processing. Although supervised learning has been shown to provide superior WSD performance, current sense-annotated corpora do not contain a sufficient number of instances per word type to train supervised systems for all words. While unsupervised techniques have been proposed to overcome this data sparsity problem, such techniques have not outperformed supervised methods. In this paper, we propose a new approach to building semi-supervised WSD systems that combines a small amount of sense-annotated data with information from Word Sense Induction, a fully-unsupervised technique that automatically learns the different senses of a word based on how it is used. In three experiments, we show how sense induction models may be effectively combined to ultimately produce high-performance semi-supervised WSD systems that exceed the performance of state-of-the-art supervised WSD techniques trained on the same sense-annotated data. We anticipate that our results and released software will also bene fit evaluation practices for sense induction systems and those working in low-resource languages by demonstrating how to quickly produce accurate WSD systems with minimal annotation effort.

Publisher

AI Access Foundation

Subject

Computer science, Artificial intelligence

Source

Journal of Artificial Intelligence Research

DOI

10.1613/jair.4917

URI

https://doi.org/10.1613/jair.4917
https://hdl.handle.net/20.500.14288/13587

Rights

N/A

Collections

Publications without Fulltext

Full item page

0

Views

0

Downloads

View PlumX Details

Publication: Semi-supervised learning with induced word senses for state of the art word sense disambiguation

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Editor & Affiliation

Compiler & Affiliation

Translator

Other Contributor

Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

Related Goal

0

Views

0

Downloads

Publication:
Semi-supervised learning with induced word senses for state of the art word sense disambiguation