The noisy channel mode for unsupervised word sense disambiguation

Publication:
The noisy channel mode for unsupervised word sense disambiguation

Files

559.pdf (248.25 KB)

Departments

Organizational Unit

Department of Computer Engineering

School / College / Institute

Organizational Unit

College of Engineering

KU-Authors

Yatbaz, Mehmet Ali

Yüret, Deniz

Publication Date

2010

Type

Journal Article

Embargo Status

NO

Abstract

We introduce a generative probabilistic model, the noisy channel model, for unsupervised word sense disambiguation. In our model, each context C is modeled as a distinct channel through which the speaker intends to transmit a particular meaning S using a possibly ambiguous word W. To reconstruct the intended meaning the hearer uses the distribution of possible meanings in the given context P(S|C) and possible words that can express each meaning P(W|S). We assume P(W|S) is independent of the context and estimate it using WordNet sense frequencies. The main problem of unsupervised WSD is estimating context-dependent P(S|C) without access to any sense-tagged text. We show one way to solve this problem using a statistical language model based on large amounts of untagged text. Our model uses coarse-grained semantic classes for S internally and we explore the effect of using different levels of granularity on WSD performance. The system outputs fine-grained senses for evaluation, and its performance on noun disambiguation is better than most previously reported unsupervised systems and close to the best supervised systems.

Publisher

Massachusetts Institute of Technology (MIT) Press

Subject

Physics

Source

Computational Linguistics

DOI

10.1162/coli.2010.36.1.36103

URI

https://hdl.handle.net/20.500.14288/4019

Collections

Publications with Fulltext

Full item page

1

Views

18

Downloads

View PlumX Details

Publication:
The noisy channel mode for unsupervised word sense disambiguation

Files

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Publication Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

1

Views

18

Downloads

Publication: The noisy channel mode for unsupervised word sense disambiguation

Files

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Publication Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

1

Views

18

Downloads

Publication:
The noisy channel mode for unsupervised word sense disambiguation