Publication: The noisy channel mode for unsupervised word sense disambiguation
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Yüret, Deniz | |
dc.contributor.kuauthor | Yatbaz, Mehmet Ali | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | PhD Student | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | 179996 | |
dc.contributor.yokid | 192506 | |
dc.date.accessioned | 2024-11-09T13:53:36Z | |
dc.date.issued | 2010 | |
dc.description.abstract | We introduce a generative probabilistic model, the noisy channel model, for unsupervised word sense disambiguation. In our model, each context C is modeled as a distinct channel through which the speaker intends to transmit a particular meaning S using a possibly ambiguous word W. To reconstruct the intended meaning the hearer uses the distribution of possible meanings in the given context P(S|C) and possible words that can express each meaning P(W|S). We assume P(W|S) is independent of the context and estimate it using WordNet sense frequencies. The main problem of unsupervised WSD is estimating context-dependent P(S|C) without access to any sense-tagged text. We show one way to solve this problem using a statistical language model based on large amounts of untagged text. Our model uses coarse-grained semantic classes for S internally and we explore the effect of using different levels of granularity on WSD performance. The system outputs fine-grained senses for evaluation, and its performance on noun disambiguation is better than most previously reported unsupervised systems and close to the best supervised systems. | |
dc.description.fulltext | YES | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.issue | 1 | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | TÜBİTAK | |
dc.description.sponsorship | Scientific and Technological Research Council of Turkey (TÜBİTAK) | |
dc.description.version | Publisher version | |
dc.description.volume | 36 | |
dc.format | ||
dc.identifier.doi | 10.1162/coli.2010.36.1.36103 | |
dc.identifier.embargo | NO | |
dc.identifier.filenameinventoryno | IR00489 | |
dc.identifier.issn | 0891-2017 | |
dc.identifier.link | https://doi.org/10.1162/coli.2010.36.1.36103 | |
dc.identifier.quartile | Q1 | |
dc.identifier.scopus | 2-s2.0-77049109829 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/4019 | |
dc.identifier.wos | 275310400004 | |
dc.language | English | |
dc.publisher | Massachusetts Institute of Technology (MIT) Press | |
dc.relation.grantno | 1.08E+230 | |
dc.relation.uri | http://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/559 | |
dc.source | Computational Linguistics | |
dc.subject | Physics | |
dc.title | The noisy channel mode for unsupervised word sense disambiguation | |
dc.type | Journal Article | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0002-7039-0046 | |
local.contributor.authorid | N/A | |
local.contributor.kuauthor | Yüret, Deniz | |
local.contributor.kuauthor | Yatbaz, Mehmet Ali | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |
Files
Original bundle
1 - 1 of 1