Publication: Unsupervised part of speech tagging using unambiguous substitutes from a statistical language model
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Graduate School of Sciences and Engineering | |
dc.contributor.kuauthor | Yatbaz, Mehmet Ali | |
dc.contributor.kuauthor | Yüret, Deniz | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | GRADUATE SCHOOL OF SCIENCES AND ENGINEERING | |
dc.date.accessioned | 2024-11-09T22:51:07Z | |
dc.date.issued | 2010 | |
dc.description.abstract | We show that unsupervised part of speech tagging performance can be significantly improved using likely substitutes for target words given by a statistical language model. We choose unambiguous substitutes for each occurrence of an ambiguous target word based on its context. The part of speech tags for the unambiguous substitutes are then used to filter the entry for the target word in the word-tag dictionary. A standard HMM model trained using the filtered dictionary achieves 92.25% accuracy on a standard 24,000 word corpus. | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | N/A | |
dc.description.sponsorship | National Natural Science Foundation of China | |
dc.description.sponsorship | Dep. Lang. Inf. Adm., Minist. Educ. | |
dc.description.sponsorship | BaiDu | |
dc.description.sponsorship | ||
dc.description.sponsorship | Fujitsu R and D Center CO., LTD. | |
dc.description.volume | 2 | |
dc.identifier.link | https://www.scopus.com/inward/record.uri?eid=2-s2.0-80053400067andpartnerID=40andmd5=9d00490aa573a3ff0f42e50375e15a0e | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-80053400067 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/6771 | |
dc.keywords | HMM models | |
dc.keywords | Part of speech tagging | |
dc.keywords | Part-of-speech tags | |
dc.keywords | Statistical language models | |
dc.keywords | Computational linguistics | |
dc.language.iso | eng | |
dc.publisher | COLING | |
dc.relation.ispartof | Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference | |
dc.subject | Computer engineering | |
dc.title | Unsupervised part of speech tagging using unambiguous substitutes from a statistical language model | |
dc.type | Conference Proceeding | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Yüret, Deniz | |
local.contributor.kuauthor | Yatbaz, Mehmet Ali | |
local.publication.orgunit1 | College of Engineering | |
local.publication.orgunit1 | GRADUATE SCHOOL OF SCIENCES AND ENGINEERING | |
local.publication.orgunit2 | Department of Computer Engineering | |
local.publication.orgunit2 | Graduate School of Sciences and Engineering | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication | 3fc31c89-e803-4eb1-af6b-6258bc42c3d8 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isParentOrgUnitOfPublication | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 | |
relation.isParentOrgUnitOfPublication | 434c9663-2b11-4e66-9399-c863e2ebae43 | |
relation.isParentOrgUnitOfPublication.latestForDiscovery | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 |