Publication: Learning morphological disambiguation rules for Turkish
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Türe, Ferhan | |
dc.contributor.kuauthor | Yüret, Deniz | |
dc.contributor.kuprofile | Undergraduate Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 179996 | |
dc.date.accessioned | 2024-11-09T23:46:54Z | |
dc.date.issued | 2006 | |
dc.description.abstract | In this paper, we present a rule based model for morphological disambiguation of Turkish. The rules are generated by a novel decision list learning algorithm using supervised training. Morphological ambiguity (e.g. lives = live+s or life+s) is a challenging problem for agglutinative languages like Turkish where close to half of the words in running text are morphologically ambiguous. Furthermore, it is possible for a word to take an unlimited number of suffixes, therefore the number of possible morphological tags is unlimited. We attempted to cope with these problems by training a separate model for each of the 126 morphological features recognized by the morphological analyzer. The resulting decision lists independently vote on each of the potential parses of a word and the final parse is selected based on our confidence on these votes. The accuracy of our model (96%) is slightly above the best previously reported results which use statistical models. For comparison, when we train a single decision list on full tags instead of using separate models on each feature we get 91% accuracy. | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | N/A | |
dc.identifier.doi | 10.3115/1220835.1220877 | |
dc.identifier.link | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84858435058&doi=10.3115%2f1220835.1220877&partnerID=40&md5=39587bf525c9097f0d248365b4c392d3 | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-84858435058 | |
dc.identifier.uri | https://aclanthology.org/N06-1042/ | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/14038 | |
dc.keywords | Computational linguistics | |
dc.keywords | Learning algorithms | |
dc.keywords | Agglutinative language | |
dc.keywords | Decision lists | |
dc.keywords | Morphological analyzer | |
dc.keywords | Morphological disambiguation | |
dc.keywords | Morphological features | |
dc.keywords | Rule-based models | |
dc.keywords | Single decision | |
dc.keywords | Supervised trainings | |
dc.keywords | Text processing | |
dc.language | English | |
dc.publisher | Association for Computational Linguistics (ACL) | |
dc.source | HLT-NAACL 2006 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings of the Main Conference | |
dc.subject | Computer engineering | |
dc.title | Learning morphological disambiguation rules for Turkish | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | N/A | |
local.contributor.authorid | 0000-0002-7039-0046 | |
local.contributor.kuauthor | Türe, Ferhan | |
local.contributor.kuauthor | Yüret, Deniz | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |