Publication:
Learning morphological disambiguation rules for Turkish

dc.contributor.department: Department of Computer Engineering
dc.contributor.kuauthor: Türe, Ferhan
dc.contributor.kuauthor: Yüret, Deniz
dc.contributor.schoolcollegeinstitute: College of Engineering
dc.date.accessioned: 2024-11-09T23:46:54Z
dc.date.issued: 2006
dc.description.abstract: In this paper, we present a rule-based model for morphological disambiguation of Turkish. The rules are generated by a novel decision list learning algorithm using supervised training. Morphological ambiguity (e.g. lives = live+s or life+s) is a challenging problem for agglutinative languages like Turkish, where close to half of the words in running text are morphologically ambiguous. Furthermore, a word can take an unlimited number of suffixes, so the number of possible morphological tags is also unlimited. We attempted to cope with these problems by training a separate model for each of the 126 morphological features recognized by the morphological analyzer. The resulting decision lists independently vote on each of the potential parses of a word, and the final parse is selected based on our confidence in these votes. The accuracy of our model (96%) is slightly above the best previously reported results, which use statistical models. For comparison, when we train a single decision list on full tags instead of using separate models for each feature, we get 91% accuracy.
dc.description.fulltext: No
dc.description.harvestedfrom: Manual
dc.description.indexedby: Scopus
dc.description.openaccess: YES
dc.description.peerreviewstatus: N/A
dc.description.publisherscope: International
dc.description.readpublish: N/A
dc.description.sponsoredbyTubitakEu: N/A
dc.description.version: N/A
dc.identifier.doi: 10.3115/1220835.1220877
dc.identifier.embargo: N/A
dc.identifier.quartile: To be checked
dc.identifier.scopus: 2-s2.0-84858435058
dc.identifier.uri: https://www.scopus.com/inward/record.uri?eid=2-s2.0-84858435058&doi=10.3115%2f1220835.1220877&partnerID=40&md5=39587bf525c9097f0d248365b4c392d3
dc.identifier.uri: https://aclanthology.org/N06-1042/
dc.identifier.uri: https://hdl.handle.net/20.500.14288/14038
dc.keywords: Computational linguistics
dc.keywords: Learning algorithms
dc.keywords: Agglutinative language
dc.keywords: Decision lists
dc.keywords: Morphological analyzer
dc.keywords: Morphological disambiguation
dc.keywords: Morphological features
dc.keywords: Rule-based models
dc.keywords: Single decision
dc.keywords: Supervised trainings
dc.keywords: Text processing
dc.language.iso: eng
dc.publisher: Association for Computational Linguistics (ACL)
dc.relation.affiliation: Koç University
dc.relation.collection: Koç University Institutional Repository
dc.relation.ispartof: HLT-NAACL 2006 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings of the Main Conference
dc.relation.openaccess: N/A
dc.rights: N/A
dc.subject: Computer engineering
dc.title: Learning morphological disambiguation rules for Turkish
dc.type: Conference Proceeding
dspace.entity.type: Publication
