Modeling morphologically rich languages using splitwords and unstructured dependencies


dc.contributor.department	Department of Computer Engineering
dc.contributor.department	Graduate School of Sciences and Engineering
dc.contributor.facultymember	Yes
dc.contributor.kuauthor	Biçici, Ergün
dc.contributor.kuauthor	Yüret, Deniz
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.contributor.schoolcollegeinstitute	Graduate School of Sciences and Engineering
dc.date.accessioned	2024-11-09T23:50:51Z
dc.date.issued	2009
dc.description.abstract	We experiment with splitting words into their stem and suffix components for modeling morphologically rich languages. We show that using a morphological analyzer and disambiguator results in a significant perplexity reduction in Turkish. We present flexible n-gram models, Flex-Grams, which assume that the n-1 tokens that determine the probability of a given token can be chosen anywhere in the sentence rather than the preceding n-1 positions. Our final model achieves 27% perplexity reduction compared to the standard n-gram model.
dc.description.fulltext	No
dc.description.harvestedfrom	Manual
dc.description.indexedby	Scopus
dc.description.openaccess	YES
dc.description.peerreviewstatus	N/A
dc.description.publisherscope	International
dc.description.readpublish	N/A
dc.description.sponsoredbyTubitakEu	N/A
dc.description.sponsorship	Asian Federation of Natural Language Processing (AFNLP)
dc.description.sponsorship	Association for Computational Linguistics (ACL)
dc.description.studentonlypublication	No
dc.description.studentpublication	Yes
dc.description.version	N/A
dc.identifier.WoSQuartile	To be checked
dc.identifier.doi	10.3115/1667583.1667690
dc.identifier.embargo	N/A
dc.identifier.isbn	978-1-61738-258-1
dc.identifier.scopus	2-s2.0-84859062288
dc.identifier.uri	https://doi.org/10.3115/1667583.1667690
dc.identifier.uri	https://hdl.handle.net/20.500.14288/14609
dc.keywords	Computational linguistics
dc.keywords	Natural language processing systems
dc.keywords	Text processing
dc.keywords	Morphological analyzer
dc.keywords	N-gram modeling
dc.keywords	N-gram models
dc.keywords	Turkish
dc.keywords	% reductions
dc.keywords	Splittings
dc.keywords	Modeling languages
dc.language.iso	eng
dc.publisher	Association for Computational Linguistics (ACL)
dc.relation.affiliation	Koç University
dc.relation.collection	Koç University Institutional Repository
dc.relation.ispartof	ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.
dc.relation.openaccess	N/A
dc.rights	N/A
dc.subject	Computer engineering
dc.title	Modeling morphologically rich languages using splitwords and unstructured dependencies
dc.type	Conference Proceeding
dspace.entity.type	Publication
local.contributor.kuauthor	Yüret, Deniz
local.contributor.kuauthor	Biçici, Ergün
relation.isOrgUnitOfPublication	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication	3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication	434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Collections

Publications without Fulltext
