Publication:
AI-KU: using substitute vectors and co-occurrence modeling for word sense induction and disambiguation

dc.conference.dateJUN 14-15, 2013
dc.conference.locationAtlanta, Georgia, ABD
dc.conference.organizerAssociation for Computational Linguistics
dc.contributor.departmentKUIS AI (Koç University & İş Bank Artificial Intelligence Center)
dc.contributor.facultymemberYes
dc.contributor.kuauthorBaşkaya, Osman
dc.contributor.kuauthorCirik, Volkan
dc.contributor.kuauthorYüret, Deniz
dc.contributor.schoolcollegeinstituteResearch Center
dc.date.accessioned2024-11-09T22:49:59Z
dc.date.issued2013
dc.description.abstractWord sense induction aims to discover different senses of a word from a corpus by using unsupervised learning approaches. Once a sense inventory is obtained for an ambiguous word, word sense discrimination approaches choose the best-fitting single sense for a given context from the induced sense inventory. However, there may not be a clear distinction between one sense and another, although for a context, more than one induced sense can be suitable. Graded word sense method allows for labeling a word in more than one sense. In contrast to the most common approach which is to apply clustering or graph partitioning on a representation of first or second order co-occurrences of a word, we propose a system that creates a substitute vector for each target word from the most likely substitutes suggested by a statistical language model. Word samples are then taken according to probabilities of these substitutes and the results of the co-occurrence model are clustered. This approach outperforms the other systems on graded word sense induction task in SemEval-2013.
dc.description.fulltextYes
dc.description.harvestedfromManual
dc.description.indexedbyScopus
dc.description.openaccessGold OA
dc.description.peerreviewstatusN/A
dc.description.publisherscopeInternational
dc.description.readpublishN/A
dc.description.sponsoredbyTubitakEuN/A
dc.description.sponsorshipThe ACL Special Interest Group on Computational Semantics (SIGSEM).The ACL Special Interest Group on the Lexicon (SIGLEX). The US Defense Advanced Research Projects Agency (DARPA)
dc.description.versionPublished Version
dc.identifier.embargoNo
dc.identifier.isbn9781-9372-8449-7
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85040566343
dc.identifier.urihttps://hdl.handle.net/20.500.14288/6571
dc.keywordsComputational linguistics
dc.keywordsNatural language processing systems
dc.keywordsCo-occurrence
dc.keywordsGraph partitioning
dc.keywordsMost likely
dc.keywordsSecond orders
dc.keywordsSense inventories
dc.keywordsStatistical language modeling
dc.keywordsTarget words
dc.keywordsWord sense inductions
dc.keywordsSemantics
dc.language.isoeng
dc.publisherAssociation for Computational Linguistics
dc.relation.affiliationKoç University
dc.relation.collectionKoç University Institutional Repository
dc.relation.ispartofSEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics
dc.relation.openaccessYes
dc.rightsCC BY-NC-SA (Attribution-NonCommercial-ShareAlike)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/4.0/
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.titleAI-KU: using substitute vectors and co-occurrence modeling for word sense induction and disambiguation
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorBaşkaya, Osman
local.contributor.kuauthorSert, Enis Rıfat
local.contributor.kuauthorCirik, Volkan
local.contributor.kuauthorYüret, Deniz
relation.isOrgUnitOfPublication77d67233-829b-4c3a-a28f-bd97ab5c12c7
relation.isOrgUnitOfPublication.latestForDiscovery77d67233-829b-4c3a-a28f-bd97ab5c12c7
relation.isParentOrgUnitOfPublicationd437580f-9309-4ecb-864a-4af58309d287
relation.isParentOrgUnitOfPublication.latestForDiscoveryd437580f-9309-4ecb-864a-4af58309d287

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
IR06859.pdf
Size:
135.69 KB
Format:
Adobe Portable Document Format