Publication: AI-KU: using substitute vectors and co-occurrence modeling for word sense induction and disambiguation
| dc.conference.date | JUN 14-15, 2013 | |
| dc.conference.location | Atlanta, Georgia, ABD | |
| dc.conference.organizer | Association for Computational Linguistics | |
| dc.contributor.department | KUIS AI (Koç University & İş Bank Artificial Intelligence Center) | |
| dc.contributor.facultymember | Yes | |
| dc.contributor.kuauthor | Başkaya, Osman | |
| dc.contributor.kuauthor | Cirik, Volkan | |
| dc.contributor.kuauthor | Yüret, Deniz | |
| dc.contributor.schoolcollegeinstitute | Research Center | |
| dc.date.accessioned | 2024-11-09T22:49:59Z | |
| dc.date.issued | 2013 | |
| dc.description.abstract | Word sense induction aims to discover different senses of a word from a corpus by using unsupervised learning approaches. Once a sense inventory is obtained for an ambiguous word, word sense discrimination approaches choose the best-fitting single sense for a given context from the induced sense inventory. However, there may not be a clear distinction between one sense and another, although for a context, more than one induced sense can be suitable. Graded word sense method allows for labeling a word in more than one sense. In contrast to the most common approach which is to apply clustering or graph partitioning on a representation of first or second order co-occurrences of a word, we propose a system that creates a substitute vector for each target word from the most likely substitutes suggested by a statistical language model. Word samples are then taken according to probabilities of these substitutes and the results of the co-occurrence model are clustered. This approach outperforms the other systems on graded word sense induction task in SemEval-2013. | |
| dc.description.fulltext | Yes | |
| dc.description.harvestedfrom | Manual | |
| dc.description.indexedby | Scopus | |
| dc.description.openaccess | Gold OA | |
| dc.description.peerreviewstatus | N/A | |
| dc.description.publisherscope | International | |
| dc.description.readpublish | N/A | |
| dc.description.sponsoredbyTubitakEu | N/A | |
| dc.description.sponsorship | The ACL Special Interest Group on Computational Semantics (SIGSEM).The ACL Special Interest Group on the Lexicon (SIGLEX). The US Defense Advanced Research Projects Agency (DARPA) | |
| dc.description.version | Published Version | |
| dc.identifier.embargo | No | |
| dc.identifier.isbn | 9781-9372-8449-7 | |
| dc.identifier.quartile | N/A | |
| dc.identifier.scopus | 2-s2.0-85040566343 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14288/6571 | |
| dc.keywords | Computational linguistics | |
| dc.keywords | Natural language processing systems | |
| dc.keywords | Co-occurrence | |
| dc.keywords | Graph partitioning | |
| dc.keywords | Most likely | |
| dc.keywords | Second orders | |
| dc.keywords | Sense inventories | |
| dc.keywords | Statistical language modeling | |
| dc.keywords | Target words | |
| dc.keywords | Word sense inductions | |
| dc.keywords | Semantics | |
| dc.language.iso | eng | |
| dc.publisher | Association for Computational Linguistics | |
| dc.relation.affiliation | Koç University | |
| dc.relation.collection | Koç University Institutional Repository | |
| dc.relation.ispartof | SEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics | |
| dc.relation.openaccess | Yes | |
| dc.rights | CC BY-NC-SA (Attribution-NonCommercial-ShareAlike) | |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-sa/4.0/ | |
| dc.subject | Computer science | |
| dc.subject | Artificial intelligence | |
| dc.title | AI-KU: using substitute vectors and co-occurrence modeling for word sense induction and disambiguation | |
| dc.type | Conference Proceeding | |
| dspace.entity.type | Publication | |
| local.contributor.kuauthor | Başkaya, Osman | |
| local.contributor.kuauthor | Sert, Enis Rıfat | |
| local.contributor.kuauthor | Cirik, Volkan | |
| local.contributor.kuauthor | Yüret, Deniz | |
| relation.isOrgUnitOfPublication | 77d67233-829b-4c3a-a28f-bd97ab5c12c7 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 77d67233-829b-4c3a-a28f-bd97ab5c12c7 | |
| relation.isParentOrgUnitOfPublication | d437580f-9309-4ecb-864a-4af58309d287 | |
| relation.isParentOrgUnitOfPublication.latestForDiscovery | d437580f-9309-4ecb-864a-4af58309d287 |
Files
Original bundle
1 - 1 of 1
