FASTSUBS: an efficient and exact procedure for finding the most likely lexical substitutes based on an N-gram language model

Publication:
FASTSUBS: an efficient and exact procedure for finding the most likely lexical substitutes based on an N-gram language model

Files

1266.pdf (94.36 KB)

Departments

Organizational Unit

Department of Computer Engineering

School / College / Institute

Organizational Unit

College of Engineering

KU-Authors

Yüret, Deniz

Date

2012

Type

Journal Article

Embargo Status

NO

Abstract

Lexical substitutes have found use in areas such as paraphrasing, text simplification, machine translation, word sense disambiguation, and part of speech induction. However the computational complexity of accurately identifying the most likely substitutes for a word has made large scale experiments difficult. In this letter we introduce a new search algorithm, FASTSUBS, that is guaranteed to find the K most likely lexical substitutes for a given word in a sentence based on an n-gram language model. The computation is sublinear in both K and the vocabulary size V. An implementation of the algorithm and a dataset with the top 100 substitutes of each token in the WSJ section of the Penn Treebank are available at https://goo.gl/jzKH0.

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Subject

Engineering, Electrical and electronic

Source

The IEEE Signal Processing Letters

DOI

10.1109/LSP.2012.2215587

URI

https://hdl.handle.net/20.500.14288/1498

Collections

Publications with Fulltext

Full item page

0

Views

12

Downloads

View PlumX Details

Publication: FASTSUBS: an efficient and exact procedure for finding the most likely lexical substitutes based on an N-gram language model

Files

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Editor & Affiliation

Compiler & Affiliation

Translator

Other Contributor

Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

Related Goal

0

Views

12

Downloads

Publication:
FASTSUBS: an efficient and exact procedure for finding the most likely lexical substitutes based on an N-gram language model