Publication: Transfer learning for low-resource neural machine translation
dc.contributor.coauthor | Zoph, Barret | |
dc.contributor.coauthor | May, Jonathan | |
dc.contributor.coauthor | Knight, Kevin | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Yüret, Deniz | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | 179996 | |
dc.date.accessioned | 2024-11-09T12:25:05Z | |
dc.date.issued | 2016 | |
dc.description.abstract | The encoder-decoder framework for neural machine translation (NMT) has been shown effective in large data scenarios, but is much less effective for low-resource languages. We present a transfer learning method that significantly improves BLEU scores across a range of low-resource languages. Our key idea is to first train a high-resource language pair (the parent model), then transfer some of the learned parameters to the low-resource pair (the child model) to initialize and constrain training. Using our transfer learning method we improve baseline NMT models by an average of 5.6 BLEU on four low-resource language pairs. Ensembling and unknown word replacement add another 2 BLEU which brings the NMT performance on low-resource machine translation close to a strong syntax based machine translation (SBMT) system, exceeding its performance on one language pair. Additionally, using the transfer learning model for re-scoring, we can improve the SBMT system by an average of 1.3 BLEU, improving the state-of-the-art on low-resource machine translation. | |
dc.description.fulltext | YES | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | TÜBİTAK | |
dc.description.sponsorship | ARL/ARO | |
dc.description.sponsorship | DARPA | |
dc.description.sponsorship | Scientific and Technological Research Council of Turkey (TÜBİTAK) | |
dc.description.version | Publisher version | |
dc.format | ||
dc.identifier.doi | 10.18653/v1/D16-1163 | |
dc.identifier.embargo | NO | |
dc.identifier.filenameinventoryno | IR01959 | |
dc.identifier.isbn | 9781945626258 | |
dc.identifier.link | https://doi.org/10.18653/v1/D16-1163 | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-85072838695 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/1538 | |
dc.keywords | Computational linguistics | |
dc.keywords | Computer aided language translation | |
dc.keywords | Natural language processing systems | |
dc.language | English | |
dc.publisher | Association for Computational Linguistics (ACL) | |
dc.relation.grantno | W911NF-10-1-0533 | |
dc.relation.grantno | HR0011-15-C-0115 | |
dc.relation.grantno | 114E628 and 215E201 | |
dc.relation.uri | http://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/8511 | |
dc.source | Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing | |
dc.subject | Computer engineering | |
dc.subject | Learning systems | |
dc.title | Transfer learning for low-resource neural machine translation | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0002-7039-0046 | |
local.contributor.kuauthor | Yüret, Deniz | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |
Files
Original bundle
1 - 1 of 1