KU ai at MEDIQA 2019: domain-specific pre-training and transfer learning for medical NLI

Publication:
KU ai at MEDIQA 2019: domain-specific pre-training and transfer learning for medical NLI

dc.contributor.department	Department of Computer Engineering
dc.contributor.department	Graduate School of Sciences and Engineering
dc.contributor.kuauthor	Cengiz, Cemil
dc.contributor.kuauthor	Sert, Ulaş
dc.contributor.kuauthor	Yüret, Deniz
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.contributor.schoolcollegeinstitute	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned	2024-11-09T23:39:29Z
dc.date.issued	2019
dc.description.abstract	In this paper, we describe our system and results submitted for the Natural Language Inference (NLI) track of the MEDIQA 2019 Shared Task (Ben Abacha et al., 2019). As KU ai team, we used BERT (Devlin et al., 2018) as our baseline model and pre-processed the MedNLI dataset to mitigate the negative impact of de-identification artifacts. Moreover, we investigated different pre-training and transfer learning approaches to improve the performance. We show that pre-training the language model on rich biomedical corpora has a significant effect in teaching the model domain-specific language. In addition, training the model on large NLI datasets such as MultiNLI and SNLI helps in learning task-specific reasoning. Finally, we ensembled our highest-performing models, and achieved 84.7% accuracy on the unseen test dataset and ranked 10th out of 17 teams in the official results.
dc.description.indexedby	WOS
dc.description.indexedby	Scopus
dc.description.openaccess	YES
dc.description.publisherscope	International
dc.description.sponsoredbyTubitakEu	N/A
dc.identifier.isbn	9781-9507-3728-4
dc.identifier.link	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85107900736&partnerID=40&md5=8dcdc859764782655c63af36da23bf04
dc.identifier.scopus	2-s2.0-85107900736
dc.identifier.uri	https://hdl.handle.net/20.500.14288/13128
dc.keywords	Computational linguistics
dc.keywords	Large dataset
dc.keywords	Learning systems
dc.keywords	Natural language processing systems
dc.keywords	Statistical tests
dc.keywords	Baseline models
dc.keywords	De-identification
dc.keywords	Domain specific
dc.keywords	Language inference
dc.keywords	Language model
dc.keywords	Learning approach
dc.keywords	Natural languages
dc.keywords	Performance
dc.keywords	Pre-training
dc.keywords	Transfer learning
dc.keywords	Problem oriented languages
dc.language.iso	eng
dc.publisher	Association for Computational Linguistics (ACL)
dc.relation.ispartof	BioNLP 2019 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 18th BioNLP Workshop and Shared Task
dc.subject	Computer Science
dc.title	KU ai at MEDIQA 2019: domain-specific pre-training and transfer learning for medical NLI
dc.type	Conference Proceeding
dspace.entity.type	Publication
local.contributor.kuauthor	Yüret, Deniz
local.contributor.kuauthor	Sert, Ulaş
local.contributor.kuauthor	Cengiz, Cemil
local.publication.orgunit1	College of Engineering
local.publication.orgunit1	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit2	Department of Computer Engineering
local.publication.orgunit2	Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication	3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication	434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Collections

Publications without Fulltext

Publication: KU ai at MEDIQA 2019: domain-specific pre-training and transfer learning for medical NLI

Files

Collections

Publication:
KU ai at MEDIQA 2019: domain-specific pre-training and transfer learning for medical NLI