Publication:
KU_ai at MEDIQA 2019: domain-specific pre-training and transfer learning for medical NLI

dc.contributor.department: Department of Computer Engineering
dc.contributor.department: Graduate School of Sciences and Engineering
dc.contributor.kuauthor: Cengiz, Cemil
dc.contributor.kuauthor: Sert, Ulaş
dc.contributor.kuauthor: Yüret, Deniz
dc.contributor.schoolcollegeinstitute: College of Engineering
dc.contributor.schoolcollegeinstitute: GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned: 2024-11-09T11:50:26Z
dc.date.issued: 2019
dc.description.abstract: In this paper, we describe our system and results submitted for the Natural Language Inference (NLI) track of the MEDIQA 2019 Shared Task (Ben Abacha et al., 2019). As the KU_ai team, we used BERT (Devlin et al., 2018) as our baseline model and pre-processed the MedNLI dataset to mitigate the negative impact of de-identification artifacts. Moreover, we investigated different pre-training and transfer learning approaches to improve performance. We show that pre-training the language model on rich biomedical corpora has a significant effect in teaching the model domain-specific language. In addition, training the model on large NLI datasets such as MultiNLI and SNLI helps in learning task-specific reasoning. Finally, we ensembled our highest-performing models and achieved 84.7% accuracy on the unseen test dataset, ranking 10th out of 17 teams in the official results.
dc.description.fulltext: YES
dc.description.indexedby: WOS
dc.description.openaccess: YES
dc.description.publisherscope: International
dc.description.sponsoredbyTubitakEu: N/A
dc.description.sponsorship: Huawei Turkey R&D Center, Huawei Graduate Research Support Scholarship
dc.description.version: Publisher version
dc.identifier.doi: 10.18653/v1/W19-5045
dc.identifier.embargo: NO
dc.identifier.filenameinventoryno: IR02193
dc.identifier.isbn: 9781950737284
dc.identifier.quartile: N/A
dc.identifier.uri: https://doi.org/10.18653/v1/W19-5045
dc.identifier.wos: 521946800045
dc.language.iso: eng
dc.publisher: Association for Computational Linguistics (ACL)
dc.relation.grantno: NA
dc.relation.ispartof: Proceedings of the BioNLP 2019 Workshop
dc.relation.uri: http://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/8833
dc.subject: Computer science, artificial intelligence
dc.subject: Medical informatics
dc.title: KU_ai at MEDIQA 2019: domain-specific pre-training and transfer learning for medical NLI
dc.type: Conference Proceeding
dspace.entity.type: Publication
local.contributor.kuauthor: Cengiz, Cemil
local.contributor.kuauthor: Sert, Ulaş
local.contributor.kuauthor: Yüret, Deniz
local.publication.orgunit1: GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1: College of Engineering
local.publication.orgunit2: Department of Computer Engineering
local.publication.orgunit2: Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication: 89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication: 3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery: 89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication: 8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication: 434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery: 8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Name: 8833.pdf
Size: 365.12 KB
Format: Adobe Portable Document Format