Publication: KU_ai at MEDIQA 2019: domain-specific pre-training and transfer learning for medical NLI
dc.contributor.department | N/A | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Cengiz, Cemil | |
dc.contributor.kuauthor | Sert, Ulaş | |
dc.contributor.kuauthor | Yüret, Deniz | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 179996 | |
dc.date.accessioned | 2024-11-09T11:50:26Z | |
dc.date.issued | 2019 | |
dc.description.abstract | In this paper, we describe our system and the results submitted for the Natural Language Inference (NLI) track of the MEDIQA 2019 Shared Task (Ben Abacha et al., 2019). As the KU_ai team, we used BERT (Devlin et al., 2018) as our baseline model and pre-processed the MedNLI dataset to mitigate the negative impact of de-identification artifacts. Moreover, we investigated different pre-training and transfer learning approaches to improve performance. We show that pre-training the language model on rich biomedical corpora has a significant effect in teaching the model domain-specific language. In addition, training the model on large NLI datasets such as MultiNLI and SNLI helps it learn task-specific reasoning. Finally, we ensembled our highest-performing models, achieved 84.7% accuracy on the unseen test dataset, and ranked 10th out of 17 teams in the official results. | |
dc.description.fulltext | YES | |
dc.description.indexedby | WoS | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | N/A | |
dc.description.sponsorship | Huawei Turkey R&D Center, Huawei Graduate Research Support Scholarship | |
dc.description.version | Publisher version | |
dc.format | ||
dc.identifier.doi | 10.18653/v1/W19-5045 | |
dc.identifier.embargo | NO | |
dc.identifier.filenameinventoryno | IR02193 | |
dc.identifier.isbn | 9781950737284 | |
dc.identifier.link | https://doi.org/10.18653/v1/W19-5045 | |
dc.identifier.quartile | N/A | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/678 | |
dc.identifier.wos | 521946800045 | |
dc.language | English | |
dc.publisher | Association for Computational Linguistics (ACL) | |
dc.relation.grantno | NA | |
dc.relation.uri | http://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/8833 | |
dc.source | Proceedings of the BioNLP 2019 Workshop | |
dc.subject | Computer science, artificial intelligence | |
dc.subject | Medical informatics | |
dc.title | KU_ai at MEDIQA 2019: domain-specific pre-training and transfer learning for medical NLI | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | N/A | |
local.contributor.authorid | N/A | |
local.contributor.authorid | 0000-0002-7039-0046 | |
local.contributor.kuauthor | Cengiz, Cemil | |
local.contributor.kuauthor | Sert, Ulaş | |
local.contributor.kuauthor | Yüret, Deniz | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |