Publication:
Large language models as a rapid and objective tool for pathology report data extraction

dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorBolat, Beyza
dc.contributor.kuauthorEren, Özgür Can
dc.contributor.kuauthorDur Karasayar, Ayşe Hümeyra
dc.contributor.kuauthorMeriçöz, Çisel Aydın
dc.contributor.kuauthorDemir, Çiğdem Gündüz
dc.contributor.kuauthorKulaç, İbrahim
dc.contributor.otherDepartment of Computer Engineering
dc.contributor.researchcenterKoç Üniversitesi İş Bankası Enfeksiyon Hastalıkları Uygulama ve Araştırma Merkezi (EHAM) / Koç University İşbank Center for Infectious Diseases (KU-IS CID)
dc.contributor.researchcenterKoç Üniversitesi İş Bankası Yapay Zeka Uygulama ve Araştırma Merkezi (KUIS AI)/ Koç University İş Bank Artificial Intelligence Center (KUIS AI)
dc.contributor.researchcenterKoç University Research Center for Translational Medicine (KUTTAM) / Koç Üniversitesi Translasyonel Tıp Araştırma Merkezi (KUTTAM)
dc.contributor.schoolcollegeinstituteSchool of Medicine
dc.contributor.schoolcollegeinstituteGraduate School of Health Sciences
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.date.accessioned2024-12-29T09:39:57Z
dc.date.issued2024
dc.description.abstractMedical institutions continuously create a substantial amount of data that is used for scientific research. One of the departments with a great amount of archived data is the pathology department. Pathology archives hold the potential to create a case series of valuable rare entities or large cohorts of common entities. The major problem in creation of these databases is data extraction which is still commonly done manually and is highly laborious and error prone. For these reasons, we offer using large language models to overcome these challenges. Ten pathology reports of selected resection specimens were retrieved from electronic archives of Ko & ccedil; University Hospital for the initial set. These reports were de-identified and uploaded to ChatGPT and Google Bard. Both algorithms were asked to turn the reports in a synoptic report format that is easy to export to a data editor such as Microsoft Excel or Google Sheets. Both programs created tables with Google Bard facilitating the creation of a spreadsheet from the data automatically. In conclusion, we propose the use of AI-assisted data extraction for academic research purposes, as it may enhance efficiency and precision compared to manual data entry.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.indexedbyPubMed
dc.description.indexedbyTR Dizin
dc.description.issue2
dc.description.publisherscopeNational
dc.description.volume40
dc.identifier.doi10.5146/tjpath.2024.13256
dc.identifier.eissn1309-5730
dc.identifier.issn1018-5615
dc.identifier.quartileQ4
dc.identifier.scopus2-s2.0-85193086812
dc.identifier.urihttps://doi.org/10.5146/tjpath.2024.13256
dc.identifier.urihttps://hdl.handle.net/20.500.14288/23174
dc.identifier.wos1229191100008
dc.keywordsLarge language models (LLMs)
dc.keywordsPathology
dc.keywordsGenerative pre-trained transformer-4 (GPT-4)
dc.keywordsChatGPT
dc.keywordsBard
dc.languageen
dc.publisherFederation Turkish Pathology Soc.
dc.sourceTürk Patoloji Dergisi- Turkish Journal of Pathology
dc.subjectPathology
dc.titleLarge language models as a rapid and objective tool for pathology report data extraction
dc.typeLetter
dspace.entity.typePublication
local.contributor.kuauthorBolat, Beyza
local.contributor.kuauthorEren, Özgür Can
local.contributor.kuauthorDur Karasayar, Ayşe Hümeyra
local.contributor.kuauthorMeriçöz, Çisel Aydın
local.contributor.kuauthorDemir, Çiğdem Gündüz
local.contributor.kuauthorKulaç, İbrahim
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae

Files