Publication: Large language models as a rapid and objective tool for pathology report data extraction
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Bolat, Beyza | |
dc.contributor.kuauthor | Eren, Özgür Can | |
dc.contributor.kuauthor | Dur Karasayar, Ayşe Hümeyra | |
dc.contributor.kuauthor | Meriçöz, Çisel Aydın | |
dc.contributor.kuauthor | Demir, Çiğdem Gündüz | |
dc.contributor.kuauthor | Kulaç, İbrahim | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.researchcenter | Koç Üniversitesi İş Bankası Enfeksiyon Hastalıkları Uygulama ve Araştırma Merkezi (EHAM) / Koç University İşbank Center for Infectious Diseases (KU-IS CID) | |
dc.contributor.researchcenter | Koç Üniversitesi İş Bankası Yapay Zeka Uygulama ve Araştırma Merkezi (KUIS AI)/ Koç University İş Bank Artificial Intelligence Center (KUIS AI) | |
dc.contributor.researchcenter | Koç University Research Center for Translational Medicine (KUTTAM) / Koç Üniversitesi Translasyonel Tıp Araştırma Merkezi (KUTTAM) | |
dc.contributor.schoolcollegeinstitute | School of Medicine | |
dc.contributor.schoolcollegeinstitute | Graduate School of Health Sciences | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.date.accessioned | 2024-12-29T09:39:57Z | |
dc.date.issued | 2024 | |
dc.description.abstract | Medical institutions continuously create a substantial amount of data that is used for scientific research. One of the departments with a great amount of archived data is the pathology department. Pathology archives hold the potential to create a case series of valuable rare entities or large cohorts of common entities. The major problem in creation of these databases is data extraction which is still commonly done manually and is highly laborious and error prone. For these reasons, we offer using large language models to overcome these challenges. Ten pathology reports of selected resection specimens were retrieved from electronic archives of Ko & ccedil; University Hospital for the initial set. These reports were de-identified and uploaded to ChatGPT and Google Bard. Both algorithms were asked to turn the reports in a synoptic report format that is easy to export to a data editor such as Microsoft Excel or Google Sheets. Both programs created tables with Google Bard facilitating the creation of a spreadsheet from the data automatically. In conclusion, we propose the use of AI-assisted data extraction for academic research purposes, as it may enhance efficiency and precision compared to manual data entry. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.indexedby | PubMed | |
dc.description.indexedby | TR Dizin | |
dc.description.issue | 2 | |
dc.description.publisherscope | National | |
dc.description.volume | 40 | |
dc.identifier.doi | 10.5146/tjpath.2024.13256 | |
dc.identifier.eissn | 1309-5730 | |
dc.identifier.issn | 1018-5615 | |
dc.identifier.quartile | Q4 | |
dc.identifier.scopus | 2-s2.0-85193086812 | |
dc.identifier.uri | https://doi.org/10.5146/tjpath.2024.13256 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/23174 | |
dc.identifier.wos | 1229191100008 | |
dc.keywords | Large language models (LLMs) | |
dc.keywords | Pathology | |
dc.keywords | Generative pre-trained transformer-4 (GPT-4) | |
dc.keywords | ChatGPT | |
dc.keywords | Bard | |
dc.language | en | |
dc.publisher | Federation Turkish Pathology Soc. | |
dc.source | Türk Patoloji Dergisi- Turkish Journal of Pathology | |
dc.subject | Pathology | |
dc.title | Large language models as a rapid and objective tool for pathology report data extraction | |
dc.type | Letter | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Bolat, Beyza | |
local.contributor.kuauthor | Eren, Özgür Can | |
local.contributor.kuauthor | Dur Karasayar, Ayşe Hümeyra | |
local.contributor.kuauthor | Meriçöz, Çisel Aydın | |
local.contributor.kuauthor | Demir, Çiğdem Gündüz | |
local.contributor.kuauthor | Kulaç, İbrahim | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |