Publication: Systematic comparison of GPT models for the analysis of pathology reports in a low-resource language: a case study for Turkish
| dc.contributor.coauthor | Dilbaz, Omer Faruk | |
| dc.contributor.department | School of Medicine | |
| dc.contributor.department | Department of Computer Engineering | |
| dc.contributor.department | KUIS AI (Koç University & İş Bank Artificial Intelligence Center) | |
| dc.contributor.department | KUTTAM (Koç University Research Center for Translational Medicine) | |
| dc.contributor.department | Graduate School of Sciences and Engineering | |
| dc.contributor.kuauthor | Bolat, Beyza | |
| dc.contributor.kuauthor | Demir, Çiğdem Gündüz | |
| dc.contributor.kuauthor | Kulaç, İbrahim | |
| dc.contributor.kuauthor | Özateş, Muhammet Nusret | |
| dc.contributor.schoolcollegeinstitute | School of Medicine | |
| dc.contributor.schoolcollegeinstitute | College of Engineering | |
| dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
| dc.date.accessioned | 2025-12-31T08:24:36Z | |
| dc.date.available | 2025-12-31 | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Objective: Large language models (LLMs) can process text for various applications, including surgical pathology reports, but existing studies focus primarily on English. Their performance has not been systematically studied for a low-resource language. To analyze the performance of various LLMs, 759 Turkish pathology reports from 5 different procedures were selected. Methods: We used 10 examples from each procedure to optimize prompts for OpenAI's GPT-3.5 Turbo, GPT-4o mini, and GPT-4o. The remaining reports were used to test generalizability. Results: GPT-4o performed best in processing Turkish reports (12%-25% over GPT-3.5 Turbo, 3%-16% over GPT-4o mini). Translating the reports into English improved accuracy, especially for GPT-3.5 Turbo and GPT-4o mini. GPT-4o showed comparable results for Turkish and English. A 12% to 22% performance gap was observed between GPT-4o and GPT-3.5 Turbo for English-translated reports. Domain-related tips in prompts increased accuracy. For all models, results on the larger test sets paralleled those on the validation set. GPT-4o yielded the most accurate results, GPT-4o mini showed intermediate performance, and GPT-3.5 Turbo was the least accurate. Conclusions: To our knowledge, this is the first demonstration in the literature of the performance of GPT models on Turkish surgical pathology reports, and the results indicate that data extracted by GPT-4o are almost ready for direct application. | |
| dc.description.fulltext | Yes | |
| dc.description.harvestedfrom | Manual | |
| dc.description.indexedby | WOS | |
| dc.description.indexedby | Scopus | |
| dc.description.indexedby | PubMed | |
| dc.description.publisherscope | International | |
| dc.description.readpublish | N/A | |
| dc.description.sponsoredbyTubitakEu | N/A | |
| dc.identifier.doi | 10.1093/ajcp/aqaf091 | |
| dc.identifier.eissn | 1943-7722 | |
| dc.identifier.embargo | No | |
| dc.identifier.issn | 0002-9173 | |
| dc.identifier.pubmed | 40971916 | |
| dc.identifier.quartile | Q3 | |
| dc.identifier.scopus | 2-s2.0-105022413328 | |
| dc.identifier.uri | https://doi.org/10.1093/ajcp/aqaf091 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14288/31805 | |
| dc.identifier.wos | 001574536000001 | |
| dc.keywords | LLM | |
| dc.keywords | GPT-4o | |
| dc.keywords | Pathology | |
| dc.keywords | Data extraction | |
| dc.language.iso | eng | |
| dc.publisher | OXFORD UNIV PRESS INC | |
| dc.relation.affiliation | Koç University | |
| dc.relation.collection | Koç University Institutional Repository | |
| dc.relation.ispartof | American Journal of Clinical Pathology | |
| dc.relation.openaccess | Yes | |
| dc.rights | CC BY-NC-ND (Attribution-NonCommercial-NoDerivs) | |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.subject | Pathology | |
| dc.title | Systematic comparison of GPT models for the analysis of pathology reports in a low-resource language: a case study for Turkish | |
| dc.type | Journal Article | |
| dspace.entity.type | Publication | |
| person.familyName | Bolat | |
| person.familyName | Demir | |
| person.familyName | Kulaç | |
| person.familyName | Özateş | |
| person.givenName | Beyza | |
| person.givenName | Çiğdem Gündüz | |
| person.givenName | İbrahim | |
| person.givenName | Muhammet Nusret | |
| relation.isOrgUnitOfPublication | d02929e1-2a70-44f0-ae17-7819f587bedd | |
| relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
| relation.isOrgUnitOfPublication | 77d67233-829b-4c3a-a28f-bd97ab5c12c7 | |
| relation.isOrgUnitOfPublication | 91bbe15d-017f-446b-b102-ce755523d939 | |
| relation.isOrgUnitOfPublication | 3fc31c89-e803-4eb1-af6b-6258bc42c3d8 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | d02929e1-2a70-44f0-ae17-7819f587bedd | |
| relation.isParentOrgUnitOfPublication | 17f2dc8e-6e54-4fa8-b5e0-d6415123a93e | |
| relation.isParentOrgUnitOfPublication | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 | |
| relation.isParentOrgUnitOfPublication | 434c9663-2b11-4e66-9399-c863e2ebae43 | |
| relation.isParentOrgUnitOfPublication.latestForDiscovery | 17f2dc8e-6e54-4fa8-b5e0-d6415123a93e |
