Publication: Al-based Parkinson's disease diagnosis with word level speech data using advanced signal processing techniques
| dc.conference.date | SEP 17-19, 2025 | |
| dc.conference.location | Poznan | |
| dc.contributor.coauthor | Hanci, Nur Banu | |
| dc.contributor.coauthor | Erdem, Og̃uzhan | |
| dc.contributor.coauthor | Ulukaya, Sezer | |
| dc.contributor.coauthor | Güler, Sibel | |
| dc.contributor.department | School of Medicine | |
| dc.contributor.kuauthor | Uzun, Cem | |
| dc.contributor.schoolcollegeinstitute | SCHOOL OF MEDICINE | |
| dc.date.accessioned | 2025-12-31T08:18:57Z | |
| dc.date.available | 2025-12-31 | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Parkinson's Disease (PD) is a progressive neurodegenerative disorder that profoundly compromises patients' quality of life. Early and accurate diagnosis remains a clinical challenge, with voice signal analysis emerging as a promising non-invasive biomarker. Unlike the classical vowel sounds used in existing studies, we present a new PD sound dataset that includes a Turkish word "gofret"vocalized by the PD and control groups. The collected dataset was converted into images using Mel - Frequency Cepstral Coefficients (MFCCs), spectrograms, chromograms and tempograms in order to develop vision-based deep learning models from audio recordings. To mitigate data scarcity and enhance model generalizability, a suite of data augmentation strategies including frequency masking, time stretching, shifting and masking were systematically applied. In this study, alternative deep learnnig architectures were developed and their performances were compared. Quantitative evaluations, employing rigorous cross-validation protocols, demonstrated superior classification performance of spectrogram-based models with 85.6% accuracy in PD diagnosis, underscoring their robustness in capturing pathological vocal characteristics. The findings advocate for the integration of advanced augmentation techniques and multifaceted acoustic representations to bolster automated PD detection efficacy from voice data. | |
| dc.description.fulltext | Yes | |
| dc.description.harvestedfrom | Manual | |
| dc.description.indexedby | Scopus | |
| dc.description.publisherscope | International | |
| dc.description.readpublish | N/A | |
| dc.description.sponsoredbyTubitakEu | N/A | |
| dc.description.sponsorship | Trakya Üniversitesi, (2021/90); Trakya Üniversitesi | |
| dc.identifier.doi | 10.23919/SPA65537.2025.11215100 | |
| dc.identifier.embargo | No | |
| dc.identifier.endpage | 181 | |
| dc.identifier.isbn | 9788362065516 | |
| dc.identifier.issn | 2326-0262 | |
| dc.identifier.quartile | N/A | |
| dc.identifier.scopus | 2-s2.0-105022409431 | |
| dc.identifier.startpage | 176 | |
| dc.identifier.uri | https://doi.org/10.23919/SPA65537.2025.11215100 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14288/31419 | |
| dc.keywords | Audio processing | |
| dc.keywords | Deep neural networks | |
| dc.keywords | Neurological disorder | |
| dc.keywords | Parkinson disease | |
| dc.keywords | Voice analysis | |
| dc.language.iso | eng | |
| dc.publisher | IEEE Computer Society | |
| dc.relation.affiliation | Koç University | |
| dc.relation.collection | Koç University Institutional Repository | |
| dc.relation.ispartof | Signal Processing - Algorithms, Architectures, Arrangements, and Applications Conference Proceedings, SPA | |
| dc.relation.openaccess | Yes | |
| dc.rights | CC BY-NC-ND (Attribution-NonCommercial-NoDerivs) | |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.subject | Deep neural networks | |
| dc.title | Al-based Parkinson's disease diagnosis with word level speech data using advanced signal processing techniques | |
| dc.type | Conference Proceeding | |
| dspace.entity.type | Publication | |
| person.familyName | Uzun | |
| person.givenName | Cem | |
| relation.isOrgUnitOfPublication | d02929e1-2a70-44f0-ae17-7819f587bedd | |
| relation.isOrgUnitOfPublication.latestForDiscovery | d02929e1-2a70-44f0-ae17-7819f587bedd | |
| relation.isParentOrgUnitOfPublication | 17f2dc8e-6e54-4fa8-b5e0-d6415123a93e | |
| relation.isParentOrgUnitOfPublication.latestForDiscovery | 17f2dc8e-6e54-4fa8-b5e0-d6415123a93e |
