Publication:
Al-based Parkinson's disease diagnosis with word level speech data using advanced signal processing techniques

dc.conference.dateSEP 17-19, 2025
dc.conference.locationPoznan
dc.contributor.coauthorHanci, Nur Banu
dc.contributor.coauthorErdem, Og̃uzhan
dc.contributor.coauthorUlukaya, Sezer
dc.contributor.coauthorGüler, Sibel
dc.contributor.departmentSchool of Medicine
dc.contributor.kuauthorUzun, Cem
dc.contributor.schoolcollegeinstituteSCHOOL OF MEDICINE
dc.date.accessioned2025-12-31T08:18:57Z
dc.date.available2025-12-31
dc.date.issued2025
dc.description.abstractParkinson's Disease (PD) is a progressive neurodegenerative disorder that profoundly compromises patients' quality of life. Early and accurate diagnosis remains a clinical challenge, with voice signal analysis emerging as a promising non-invasive biomarker. Unlike the classical vowel sounds used in existing studies, we present a new PD sound dataset that includes a Turkish word "gofret"vocalized by the PD and control groups. The collected dataset was converted into images using Mel - Frequency Cepstral Coefficients (MFCCs), spectrograms, chromograms and tempograms in order to develop vision-based deep learning models from audio recordings. To mitigate data scarcity and enhance model generalizability, a suite of data augmentation strategies including frequency masking, time stretching, shifting and masking were systematically applied. In this study, alternative deep learnnig architectures were developed and their performances were compared. Quantitative evaluations, employing rigorous cross-validation protocols, demonstrated superior classification performance of spectrogram-based models with 85.6% accuracy in PD diagnosis, underscoring their robustness in capturing pathological vocal characteristics. The findings advocate for the integration of advanced augmentation techniques and multifaceted acoustic representations to bolster automated PD detection efficacy from voice data.
dc.description.fulltextYes
dc.description.harvestedfromManual
dc.description.indexedbyScopus
dc.description.publisherscopeInternational
dc.description.readpublishN/A
dc.description.sponsoredbyTubitakEuN/A
dc.description.sponsorshipTrakya Üniversitesi, (2021/90); Trakya Üniversitesi
dc.identifier.doi10.23919/SPA65537.2025.11215100
dc.identifier.embargoNo
dc.identifier.endpage181
dc.identifier.isbn9788362065516
dc.identifier.issn2326-0262
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-105022409431
dc.identifier.startpage176
dc.identifier.urihttps://doi.org/10.23919/SPA65537.2025.11215100
dc.identifier.urihttps://hdl.handle.net/20.500.14288/31419
dc.keywordsAudio processing
dc.keywordsDeep neural networks
dc.keywordsNeurological disorder
dc.keywordsParkinson disease
dc.keywordsVoice analysis
dc.language.isoeng
dc.publisherIEEE Computer Society
dc.relation.affiliationKoç University
dc.relation.collectionKoç University Institutional Repository
dc.relation.ispartofSignal Processing - Algorithms, Architectures, Arrangements, and Applications Conference Proceedings, SPA
dc.relation.openaccessYes
dc.rightsCC BY-NC-ND (Attribution-NonCommercial-NoDerivs)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectDeep neural networks
dc.titleAl-based Parkinson's disease diagnosis with word level speech data using advanced signal processing techniques
dc.typeConference Proceeding
dspace.entity.typePublication
person.familyNameUzun
person.givenNameCem
relation.isOrgUnitOfPublicationd02929e1-2a70-44f0-ae17-7819f587bedd
relation.isOrgUnitOfPublication.latestForDiscoveryd02929e1-2a70-44f0-ae17-7819f587bedd
relation.isParentOrgUnitOfPublication17f2dc8e-6e54-4fa8-b5e0-d6415123a93e
relation.isParentOrgUnitOfPublication.latestForDiscovery17f2dc8e-6e54-4fa8-b5e0-d6415123a93e

Files