Publication:
Ransac-based training data selection for speaker state recognition

dc.contributor.coauthorErdem, Çiğdem Eroğlu
dc.contributor.coauthorErdem, A. Tanju
dc.contributor.departmentMVGL (Multimedia, Vision and Graphics Laboratory)
dc.contributor.kuauthorBozkurt, Elif
dc.contributor.kuauthorErzin, Engin
dc.contributor.schoolcollegeinstituteLaboratory
dc.date.accessioned2024-11-10T00:00:05Z
dc.date.issued2011
dc.description.abstractWe present a Random Sampling Consensus (RANSAC) based training approach for the problem of speaker state recognition from spontaneous speech. Our system is trained and tested with the INTERSPEECH 2011 Speaker State Challenge corpora that includes the Intoxication and the Sleepiness Sub-challenges, where each sub-challenge defines a two-class classification task. We aim to perform a RANSAC-based training data selection coupled with the Support Vector Machine (SVM) based classification to prune possible outliers, which exist in the training data. Our experimental evaluations indicate that utilization of RANSAC-based training data selection provides 66.32 % and 65.38 % unweighted average (UA) recall rate on the development and test sets for the Sleepiness Sub-challenge, respectively and a slight improvement on the Intoxication Sub-challenge performance.
dc.description.fulltextNo
dc.description.harvestedfromManual
dc.description.indexedbyWOS
dc.description.openaccessNO
dc.description.peerreviewstatusN/A
dc.description.publisherscopeInternational
dc.description.readpublishN/A
dc.description.sponsoredbyTubitakEuN/A
dc.description.versionN/A
dc.identifier.embargoN/A
dc.identifier.isbn9781618392701
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-84865741850
dc.identifier.urihttps://hdl.handle.net/20.500.14288/15750
dc.identifier.wos316502201314
dc.keywordsSpeaker state challenge
dc.keywordsIntoxication
dc.keywordsSleepiness
dc.keywordsRansac
dc.language.isoeng
dc.publisherIsca-Int Speech Communication Assoc
dc.relation.affiliationKoç University
dc.relation.collectionKoç University Institutional Repository
dc.relation.ispartof12th Annual Conference of the International Speech Communication Association 2011 (Interspeech 2011), Vols 1-5
dc.relation.openaccessN/A
dc.rightsN/A
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.subjectComputer science
dc.subjectEngineering
dc.subjectElectrical electronic engineering
dc.titleRansac-based training data selection for speaker state recognition
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorBozkurt, Elif
local.contributor.kuauthorErzin, Engin
relation.isOrgUnitOfPublicationcb6bbbf6-fd19-4052-b581-f591a9748d21
relation.isOrgUnitOfPublication.latestForDiscoverycb6bbbf6-fd19-4052-b581-f591a9748d21
relation.isParentOrgUnitOfPublication20385dee-35e7-484b-8da6-ddcc08271d96
relation.isParentOrgUnitOfPublication.latestForDiscovery20385dee-35e7-484b-8da6-ddcc08271d96

Files