Publication:
Ransac-based training data selection on spectral features for emotion recognition from spontaneous speech

dc.contributor.coauthorErdem, Çiǧdem Eroǧlu
dc.contributor.coauthorErdem, A. Tanju
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.kuauthorBozkurt, Elif
dc.contributor.kuauthorErzin, Engin
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned2024-11-09T23:21:37Z
dc.date.issued2011
dc.description.abstractTraining datasets containing spontaneous emotional speech are often imperfect due the ambiguities and difficulties of labeling such data by human observers. In this paper, we present a Random Sampling Consensus (RANSAC) based training approach for the problem of emotion recognition from spontaneous speech recordings. Our motivation is to insert a data cleaning process to the training phase of the Hidden Markov Models (HMMs) for the purpose of removing some suspicious instances of labels that may exist in the training dataset. Our experiments using HMMs with Mel Frequency Cepstral Coefficients (MFCC) and Line Spectral Frequency (LSF) features indicate that utilization of RANSAC in the training phase provides an improvement in the unweighted recall rates on the test set. Experimental studies performed over the FAU Aibo Emotion Corpus demonstrate that decision fusion configurations with LSF and MFCC based classifiers provide further significant performance improvements. © 2011 Springer-Verlag.
dc.description.indexedbyScopus
dc.description.openaccessYES
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuN/A
dc.description.sponsorshipEuropean Cooperation in Science and Technology (COST)
dc.description.sponsorshipEuropean Network on Social Signal Processing (SSPnet)
dc.description.volume6800 LNCS
dc.identifier.doi10.1007/978-3-642-25775-9_3
dc.identifier.isbn9783-6422-5774-2
dc.identifier.issn0302-9743
dc.identifier.quartileQ4
dc.identifier.scopus2-s2.0-82955173848
dc.identifier.urihttps://doi.org/10.1007/978-3-642-25775-9_3
dc.identifier.urihttps://hdl.handle.net/20.500.14288/10921
dc.identifier.wos307258000003
dc.keywordsAffect recognition
dc.keywordsData cleaning
dc.keywordsDecision fusion
dc.keywordsEmotional speech classification
dc.keywordsRANSAC affect recognition
dc.keywordsData reduction
dc.keywordsFeature extraction
dc.keywordsHidden Markov models
dc.keywordsSpeech analysis
dc.keywordsSpeech communication
dc.keywordsSpeech recognition
dc.language.isoeng
dc.publisherSpringer
dc.relation.ispartofLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.subjectComputer engineering
dc.titleRansac-based training data selection on spectral features for emotion recognition from spontaneous speech
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorErzin, Engin
local.contributor.kuauthorBozkurt, Elif
local.publication.orgunit1College of Engineering
local.publication.orgunit1GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit2Department of Computer Engineering
local.publication.orgunit2Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files