Publication:
Lip feature extraction based on audio-visual correlation

dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.kuauthorErzin, Engin
dc.contributor.kuauthorSargın, Mehmet Emre
dc.contributor.kuauthorTekalp, Ahmet Murat
dc.contributor.kuauthorYemez, Yücel
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned2024-11-09T23:18:14Z
dc.date.issued2005
dc.description.abstractIn this paper, the lip feature that has the highest correlation with audio features is investigated. Audio features are selected as Mel Frequency Cepstral Coefficients (MFCC) of the audio signal. Three different lip features are considered for the visual lip information, where these features are 2D DCT coefficients of the intensity based image and the optical flow vectors within the lip region, and the distances between pre-defined points on the lip contour which carries the lip shape information. In this study, we present two techniques based on class conditional probability analysis and canonical correlation analysis to estimate and compare the correlations between audio feature and each lip feature. The lip feature, which has the highest correlation to audio features, is identified among the above lip features. Isolation of lip features, which are highly correlated with audio signal, can be used for audio-visual speech recognition, audio-visual lip synchronization and estimation of lip shapes using audio signal for visual synthesis.
dc.description.indexedbyScopus
dc.description.openaccessYES
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuN/A
dc.identifier.isbn1604-2382-16
dc.identifier.isbn9781-6042-3821-1
dc.identifier.linkhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-84863652641&partnerID=40&md5=583676e6d6d9269bd8814d7cd6dd3257
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-84863652641
dc.identifier.urihttps://IEEExplore.IEEE.org/document/7077967
dc.identifier.urihttps://hdl.handle.net/20.500.14288/10354
dc.keywordsAudio features
dc.keywordsAudio signal
dc.keywordsAudio visual speech recognition
dc.keywordsAudio-visual
dc.keywordsAudio-visual correlations
dc.keywordsCanonical correlation analysis
dc.keywordsConditional probability analysis
dc.keywordsDCT coefficients
dc.keywordsHighly-correlated
dc.keywordsIntensity-based
dc.keywordsLip contour
dc.keywordsLip features
dc.keywordsLip synchronization
dc.keywordsMel-frequency cepstral coefficients
dc.keywordsShape information
dc.keywordsFeature extraction
dc.keywordsSignal processing
dc.keywordsSpeech recognition
dc.language.isoeng
dc.publisherEuropean Association for Signal Processing
dc.relation.ispartof13th European Signal Processing Conference, EUSIPCO 2005
dc.subjectEngineering
dc.titleLip feature extraction based on audio-visual correlation
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorYemez, Yücel
local.contributor.kuauthorErzin, Engin
local.contributor.kuauthorSargın, Mehmet Emre
local.contributor.kuauthorTekalp, Ahmet Murat
local.publication.orgunit1College of Engineering
local.publication.orgunit1GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit2Department of Computer Engineering
local.publication.orgunit2Department of Electrical and Electronics Engineering
local.publication.orgunit2Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files