Publication:
On optimal selection of lip-motion features for speaker identification

dc.conference.date: SEP 29-OCT 01, 2004
dc.conference.location: Siena, Italy
dc.conference.organizer: 6th IEEE Workshop on Multimedia Signal Processing
dc.contributor.department: MVGL (Multimedia, Vision and Graphics Laboratory)
dc.contributor.facultymember: Yes
dc.contributor.kuauthor: Çetingül, Hasan Ertan
dc.contributor.kuauthor: Erzin, Engin
dc.contributor.kuauthor: Tekalp, Ahmet Murat
dc.contributor.kuauthor: Yemez, Yücel
dc.contributor.schoolcollegeinstitute: Laboratory
dc.date.accessioned: 2024-11-10T00:06:41Z
dc.date.issued: 2004
dc.description.abstract: This paper addresses the selection of best lip motion features for biometric open-set speaker identification. The best features are those that result in the highest discrimination of individual speakers in a population. We first detect the face region in each video frame. The lip region for each frame is then segmented following registration of successive face regions by global motion compensation. The initial lip feature vector is composed of the 2D-DCT coefficients of the optical flow vectors within the lip region at each frame. We propose to select the most discriminative features from the full set of transform coefficients by using a probabilistic measure that maximizes the ratio of intra-class and inter-class probabilities. The resulting discriminative feature vector with reduced dimension is expected to maximize the identification performance. Experimental results are also included to demonstrate the performance.
dc.description.fulltext: No
dc.description.harvestedfrom: Manual
dc.description.indexedby: WOS
dc.description.indexedby: Scopus
dc.description.openaccess: Green OA
dc.description.peerreviewstatus: N/A
dc.description.publisherscope: International
dc.description.readpublish: N/A
dc.description.sponsoredbyTubitakEu: N/A
dc.description.studentonlypublication: No
dc.description.studentpublication: Yes
dc.description.version: Post-print
dc.identifier.embargo: No
dc.identifier.filenameinventoryno: IR06891
dc.identifier.isbn: 0780385780
dc.identifier.quartile: N/A
dc.identifier.scopus: 2-s2.0-13344277211
dc.identifier.uri: https://hdl.handle.net/20.500.14288/16656
dc.identifier.wos: 000224752800002
dc.keywords: Speech
dc.keywords: Speaker identification
dc.keywords: Lip motion
dc.language.iso: eng
dc.publisher: Institute of Electrical and Electronics Engineers
dc.relation.affiliation: Koç University
dc.relation.collection: Koç University Institutional Repository
dc.relation.ispartof: 2004 IEEE 6th Workshop On Multimedia Signal Processing
dc.relation.openaccess: Yes
dc.rights: Other
dc.subject: Computer science
dc.subject: Artificial intelligence
dc.subject: Engineering
dc.subject: Electrical electronic engineering
dc.subject: Imaging science
dc.subject: Photographic technology
dc.title: On optimal selection of lip-motion features for speaker identification
dc.type: Conference Proceeding
dspace.entity.type: Publication
local.contributor.kuauthor: Çetingül, Hasan Ertan
local.contributor.kuauthor: Erzin, Engin
local.contributor.kuauthor: Yemez, Yücel
local.contributor.kuauthor: Tekalp, Ahmet Murat
relation.isOrgUnitOfPublication: cb6bbbf6-fd19-4052-b581-f591a9748d21
relation.isOrgUnitOfPublication.latestForDiscovery: cb6bbbf6-fd19-4052-b581-f591a9748d21
relation.isParentOrgUnitOfPublication: 20385dee-35e7-484b-8da6-ddcc08271d96
relation.isParentOrgUnitOfPublication.latestForDiscovery: 20385dee-35e7-484b-8da6-ddcc08271d96
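
The feature pipeline summarized in the abstract (2D-DCT coefficients of the lip-region optical flow, followed by selection of the most discriminative coefficients) might be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the optical flow fields are assumed to be already computed and passed in as arrays, and a Fisher-style between-class to within-class variance ratio stands in for the paper's probabilistic measure, which this record does not specify.

```python
import numpy as np


def dct_matrix(n):
    """Orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n)[:, None]  # frequency index (rows)
    i = np.arange(n)[None, :]  # sample index (columns)
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)  # DC row uses a different scale factor
    return m


def dct2(block):
    """Separable 2D-DCT: transform along rows, then along columns."""
    rows, cols = block.shape
    return dct_matrix(rows) @ block @ dct_matrix(cols).T


def lip_motion_features(flow_x, flow_y):
    """Initial feature vector for one frame (hypothetical layout):
    2D-DCT coefficients of the horizontal and vertical flow, concatenated."""
    return np.concatenate([dct2(flow_x).ravel(), dct2(flow_y).ravel()])


def select_features(features, labels, keep):
    """Return indices of the `keep` highest-scoring features.

    ASSUMPTION: the score is a Fisher-style ratio of between-speaker to
    within-speaker variance, used here only as a stand-in for the paper's
    probabilistic discrimination measure.
    """
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    overall_mean = features.mean(axis=0)
    between = np.zeros(features.shape[1])
    within = np.zeros(features.shape[1])
    for speaker in np.unique(labels):
        group = features[labels == speaker]
        group_mean = group.mean(axis=0)
        between += len(group) * (group_mean - overall_mean) ** 2
        within += ((group - group_mean) ** 2).sum(axis=0)
    score = between / (within + 1e-12)  # small constant avoids division by zero
    return np.argsort(score)[::-1][:keep]
```

In such a sketch, `lip_motion_features` would be applied to every frame of every training utterance, and `select_features` would then be run once over the stacked vectors to obtain the reduced-dimension feature set.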

Files

Original bundle

Name: IR06891.pdf
Size: 556.98 KB
Format: Adobe Portable Document Format