On optimal selection of lip-motion features for speaker identification

dc.conference.date	SEP 29-OCT 01, 2004
dc.conference.location	Siena, Italy
dc.conference.organizer	6th IEEE Workshop on Multimedia Signal Processing
dc.contributor.department	MVGL (Multimedia, Vision and Graphics Laboratory)
dc.contributor.facultymember	Yes
dc.contributor.kuauthor	Çetingül, Hasan Ertan
dc.contributor.kuauthor	Erzin, Engin
dc.contributor.kuauthor	Tekalp, Ahmet Murat
dc.contributor.kuauthor	Yemez, Yücel
dc.contributor.schoolcollegeinstitute	Laboratory
dc.date.accessioned	2024-11-10T00:06:41Z
dc.date.issued	2004
dc.description.abstract	This paper addresses the selection of the best lip-motion features for biometric open-set speaker identification. The best features are those that result in the highest discrimination of individual speakers in a population. We first detect the face region in each video frame. The lip region in each frame is then segmented after successive face regions are registered by global motion compensation. The initial lip feature vector is composed of the 2D-DCT coefficients of the optical flow vectors within the lip region at each frame. We propose to select the most discriminative features from the full set of transform coefficients by using a probabilistic measure that maximizes the ratio of intra-class to inter-class probabilities. The resulting discriminative feature vector, which has reduced dimension, is expected to maximize the identification performance. Experimental results are included to demonstrate the identification performance.
dc.description.fulltext	No
dc.description.harvestedfrom	Manual
dc.description.indexedby	WOS
dc.description.indexedby	Scopus
dc.description.openaccess	Green OA
dc.description.peerreviewstatus	N/A
dc.description.publisherscope	International
dc.description.readpublish	N/A
dc.description.sponsoredbyTubitakEu	N/A
dc.description.studentonlypublication	No
dc.description.studentpublication	Yes
dc.description.version	Post-print
dc.identifier.WoSQuartile	N/A
dc.identifier.embargo	No
dc.identifier.filenameinventoryno	IR06891
dc.identifier.isbn	0780385780
dc.identifier.scopus	2-s2.0-13344277211
dc.identifier.uri	https://hdl.handle.net/20.500.14288/16656
dc.identifier.wos	000224752800002
dc.keywords	Speech
dc.keywords	Speaker identification
dc.keywords	Lip motion
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers
dc.relation.affiliation	Koç University
dc.relation.collection	Koç University Institutional Repository
dc.relation.ispartof	2004 IEEE 6th Workshop On Multimedia Signal Processing
dc.relation.openaccess	Yes
dc.rights	Other
dc.subject	Computer science
dc.subject	Artificial intelligence
dc.subject	Engineering
dc.subject	Electrical electronic engineering
dc.subject	Imaging science
dc.subject	Photographic technology
dc.title	On optimal selection of lip-motion features for speaker identification
dc.type	Conference Proceeding
dspace.entity.type	Publication
local.contributor.kuauthor	Çetingül, Hasan Ertan
local.contributor.kuauthor	Erzin, Engin
local.contributor.kuauthor	Yemez, Yücel
local.contributor.kuauthor	Tekalp, Ahmet Murat
relation.isOrgUnitOfPublication	cb6bbbf6-fd19-4052-b581-f591a9748d21
relation.isOrgUnitOfPublication.latestForDiscovery	cb6bbbf6-fd19-4052-b581-f591a9748d21
relation.isParentOrgUnitOfPublication	20385dee-35e7-484b-8da6-ddcc08271d96
relation.isParentOrgUnitOfPublication.latestForDiscovery	20385dee-35e7-484b-8da6-ddcc08271d96
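
The feature pipeline summarized in the abstract (2D-DCT coefficients of lip-region optical flow, followed by selection of the most discriminative coefficients) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the optical flow inside the lip region has already been computed and is given as two small grids, one per flow component. It also swaps in a Fisher-style between-class to within-class variance ratio as a stand-in for the paper's probabilistic measure, which this record does not specify. All function names (dct2, flow_features, fisher_scores, select_top) are hypothetical.

```python
import math


def dct2(block):
    """Orthonormal 2D DCT-II of a rectangular block given as a list of rows."""
    rows, cols = len(block), len(block[0])

    def scale(k, n):
        # Orthonormal scaling: sqrt(1/n) for the DC term, sqrt(2/n) otherwise.
        return math.sqrt((1.0 if k == 0 else 2.0) / n)

    out = [[0.0] * cols for _ in range(rows)]
    for u in range(rows):
        for v in range(cols):
            total = 0.0
            for y in range(rows):
                for x in range(cols):
                    total += (block[y][x]
                              * math.cos(math.pi * (2 * y + 1) * u / (2 * rows))
                              * math.cos(math.pi * (2 * x + 1) * v / (2 * cols)))
            out[u][v] = scale(u, rows) * scale(v, cols) * total
    return out


def flow_features(flow_x, flow_y):
    """Concatenate the DCT coefficients of both flow components into one vector."""
    feats = []
    for component in (flow_x, flow_y):
        for row in dct2(component):
            feats.extend(row)
    return feats


def fisher_scores(samples, labels):
    """Per-feature between-class / within-class variance ratio.

    Assumption: a stand-in score, not the probabilistic measure of the paper.
    """
    classes = sorted(set(labels))
    scores = []
    for d in range(len(samples[0])):
        values = [s[d] for s in samples]
        grand_mean = sum(values) / len(values)
        between = within = 0.0
        for c in classes:
            vals = [s[d] for s, lab in zip(samples, labels) if lab == c]
            mean = sum(vals) / len(vals)
            between += len(vals) * (mean - grand_mean) ** 2
            within += sum((v - mean) ** 2 for v in vals)
        scores.append(between / (within + 1e-12))  # guard against zero variance
    return scores


def select_top(samples, labels, k):
    """Indices of the k features with the highest stand-in score."""
    scores = fisher_scores(samples, labels)
    ranked = sorted(range(len(scores)), key=lambda d: scores[d], reverse=True)
    return ranked[:k]
```

In use, each video frame would contribute one vector from flow_features, labeled with its speaker, and select_top would return the coefficient indices to keep in the reduced feature vector. The variance ratio is only one possible ranking criterion and is used here because it is simple to state.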

Files

Original bundle

Name: IR06891.pdf
Size: 556.98 KB
Format: Adobe Portable Document Format

Collections

Publications with Fulltext
