Publication: On optimal selection of lip-motion features for speaker identification
| dc.conference.date | SEP 29-OCT 01, 2004 | |
| dc.conference.location | Siena, Italy | |
| dc.conference.organizer | 6th IEEE Workshop on Multimedia Signal Processing | |
| dc.contributor.department | MVGL (Multimedia, Vision and Graphics Laboratory) | |
| dc.contributor.facultymember | Yes | |
| dc.contributor.kuauthor | Çetingül, Hasan Ertan | |
| dc.contributor.kuauthor | Erzin, Engin | |
| dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
| dc.contributor.kuauthor | Yemez, Yücel | |
| dc.contributor.schoolcollegeinstitute | Laboratory | |
| dc.date.accessioned | 2024-11-10T00:06:41Z | |
| dc.date.issued | 2004 | |
| dc.description.abstract | This paper addresses the selection of best lip motion features for biometric open-set speaker identification. The best features are those that result in the highest discrimination of individual speakers in a population. We first detect the face region in each video frame. The lip region for each frame is then segmented following registration of successive face regions by global motion compensation. The initial lip feature vector is composed of the 2D-DCT coefficients of the optical flow vectors within the lip region at each frame. We propose to select the most discriminative features from the full set of transform coefficients by using a probabilistic measure that maximizes the ratio of intra-class and inter-class probabilities. The resulting discriminative feature vector with reduced dimension is expected to maximize the identification performance. Experimental results are also included to demonstrate the performance. | |
| dc.description.fulltext | No | |
| dc.description.harvestedfrom | Manual | |
| dc.description.indexedby | WOS | |
| dc.description.indexedby | Scopus | |
| dc.description.openaccess | Green OA | |
| dc.description.peerreviewstatus | N/A | |
| dc.description.publisherscope | International | |
| dc.description.readpublish | N/A | |
| dc.description.sponsoredbyTubitakEu | N/A | |
| dc.description.studentonlypublication | No | |
| dc.description.studentpublication | Yes | |
| dc.description.version | Post-print | |
| dc.identifier.embargo | No | |
| dc.identifier.filenameinventoryno | IR06891 | |
| dc.identifier.isbn | 0780385780 | |
| dc.identifier.quartile | N/A | |
| dc.identifier.scopus | 2-s2.0-13344277211 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14288/16656 | |
| dc.identifier.wos | 000224752800002 | |
| dc.keywords | Speech | |
| dc.keywords | Speaker identification | |
| dc.keywords | Lip motion | |
| dc.language.iso | eng | |
| dc.publisher | Institute of Electrical and Electronics Engineers | |
| dc.relation.affiliation | Koç University | |
| dc.relation.collection | Koç University Institutional Repository | |
| dc.relation.ispartof | 2004 IEEE 6th Workshop On Multimedia Signal Processing | |
| dc.relation.openaccess | Yes | |
| dc.rights | Other | |
| dc.subject | Computer science | |
| dc.subject | Artificial intelligence | |
| dc.subject | Engineering | |
| dc.subject | Electrical electronic engineering | |
| dc.subject | Imaging science | |
| dc.subject | Photographic technology | |
| dc.title | On optimal selection of lip-motion features for speaker identification | |
| dc.type | Conference Proceeding | |
| dspace.entity.type | Publication | |
| local.contributor.kuauthor | Çetingül, Hasan Ertan | |
| local.contributor.kuauthor | Erzin, Engin | |
| local.contributor.kuauthor | Yemez, Yücel | |
| local.contributor.kuauthor | Tekalp, Ahmet Murat | |
| relation.isOrgUnitOfPublication | cb6bbbf6-fd19-4052-b581-f591a9748d21 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | cb6bbbf6-fd19-4052-b581-f591a9748d21 | |
| relation.isParentOrgUnitOfPublication | 20385dee-35e7-484b-8da6-ddcc08271d96 | |
| relation.isParentOrgUnitOfPublication.latestForDiscovery | 20385dee-35e7-484b-8da6-ddcc08271d96 |
Files
Original bundle
1 - 1 of 1
