Comparative lip motion analysis for speaker identification

dc.contributor.department	Department of Electrical and Electronics Engineering
dc.contributor.department	Department of Computer Engineering
dc.contributor.department	Graduate School of Sciences and Engineering
dc.contributor.facultymember	Yes
dc.contributor.kuauthor	Çetingül, Hasan Ertan
dc.contributor.kuauthor	Erzin, Engin
dc.contributor.kuauthor	Tekalp, Ahmet Murat
dc.contributor.kuauthor	Yemez, Yücel
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.contributor.schoolcollegeinstitute	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned	2024-11-09T23:43:31Z
dc.date.issued	2005
dc.description.abstract	The aim of this work is to determine the best lip analysis system, and thus the most accurate lip motion features, for the audio-visual open-set speaker identification problem. Based on different analysis points in the lip region, two alternatives for the initial lip motion representation are considered. In the first alternative, the feature vector is composed of the 2D-DCT coefficients of the motion vectors estimated within the rectangular mouth region, whereas in the second, the outer lip boundaries are tracked over the video frames and only the motion vectors around the lip contour are taken into account, along with the shape of the lip boundary. A further comparison is made between optical flow and block-matching motion estimation methods to find the best model for lip movement. The dimension of the resulting lip feature vector is then reduced by a two-stage discrimination method that selects the most discriminative lip features. An HMM-based identification system is used to compare the performance of these motion representations. It is observed that the lower-dimensional feature vector computed by block-matching within a rectangular grid in the lip region maximizes the identification performance.
dc.description.fulltext	No
dc.description.harvestedfrom	Manual
dc.description.indexedby	Scopus
dc.description.openaccess	YES
dc.description.peerreviewstatus	N/A
dc.description.publisherscope	International
dc.description.readpublish	N/A
dc.description.sponsoredbyTubitakEu	N/A
dc.description.studentonlypublication	No
dc.description.studentpublication	Yes
dc.description.version	N/A
dc.identifier.doi	10.1109/SIU.2005.1567680
dc.identifier.embargo	N/A
dc.identifier.isbn	0-7803-9239-6
dc.identifier.isbn	978-0-7803-9239-7
dc.identifier.quartile	To be checked
dc.identifier.scopus	2-s2.0-33846625546
dc.identifier.uri	https://IEEExplore.IEEE.org/stamp/stamp.jsp?arnumber=1567680
dc.identifier.uri	https://www.scopus.com/inward/record.uri?eid=2-s2.0-33846625546&doi=10.1109%2fSIU.2005.1567680&partnerID=40&md5=bdb4bf484822bf5745623f03b78dec21
dc.identifier.uri	https://hdl.handle.net/20.500.14288/13502
dc.keywords	Feature extraction
dc.keywords	Markov processes
dc.keywords	Motion estimation
dc.keywords	Problem solving
dc.keywords	Vectors
dc.keywords	Video signal processing
dc.keywords	Lip motion analysis
dc.keywords	Motion vectors
dc.keywords	Optical flow
dc.keywords	Speech analysis
dc.language.iso	tur
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.affiliation	Koç University
dc.relation.collection	Koç University Institutional Repository
dc.relation.ispartof	Proceedings of the IEEE 13th Signal Processing and Communications Applications Conference, SIU 2005
dc.relation.openaccess	N/A
dc.rights	N/A
dc.subject	Computer engineering
dc.title	Comparative lip motion analysis for speaker identification
dc.title.alternative	Konuşmacı tanıma için karşılaştırmalı dudak devinim analizi
dc.type	Conference Proceeding
dspace.entity.type	Publication
local.contributor.kuauthor	Yemez, Yücel
local.contributor.kuauthor	Erzin, Engin
local.contributor.kuauthor	Tekalp, Ahmet Murat
local.contributor.kuauthor	Çetingül, Hasan Ertan
relation.isOrgUnitOfPublication	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication	3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication	434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Collections

Publications without Fulltext
