Publication: Speech driven 3D head gesture synthesis
dc.contributor.coauthor | Erdem, Tanju A. | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | N/A | |
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.kuauthor | Yemez, Yücel | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuauthor | Sargın, Mehmet Emre | |
dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Master Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | 107907 | |
dc.contributor.yokid | 34503 | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 26207 | |
dc.date.accessioned | 2024-11-09T23:20:38Z | |
dc.date.issued | 2006 | |
dc.description.abstract | In this paper, we present a speech-driven natural head gesture analysis and synthesis system. The proposed system assumes that sharp head movements are correlated with prominence in speech. For analysis, a binocular camera system is employed to capture the head motion of a talking person. The motion parameters associated with the 3D head motion are then used to extract repetitive head gestures. In parallel, prosodic events are detected using an HMM structure with pitch, formant frequencies, and speech intensity as audio features. For synthesis, the head motion parameters are estimated from the prosodic events based on a gesture-speech correlation model, and the associated Euler angles are then used for speech-driven animation of a personalized 3D talking head model. Results on head motion feature extraction, prosodic event detection, and correlation modelling are provided. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | N/A | |
dc.description.volume | 2006 | |
dc.identifier.doi | 10.1109/SIU.2006.1659683 | |
dc.identifier.isbn | 1-4244-0239-5 | |
dc.identifier.isbn | 978-1-4244-0239-7 | |
dc.identifier.link | https://www.scopus.com/inward/record.uri?eid=2-s2.0-34247107135&doi=10.1109%2fSIU.2006.1659683&partnerID=40&md5=788651926aa81cc5b0256be723f4ce80 | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-34247107135 | |
dc.identifier.uri | https://IEEExplore.IEEE.org/stamp/stamp.jsp?tp=&arnumber=1659683 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/10757 | |
dc.identifier.wos | 245347800061 | |
dc.keywords | Binocular vision | |
dc.keywords | Cameras | |
dc.keywords | Correlation theory | |
dc.keywords | Feature extraction | |
dc.keywords | Speech synthesis | |
dc.keywords | Three dimensional | |
dc.keywords | Binocular camera system | |
dc.keywords | Euler angles | |
dc.keywords | Formant frequencies | |
dc.keywords | Speech intensity | |
dc.keywords | Gesture recognition | |
dc.language | Turkish | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
dc.source | 2006 IEEE 14th Signal Processing and Communications Applications Conference | |
dc.subject | Engineering | |
dc.title | Speech driven 3D head gesture synthesis | |
dc.title.alternative | Konuşma ile sürülen kafa jesti analizi ve sentezi | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0002-7515-3138 | |
local.contributor.authorid | 0000-0002-2715-2368 | |
local.contributor.authorid | N/A | |
local.contributor.authorid | 0000-0003-1465-8121 | |
local.contributor.kuauthor | Yemez, Yücel | |
local.contributor.kuauthor | Erzin, Engin | |
local.contributor.kuauthor | Sargın, Mehmet Emre | |
local.contributor.kuauthor | Tekalp, Ahmet Murat | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 |