Publication: Evaluation of audio features for audio-visual analysis of dance figures
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuauthor | Yemez, Yücel | |
dc.contributor.kuauthor | Demir, Yasemin | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Master Student | |
dc.contributor.other | Department of Electrical and Electronics Engineering | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.yokid | 26207 | |
dc.contributor.yokid | 34503 | |
dc.contributor.yokid | 107907 | |
dc.contributor.yokid | N/A | |
dc.date.accessioned | 2024-11-10T00:00:43Z | |
dc.date.issued | 2008 | |
dc.description.abstract | We present a framework for selecting best audio features for audio-visual analysis and synthesis of dance figures. Dance figures are performed synchronously with the musical rhythm. They can be analyzed through the audio spectra using spectral and rhythmic musical features. In the proposed audio feature evaluation system, dance figures are manually labeled over the video stream. The music segments, which correspond to labeled dance figures, are used to train hidden Markov model (HMM) structures to learn spectral audio patterns for the dance figure melodies. The melody recognition performances of the HMM models for various spectral feature sets are evaluated. Audio features, which are maximizing dance figure melody recognition performances, are selected as the best audio features for the analyzed audiovisual dance recordings. In our evaluations, mel-scale cepstral coefficients (MFCC) with their first and second derivatives, spectral centroid, spectral flux and spectral roll-off are used as candidate audio features. Selection of the best audio features can be used towards analysis and synthesis of audio-driven body animation. | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | TÜBİTAK | |
dc.description.sponsorship | Swiss National Science Foundation (SNSF | |
dc.description.sponsorship | Interactive Multimodal Information Management (IM) | |
dc.description.sponsorship | SIMILAR - Network of Excellence | |
dc.description.sponsorship | Canton de Vaud | |
dc.identifier.doi | N/A | |
dc.identifier.issn | 2219-5491 | |
dc.identifier.link | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84863761125andpartnerID=40andmd5=35c9d16851e841980583394cbe735d2a | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-84863761125 | |
dc.identifier.uri | N/A | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/15854 | |
dc.keywords | Analysis and synthesis | |
dc.keywords | Audio features | |
dc.keywords | Audio-driven body animation | |
dc.keywords | Audiovisual analysis | |
dc.keywords | Cepstral coefficients | |
dc.keywords | HMM models | |
dc.keywords | Melody recognition | |
dc.keywords | Music segments | |
dc.keywords | Musical features | |
dc.keywords | Musical rhythm | |
dc.keywords | Second derivatives | |
dc.keywords | Spectral feature | |
dc.keywords | Spectral flux | |
dc.keywords | Video streams | |
dc.keywords | Animation | |
dc.keywords | Hidden markov models | |
dc.keywords | Signal processing | |
dc.keywords | Speech recognition | |
dc.keywords | Video streaming | |
dc.keywords | Audio acoustics | |
dc.language | English | |
dc.publisher | IEEE | |
dc.source | European Signal Processing Conference | |
dc.subject | Electrical electronics engineering | |
dc.subject | Computer engineering | |
dc.title | Evaluation of audio features for audio-visual analysis of dance figures | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0003-1465-8121 | |
local.contributor.authorid | 0000-0002-2715-2368 | |
local.contributor.authorid | 0000-0002-7515-3138 | |
local.contributor.authorid | N/A | |
local.contributor.kuauthor | Tekalp, Ahmet Murat | |
local.contributor.kuauthor | Erzin, Engin | |
local.contributor.kuauthor | Yemez, Yücel | |
local.contributor.kuauthor | Demir, Yasemin | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 |