Publication: Affective burst detection from speech using Kernel-fusion dilated convolutional neural networks
dc.contributor.coauthor | N/A | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuauthor | Köprü, Berkay | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.date.accessioned | 2024-11-10T00:05:57Z | |
dc.date.issued | 2022 | |
dc.description.abstract | As speech interfaces become richer and more widespread, speech emotion recognition promises more attractive applications. In the continuous emotion recognition (CER) problem, tracking changes across affective states is an essential and desired capability. Although CER studies widely use correlation metrics in evaluations, these metrics do not always capture all the high-intensity changes in the affective domain. In this paper, we define a novel affective burst detection problem to capture high-intensity changes of the affective attributes accurately. For this problem, we formulate a two-class classification approach that isolates affective burst regions over the affective state contour. The proposed classifier is a kernel-fusion dilated convolutional neural network (KFDCNN) architecture driven by speech spectral features to segment the affective attribute contour into idle and burst sections. Experimental evaluations are performed on the RECOLA and CreativeIT datasets. The proposed KFDCNN outperforms baseline feedforward neural networks on both datasets. | |
dc.description.indexedby | WOS | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | NO | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | N/A | |
dc.identifier.isbn | 978-90-827970-9-1 | |
dc.identifier.issn | 2076-1465 | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-85141010480 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/16532 | |
dc.identifier.wos | 918827600022 | |
dc.keywords | Emotion recognition | |
dc.keywords | Affective burst detection | |
dc.keywords | Kernel fusion | |
dc.keywords | Convolutional neural networks | |
dc.keywords | Speech analysis | |
dc.language.iso | eng | |
dc.publisher | IEEE | |
dc.relation.ispartof | 2022 30th European Signal Processing Conference (Eusipco 2022) | |
dc.subject | Acoustics | |
dc.subject | Computer science | |
dc.subject | Software engineering | |
dc.subject | Engineering | |
dc.subject | Electrical and electronic engineering | |
dc.subject | Imaging science | |
dc.subject | Photographic technology | |
dc.subject | Telecommunications | |
dc.title | Affective burst detection from speech using Kernel-fusion dilated convolutional neural networks | |
dc.type | Conference Proceeding | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Köprü, Berkay | |
local.contributor.kuauthor | Erzin, Engin | |
local.publication.orgunit1 | College of Engineering | |
local.publication.orgunit2 | Department of Computer Engineering | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isParentOrgUnitOfPublication | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 | |
relation.isParentOrgUnitOfPublication.latestForDiscovery | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 | |