Publication:
Exploring modulation spectrum features for speech-based depression level classification

Placeholder

Organizational Units

Program

KU-Authors

KU Authors

Co-Authors

Toledo-Ronen, Orith
Sorin, Alexander

Advisor

Publication Date

2014

Language

English

Type

Conference proceeding

Journal Title

Journal ISSN

Volume Title

Abstract

In this paper, we propose a Modulation Spectrum-based manageable feature set for detection of depressed speech. Modulation Spectrum (MS) is obtained from the conventional speech spectrogram by spectral analysis along the temporal trajectories of the acoustic frequency bins. While MS representation of speech provides rich and high-dimensional joint frequency information, extraction of discriminative features from it remains as an open question. We propose a lower dimensional representation, which first employs a Melfrequency filterbank in the acoustic frequency domain and Discrete Cosine Transform in the modulation frequency domain, and then applies feature selection in both domains. We compare and fuse the proposed feature set with other complementary prosodic and spectral features at the feature and decision levels. In our experiments, we use Support Vector Machines for discriminating the depressed speech in a speaker-independent fashion. Feature-level fusion of the proposed MS-based features with other prosodic and spectral features after dimension reduction provides up to ~9% improvement over the baseline results and also correlates the most with clinical ratings of patients' depression level.

Description

Source:

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Publisher:

International Speech and Communication Association

Keywords:

Subject

Computer science, Artificial intelligence, Engineering, Electrical electronic engineering

Citation

Endorsement

Review

Supplemented By

Referenced By

Copy Rights Note

0

Views

0

Downloads

View PlumX Details