Estimating severity of depression from acoustic features and embeddings of natural speech

Sri Harsha Dumpala, Sheri Rempel, Katerina Dikaios, Mehri Sajjadian, Rudolf Uher, Sageev Oore

Résultat de recherche: Conference articleexamen par les pairs

14 Citations (Scopus)

Résumé

Major depressive disorder, referred to as depression, is a leading cause of disability, absence from work, and premature death. Automatic assessment of depression from speech is a critical step towards improving diagnosis and treatment of depression. Previous works on depression assessment from speech considered various acoustic features extracted from speech to estimate depression severity. But performance of these approaches is not at clinical standards, and thus requires further improvement. In this work, we examine two novel approaches for improving depression severity estimation from short audio recordings of speech. Specifically, in audio recordings of a narrative by individuals diagnosed with major depressive disorder, we analyze spectral-based and excitation source-based features extracted from speech, and significance of sentiment and emotion classification in estimation of depression severity. Initial results indicate synchrony between depression scores and the sentiment and emotion labels. We propose the use of sentiment and emotion based embeddings obtained using machine learning techniques in estimation of depression severity. We also propose use of multi-task training to better estimate depression severity. We show that the proposed approaches provide additive improvements in the estimation of depression severity.

Langue d'origineEnglish
Pages (de-à)7278-7282
Nombre de pages5
JournalProceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
Volume2021-June
DOI
Statut de publicationPublished - 2021
Événement2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2021 - Virtual, Toronto, Canada
Durée: juin 6 2021juin 11 2021

Note bibliographique

Funding Information:
Resources used in preparing this research were provided, in part, by CIHR funding reference #165835, NSERC, the Province of Ontario, Canada Research Chairs Program, the Government of Canada through CIFAR, and companies sponsoring the Vector Institute www.vectorinstitute.ai/#partners.

Publisher Copyright:
©2021 IEEE

ASJC Scopus Subject Areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Empreinte numérique

Plonger dans les sujets de recherche 'Estimating severity of depression from acoustic features and embeddings of natural speech'. Ensemble, ils forment une empreinte numérique unique.

Citer