Exploring the Contributions of Various Acoustic Features in Cantonese Vocal Emotions.

Saved in:
Bibliographic Details
Title: Exploring the Contributions of Various Acoustic Features in Cantonese Vocal Emotions.
Authors: Han, Dong1, Yang, Yike1 yyang@hksyu.edu
Source: Journal of Speech, Language & Hearing Research. Nov2025, Vol. 68 Issue 11, p5226-5237. 12p.
Subject Terms: *Data analysis, *Emotions, *Speech perception, Statistical models, Descriptive statistics, Physiological aspects of speech, Statistics, Decision trees, Data analysis software
Abstract: Purpose: The aim of this study was to investigate the acoustic patterns of six emotions and a neutral state in Cantonese speech by focusing on the prosodic modulations that convey emotional content in this tonal language, which has six lexical tones. Method: We employed the extended Geneva minimalistic acoustic parameter set to systematically analyze the acoustic features of 3,474 recordings from the Cantonese Audio-Visual Emotional Speech Database. Linear mixed-effects models were fitted to examine variations in acoustic parameters across emotional states. Decision tree models were used to assess the relative contributions of 22 acoustic parameters in classifying emotions. Results: By fitting linear mixed-effects models, our results revealed statistically significant variations in most of the acoustic parameters across diverse emotional states. The decision tree models showed the relative contributions of 22 acoustic parameters in the classification of emotions, with spectral parameters accounting for 65.45% of the significance in distinguishing all seven emotional states, significantly exceeding other groups of features. Conclusions: Our findings highlight the unique characteristics of emotional expression in Cantonese, in which spectral parameters play a more significant role compared to the frequency-related parameters that are often emphasized in nontonal languages. Our results contribute significantly to understanding vocal emotion expression in tonal languages and are particularly useful for designing emotion-recognition systems and hearing aids that are tailored to tonal language environments. Furthermore, these insights have potential implications for enhancing emotional communication and cognitive training interventions for Cantonese-speaking individuals who use hearing aids or have cochlear implants, are on the autism spectrum, or have Alzheimer's disease. [ABSTRACT FROM AUTHOR]
Copyright of Journal of Speech, Language & Hearing Research is the property of American Speech-Language-Hearing Association and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Education Research Complete
Description
Abstract:Purpose: The aim of this study was to investigate the acoustic patterns of six emotions and a neutral state in Cantonese speech by focusing on the prosodic modulations that convey emotional content in this tonal language, which has six lexical tones. Method: We employed the extended Geneva minimalistic acoustic parameter set to systematically analyze the acoustic features of 3,474 recordings from the Cantonese Audio-Visual Emotional Speech Database. Linear mixed-effects models were fitted to examine variations in acoustic parameters across emotional states. Decision tree models were used to assess the relative contributions of 22 acoustic parameters in classifying emotions. Results: By fitting linear mixed-effects models, our results revealed statistically significant variations in most of the acoustic parameters across diverse emotional states. The decision tree models showed the relative contributions of 22 acoustic parameters in the classification of emotions, with spectral parameters accounting for 65.45% of the significance in distinguishing all seven emotional states, significantly exceeding other groups of features. Conclusions: Our findings highlight the unique characteristics of emotional expression in Cantonese, in which spectral parameters play a more significant role compared to the frequency-related parameters that are often emphasized in nontonal languages. Our results contribute significantly to understanding vocal emotion expression in tonal languages and are particularly useful for designing emotion-recognition systems and hearing aids that are tailored to tonal language environments. Furthermore, these insights have potential implications for enhancing emotional communication and cognitive training interventions for Cantonese-speaking individuals who use hearing aids or have cochlear implants, are on the autism spectrum, or have Alzheimer's disease. [ABSTRACT FROM AUTHOR]
ISSN:10924388
DOI:10.1044/2025_JSLHR-24-00677