Effects of Fundamental Frequency and Vocal Tract Resonance on Sentence Recognition in Noise.

Saved in:

Bibliographic Details
Title:	Effects of Fundamental Frequency and Vocal Tract Resonance on Sentence Recognition in Noise.
Authors:	Jing Yang¹, A. Xianhui Wang¹, Victoria Costa¹, Li Xu¹
Source:	Journal of Speech, Language & Hearing Research. Jun2025, Vol. 68 Issue 6, p3011-3022. 12p.
Subject Terms:	Intelligibility of speech, Speech perception, Auditory perception, Comparative studies, Vocal cords, Automatic speech recognition, T-test (Statistics), Logistic regression analysis, Sex distribution, Descriptive statistics, Physiological aspects of speech, Nose, Psychometrics, Acoustic stimulation, Sensitivity & specificity (Statistics)
Abstract:	Purpose: This study examined the effects of change in a talker's sex-related acoustic properties (fundamental frequency [F0] and vocal tract resonance [VTR]) on speech recognition in noise. Method: The stimuli were Hearing in Noise Test sentences, with the F0 and VTR of the original male talker manipulated into four conditions: low F0 and low VTR (LF0LVTR; i.e., the original recordings), low F0 and high VTR (LF0HVTR), high F0 and high VTR (HF0HVTR), and high F0 and low VTR (HF0LVTR). The listeners were 42 English-speaking, normal-hearing adults (21--31 years old). The sentences mixed with speech spectrum--shaped noise at various signal-to-noise ratios (i.e., -10, -5, 0, and +5 dB) were presented to the listeners for recognition. Results: The results revealed no significant differences between the HF0HVTR and LF0LVTR conditions in sentence recognition performance and the estimated speech reception thresholds (SRTs). However, in the HF0LVTR and LF0HVTR conditions, the recognition performance was reduced, and the listeners showed significantly higher SRTs relative to those in the HF0HVTR and LF0LVTR conditions. Conclusion: These findings indicate that male and female voices with matched F0 and VTR (e.g., LF0LVTR and HF0HVTR) yield equivalent speech recognition in noise, whereas voices with mismatched F0 and VTR may reduce intelligibility in noisy environments. [ABSTRACT FROM AUTHOR]
	Copyright of Journal of Speech, Language & Hearing Research is the property of American Speech-Language-Hearing Association and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database:	Education Research Complete

Be the first to leave a comment!