Spectral Degradation and Speaking Style Effects on Emotional Prosody Perception Are Largely Independent of Cross-Modal Dual-Tasking.
| Title: | Spectral Degradation and Speaking Style Effects on Emotional Prosody Perception Are Largely Independent of Cross-Modal Dual-Tasking. |
|---|---|
| Authors: | Xie, Zilong1,2 zx22c@fsu.edu |
| Source: | American Journal of Audiology. Mar 2026, Vol. 35, Issue 1, pp. 218-230. 13 p. |
| Subject Terms: | *Data analysis, *Emotions, *Attention, *Speech perception, *Auditory perception, *Visual perception, Task performance, Research funding, Long short-term memory, Descriptive statistics, Physiological aspects of speech, Psycholinguistics, Statistics, Human voice, Data analysis software, Reaction time |
| Abstract: | Purpose: This study examined the extent to which spectral degradation and speaking style (child-directed vs. adult-directed speech) affect emotion recognition from prosodic cues and how these effects are modulated by concurrent tasks involving nonauditory sensory input. Method: Adults with normal hearing completed an emotion recognition task under three conditions: alone (auditory single-task), concurrently with a low-load visual memory task (four identical images), and with a high-load visual memory task (four different images). Stimuli consisted of semantically neutral sentences spoken in five emotions (angry, happy, neutral, sad, and scared) and two speaking styles (child-directed and adult-directed). All sentences were vocoded to simulate spectral degradation. Emotion recognition was assessed using a single-interval, five-alternative, forced-choice paradigm, in which the participants were asked to indicate which of five emotions was associated with each heard sentence. Results: Emotion recognition was significantly reduced for vocoded stimuli, as indicated by lower sensitivity (d') and prolonged reaction times (RTs). Child-directed speech led to better performance than adult-directed speech, although its facilitative effect was reduced under vocoded conditions. Dual-tasking impaired performance, with lower d' values in both dual-task conditions and slower RTs under high-load dual-task conditions. Crucially, dual-task effects did not significantly vary with spectral degradation or speaking style. Conclusions: Top-down cognitive demands from cross-modal dual-tasking and bottom-up stimulus factors, such as spectral degradation and speaking style, independently influence emotion recognition from prosodic cues. These findings provide insight into how cochlear implant users perceive emotional speech in complex, multimodal environments. [ABSTRACT FROM AUTHOR] |
| Copyright of American Journal of Audiology is the property of American Speech-Language-Hearing Association and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Education Research Complete |
| DOI: | 10.1044/2025_AJA-25-00190 |
| ISSN: | 1059-0889 (print) |
| Language: | English |
| Accession Number: | 192148343 |
| Permalink: | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=ehh&AN=192148343 |