Addressing Bias in Spoken Language Systems Used in the Development and Implementation of Automated Child Language-Based Assessment

Saved in:
Bibliographic Details
Title: Addressing Bias in Spoken Language Systems Used in the Development and Implementation of Automated Child Language-Based Assessment
Language: English
Authors: Alison L. Bailey (ORCID 0000-0001-9303-8304), Alexander Johnson, Natarajan Balaji Shankar, Hariram Veeramani, Julie A. Washington, Abeer Alwan
Source: Journal of Educational Measurement. 2026 63(1).
Availability: Wiley. Available from: John Wiley & Sons, Inc. 111 River Street, Hoboken, NJ 07030. Tel: 800-835-6770; e-mail: cs-journals@wiley.com; Web site: https://www.wiley.com/en-us
Peer Reviewed: Y
Page Count: 30
Publication Date: 2026
Sponsoring Agency: National Science Foundation (NSF)
Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) (DHHS/NIH)
Contract Number: 2202585
P01HD070837
Document Type: Journal Articles
Reports - Research
Descriptors: Bias, Oral Language, Automation, Child Language, Speech Communication, Scoring, Accuracy, Natural Language Processing, Algorithms, Measurement, Language Variation, Black Dialects, Suprasegmentals, Pronunciation, Language Usage, Grammar, Language Impairments, Reading Difficulties
DOI: 10.1111/jedm.12435
ISSN: 0022-0655
1745-3984
Abstract: This article addresses bias in Spoken Language Systems (SLS) that involve both Automatic Speech Recognition (ASR) and Natural Language Processing (NLP) and reports experiments to improve the performance of SLS for automated language and literacy-related assessments with students who are under served in the U.S. educational system. We frame bias in SLS in terms of testing fairness and validity, stemming in part from the exclusion of sufficiently large training datasets in varieties of English other than General American English (GAE). We adopt an Interpretation/Use Argument approach to validity focused on clarity of constructs and scoring accuracy. While SLS use ASR to automatically transcribe students' utterances, and apply NLP algorithms to ASR transcripts to measure students' speech samples, it is well-documented in studies with adults that ASR is typically more problematic for African American English (AAE) speakers than for other groups due to differences in prosody, pronunciation, word usage, and grammar. We utilized child speech and text corpora to improve algorithms that score oral task responses for child AAE speakers and, in some experiments, children with oral language and reading difficulties. Favorable results provide impetus and possible solutions for fair and inclusive assessments for diverse student groups in the future.
Abstractor: As Provided
Entry Date: 2026
Accession Number: EJ1501375
Database: ERIC
Full text is not displayed to guests.
Description
Abstract:This article addresses bias in Spoken Language Systems (SLS) that involve both Automatic Speech Recognition (ASR) and Natural Language Processing (NLP) and reports experiments to improve the performance of SLS for automated language and literacy-related assessments with students who are under served in the U.S. educational system. We frame bias in SLS in terms of testing fairness and validity, stemming in part from the exclusion of sufficiently large training datasets in varieties of English other than General American English (GAE). We adopt an Interpretation/Use Argument approach to validity focused on clarity of constructs and scoring accuracy. While SLS use ASR to automatically transcribe students' utterances, and apply NLP algorithms to ASR transcripts to measure students' speech samples, it is well-documented in studies with adults that ASR is typically more problematic for African American English (AAE) speakers than for other groups due to differences in prosody, pronunciation, word usage, and grammar. We utilized child speech and text corpora to improve algorithms that score oral task responses for child AAE speakers and, in some experiments, children with oral language and reading difficulties. Favorable results provide impetus and possible solutions for fair and inclusive assessments for diverse student groups in the future.
ISSN:0022-0655
1745-3984
DOI:10.1111/jedm.12435