Differences in Reading Screening Accuracy by Percentile Cutoff and English Proficiency: Feature Selection and Group-Wise Prediction Evaluation

Saved in:
Bibliographic Details
Title: Differences in Reading Screening Accuracy by Percentile Cutoff and English Proficiency: Feature Selection and Group-Wise Prediction Evaluation
Language: English
Authors: Julian M. Siebert (ORCID 0000-0002-0472-4677), Phaedra Bell, Nuria Gutiérrez (ORCID 0000-0002-2809-1608), Mónica Zegers (ORCID 0000-0002-9672-6417), Eric Falke, Benjamin W. Domingue (ORCID 0000-0002-3894-9049), Yaacov Petscher (ORCID 0000-0001-8858-3498), Hugh Catts, Lucy Yan, Lillian Durán (ORCID 0000-0003-4364-6445), Maria Luisa Gorno-Tempini (ORCID 0000-0002-7426-7782)
Source: Reading Research Quarterly. 2026 61(1).
Availability: Wiley. Available from: John Wiley & Sons, Inc. 111 River Street, Hoboken, NJ 07030. Tel: 800-835-6770; e-mail: cs-journals@wiley.com; Web site: https://www.wiley.com/en-us
Peer Reviewed: Y
Page Count: 19
Publication Date: 2026
Sponsoring Agency: National Institute on Deafness and Other Communication Disorders (NIDCD) (DHHS/NIH)
National Institute of Neurological Disorders and Stroke (NINDS) (DHHS/NIH)
Contract Number: K24DC015544
R01NS050915
Document Type: Journal Articles
Reports - Research
Education Level: Early Childhood Education
Elementary Education
Kindergarten
Primary Education
Grade 1
Descriptors: Reading Difficulties, Educational Diagnosis, Reading Tests, Screening Tests, Kindergarten, Grade 1, Young Children, At Risk Students, Reading Skills, Predictor Variables, Accuracy, Native Speakers, English Learners
Geographic Terms: California
DOI: 10.1002/rrq.70074
ISSN: 0034-0553
1936-2722
Abstract: This exploratory predictive study investigates our capacity to equitably predict the risk of developing reading difficulties based on a battery of tasks tapping into commonly used reading screening constructs. A key question pertains to the extent to which we can make predictions with equal accuracy for native English speakers (EO) and emerging multilinguals (English learners [ELs] with sufficient English proficiency to participate in English screening). This prediction exercise takes place in a linguistically diverse sample, over different prediction intervals (end of the same and subsequent academic year), and across a range of percentile cutoffs (1st-25th) defining reading difficulty. Specifically, using a sample of 1692 kindergarteners and first graders from 24 schools across 13 Californian public school districts, we (a) identified the most important reading screening tasks for a range of percentile cutoffs defining "risk" using random forest feature selection; (b) built a series of logistic regression classifiers; and (c) evaluated these models' performances for ELs and EO students separately. When risk is defined as performance below the tenth percentile or higher, the same tasks generally surface as the most important predictors of later word reading difficulty (letter naming and sounds for kindergarten; word and non-word reading for first graders), though with different relative importance for EO versus EL students and for the different prediction intervals. Phonological skills (blending, deletion) also show to be more predictive for ELs. Though most models can be optimized to exceed sensitivity and specificity values of 0.80, classification accuracy is higher for EO students, especially for 2-year predictions. Though models meeting conventionally used minimum accuracy requirements for both the EO and EL groups can be produced, more work is needed to achieve truly linguistically equitable (i.e., equally accurate) screening. We discuss the equity implications and feasibility of universal reading screening using English screening tasks in linguistically diverse populations.
Abstractor: As Provided
Entry Date: 2026
Accession Number: EJ1494567
Database: ERIC
Full text is not displayed to guests.
Description
Abstract:This exploratory predictive study investigates our capacity to equitably predict the risk of developing reading difficulties based on a battery of tasks tapping into commonly used reading screening constructs. A key question pertains to the extent to which we can make predictions with equal accuracy for native English speakers (EO) and emerging multilinguals (English learners [ELs] with sufficient English proficiency to participate in English screening). This prediction exercise takes place in a linguistically diverse sample, over different prediction intervals (end of the same and subsequent academic year), and across a range of percentile cutoffs (1st-25th) defining reading difficulty. Specifically, using a sample of 1692 kindergarteners and first graders from 24 schools across 13 Californian public school districts, we (a) identified the most important reading screening tasks for a range of percentile cutoffs defining "risk" using random forest feature selection; (b) built a series of logistic regression classifiers; and (c) evaluated these models' performances for ELs and EO students separately. When risk is defined as performance below the tenth percentile or higher, the same tasks generally surface as the most important predictors of later word reading difficulty (letter naming and sounds for kindergarten; word and non-word reading for first graders), though with different relative importance for EO versus EL students and for the different prediction intervals. Phonological skills (blending, deletion) also show to be more predictive for ELs. Though most models can be optimized to exceed sensitivity and specificity values of 0.80, classification accuracy is higher for EO students, especially for 2-year predictions. Though models meeting conventionally used minimum accuracy requirements for both the EO and EL groups can be produced, more work is needed to achieve truly linguistically equitable (i.e., equally accurate) screening. We discuss the equity implications and feasibility of universal reading screening using English screening tasks in linguistically diverse populations.
ISSN:0034-0553
1936-2722
DOI:10.1002/rrq.70074