Differences in Reading Screening Accuracy by Percentile Cutoff and English Proficiency: Feature Selection and Group-Wise Prediction Evaluation
Saved in:
| Title: | Differences in Reading Screening Accuracy by Percentile Cutoff and English Proficiency: Feature Selection and Group-Wise Prediction Evaluation |
|---|---|
| Language: | English |
| Authors: | Julian M. Siebert (ORCID |
| Source: | Reading Research Quarterly. 2026 61(1). |
| Availability: | Wiley. Available from: John Wiley & Sons, Inc. 111 River Street, Hoboken, NJ 07030. Tel: 800-835-6770; e-mail: cs-journals@wiley.com; Web site: https://www.wiley.com/en-us |
| Peer Reviewed: | Y |
| Page Count: | 19 |
| Publication Date: | 2026 |
| Sponsoring Agency: | National Institute on Deafness and Other Communication Disorders (NIDCD) (DHHS/NIH) National Institute of Neurological Disorders and Stroke (NINDS) (DHHS/NIH) |
| Contract Number: | K24DC015544 R01NS050915 |
| Document Type: | Journal Articles Reports - Research |
| Education Level: | Early Childhood Education Elementary Education Kindergarten Primary Education Grade 1 |
| Descriptors: | Reading Difficulties, Educational Diagnosis, Reading Tests, Screening Tests, Kindergarten, Grade 1, Young Children, At Risk Students, Reading Skills, Predictor Variables, Accuracy, Native Speakers, English Learners |
| Geographic Terms: | California |
| DOI: | 10.1002/rrq.70074 |
| ISSN: | 0034-0553 1936-2722 |
| Abstract: | This exploratory predictive study investigates our capacity to equitably predict the risk of developing reading difficulties based on a battery of tasks tapping into commonly used reading screening constructs. A key question pertains to the extent to which we can make predictions with equal accuracy for native English speakers (EO) and emerging multilinguals (English learners [ELs] with sufficient English proficiency to participate in English screening). This prediction exercise takes place in a linguistically diverse sample, over different prediction intervals (end of the same and subsequent academic year), and across a range of percentile cutoffs (1st-25th) defining reading difficulty. Specifically, using a sample of 1692 kindergarteners and first graders from 24 schools across 13 Californian public school districts, we (a) identified the most important reading screening tasks for a range of percentile cutoffs defining "risk" using random forest feature selection; (b) built a series of logistic regression classifiers; and (c) evaluated these models' performances for ELs and EO students separately. When risk is defined as performance below the tenth percentile or higher, the same tasks generally surface as the most important predictors of later word reading difficulty (letter naming and sounds for kindergarten; word and non-word reading for first graders), though with different relative importance for EO versus EL students and for the different prediction intervals. Phonological skills (blending, deletion) also show to be more predictive for ELs. Though most models can be optimized to exceed sensitivity and specificity values of 0.80, classification accuracy is higher for EO students, especially for 2-year predictions. Though models meeting conventionally used minimum accuracy requirements for both the EO and EL groups can be produced, more work is needed to achieve truly linguistically equitable (i.e., equally accurate) screening. We discuss the equity implications and feasibility of universal reading screening using English screening tasks in linguistically diverse populations. |
| Abstractor: | As Provided |
| Entry Date: | 2026 |
| Accession Number: | EJ1494567 |
| Database: | ERIC |
|
Full text is not displayed to guests.
Login for full access.
|
|
| Abstract: | This exploratory predictive study investigates our capacity to equitably predict the risk of developing reading difficulties based on a battery of tasks tapping into commonly used reading screening constructs. A key question pertains to the extent to which we can make predictions with equal accuracy for native English speakers (EO) and emerging multilinguals (English learners [ELs] with sufficient English proficiency to participate in English screening). This prediction exercise takes place in a linguistically diverse sample, over different prediction intervals (end of the same and subsequent academic year), and across a range of percentile cutoffs (1st-25th) defining reading difficulty. Specifically, using a sample of 1692 kindergarteners and first graders from 24 schools across 13 Californian public school districts, we (a) identified the most important reading screening tasks for a range of percentile cutoffs defining "risk" using random forest feature selection; (b) built a series of logistic regression classifiers; and (c) evaluated these models' performances for ELs and EO students separately. When risk is defined as performance below the tenth percentile or higher, the same tasks generally surface as the most important predictors of later word reading difficulty (letter naming and sounds for kindergarten; word and non-word reading for first graders), though with different relative importance for EO versus EL students and for the different prediction intervals. Phonological skills (blending, deletion) also show to be more predictive for ELs. Though most models can be optimized to exceed sensitivity and specificity values of 0.80, classification accuracy is higher for EO students, especially for 2-year predictions. Though models meeting conventionally used minimum accuracy requirements for both the EO and EL groups can be produced, more work is needed to achieve truly linguistically equitable (i.e., equally accurate) screening. We discuss the equity implications and feasibility of universal reading screening using English screening tasks in linguistically diverse populations. |
|---|---|
| ISSN: | 0034-0553 1936-2722 |
| DOI: | 10.1002/rrq.70074 |