Routing Strategies and Optimizing Design for Multistage Testing in International Large-Scale Assessments

Bibliographic Details
Title: Routing Strategies and Optimizing Design for Multistage Testing in International Large-Scale Assessments
Language: English
Authors: Svetina, Dubravka; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David
Source: Journal of Educational Measurement. Spr 2019 56(1):192-213.
Availability: Wiley-Blackwell. 350 Main Street, Malden, MA 02148. Tel: 800-835-6770; Tel: 781-388-8598; Fax: 781-388-8232; e-mail: cs-journals@wiley.com; Web site: http://www.wiley.com/WileyCDA
Peer Reviewed: Y
Page Count: 22
Publication Date: 2019
Document Type: Journal Articles; Reports - Research
Descriptors: Measurement, Item Analysis, Test Construction, Item Response Theory, Test Length, Scoring, Test Bias, Test Items, Simulation
DOI: 10.1111/jedm.12206
ISSN: 0022-0655
Abstract: This study investigates the effect of several design and administration choices on item exposure and person/item parameter recovery under a multistage test (MST) design. In a simulation study, we examine whether number-correct (NC) or item response theory (IRT) methods are differentially effective at routing students to the correct next stage(s) and whether routing choices (optimal versus suboptimal routing) have an impact on achievement precision. Additionally, we examine the impact of testlet length on both person and item recovery. Overall, our results suggest that no single approach works best across the studied conditions. With respect to the mean person parameter recovery, IRT scoring (via either Fisher information or preliminary EAP estimates) outperformed classical NC methods, although differences in bias and root mean squared error were generally small. Item exposure rates were found to be more evenly distributed when suboptimal routing methods were used, and item recovery (both difficulty and discrimination) was most precisely observed for items with moderate difficulties. Based on the results of the simulation study, we draw conclusions and discuss implications for practice in the context of international large-scale assessments that recently introduced adaptive assessment in the form of MST. Future research directions are also discussed.
Abstractor: As Provided
Entry Date: 2019
Accession Number: EJ1208659
Database: ERIC