Using Gaussian Process Regression in Two-Dimensional Regression Discontinuity Designs. EdWorkingPaper No. 24-1043

Saved in:
Bibliographic Details
Title: Using Gaussian Process Regression in Two-Dimensional Regression Discontinuity Designs. EdWorkingPaper No. 24-1043
Language: English
Authors: Lily An, Zach Branson, Luke Miratrix, Annenberg Institute for School Reform at Brown University
Source: Annenberg Institute for School Reform at Brown University. 2024.
Availability: Annenberg Institute for School Reform at Brown University. Brown University Box 1985, Providence, RI 02912. Tel: 401-863-7990; Fax: 401-863-1290; e-mail: AISR_Info@brown.edu; Web site: http://www.annenberginstitute.org
Peer Reviewed: N
Page Count: 52
Publication Date: 2024
Document Type: Reports - Research
Education Level: High Schools
Secondary Education
Higher Education
Postsecondary Education
Descriptors: Hierarchical Linear Modeling, English Language Learners, English (Second Language), Second Language Learning, Educational Policy, Public Schools, High Schools, Classification, Language Tests, Scores, College Entrance Examinations
Geographic Terms: Wisconsin
Assessment and Survey Identifiers: ACT Assessment
Abstract: Sometimes a treatment, such as receiving a high school diploma, is assigned to students if their scores on two inputs (e.g., math and English test scores) are above established cutoffs. This forms a multidimensional regression discontinuity design (RDD) to analyze the effect of the educational treatment where there are two running variables instead of one. Present methods for estimating such designs either collapse the two running variables into a single running variable, estimate two separate one-dimensional RDDs, or jointly model the entire response surface. The first two approaches may lose valuable information, while the third approach can be very sensitive to model misspecification. We examine an alternative approach, developed in the context of geographic RDDs, which uses Gaussian processes to flexibly model the response surfaces and estimate the impact of treatment along the full range of students that were on the margin of receiving treatment. We demonstrate theoretically, in simulation, and in an applied example, that this approach has several advantages over current approaches, including over another nonparametric surface response method. In particular, using Gaussian process regression in two-dimensional RDDs shows strong coverage and standard error estimation, and allows for easy examination of treatment effect variation for students with different patterns of running variables and outcomes. As these nonparametric approaches are new in education-specific RDDs, we also provide an R package for users to estimate treatment effects using Gaussian process regression.
Abstractor: As Provided
Entry Date: 2024
Accession Number: ED661553
Database: ERIC
Description
Abstract:Sometimes a treatment, such as receiving a high school diploma, is assigned to students if their scores on two inputs (e.g., math and English test scores) are above established cutoffs. This forms a multidimensional regression discontinuity design (RDD) to analyze the effect of the educational treatment where there are two running variables instead of one. Present methods for estimating such designs either collapse the two running variables into a single running variable, estimate two separate one-dimensional RDDs, or jointly model the entire response surface. The first two approaches may lose valuable information, while the third approach can be very sensitive to model misspecification. We examine an alternative approach, developed in the context of geographic RDDs, which uses Gaussian processes to flexibly model the response surfaces and estimate the impact of treatment along the full range of students that were on the margin of receiving treatment. We demonstrate theoretically, in simulation, and in an applied example, that this approach has several advantages over current approaches, including over another nonparametric surface response method. In particular, using Gaussian process regression in two-dimensional RDDs shows strong coverage and standard error estimation, and allows for easy examination of treatment effect variation for students with different patterns of running variables and outcomes. As these nonparametric approaches are new in education-specific RDDs, we also provide an R package for users to estimate treatment effects using Gaussian process regression.