Font Size: a A A

New Perspectives on Regression Adjustment in Causal Inference, with Applications to Educational Program Evaluation

Posted on:2014-08-09Degree:Ph.DType:Thesis
University:University of MichiganCandidate:Sales, Adam ChaimFull Text:PDF
GTID:2455390008460912Subject:Statistics
Abstract/Summary:
Causal inference from observational data---that is, data that did not come from an experiment---is notoriously difficult: because the probability distribution of the treatment variable Z is unknown, measured or unmeasured variables that correlate with both Z and the outcome Y may confound causal estimates. This thesis will present methods for designing and modeling causal observational studies that combine design-based techniques with regression to account for measured covariates X.;Regression discontinuity designs occur when treatment assignment is a function of a variable T: when T exceeds a threshold c, treatment is assigned. Conventionally, researchers analyze RDDs by regressing Y on both T and Z. This thesis argues for modeling RDDs as naturally-randomized experiments in two steps: modeling the relationship between Y and T, and using that design to infer and estimate effects of Z on Y. We illustrate this approach by reanalyzing a dataset used to estimate the effects of academic probation on students' grade point averages.;The rest of the thesis focuses on propensity-score stratification with high-dimensional data (p >> n). If treatment assignment is a random unknown function of X, researchers can adjust causal estimates for X by estimating propensity scores: subjects' respective probabilities of treatment assignment conditional on X. Researchers then stratify subjects based on their propensity scores and model the data as if treatment were randomized within strata. However, when the dimension of X is large, propensity-score estimation is impossible. We propose a method in which a subset of X is used to estimate propensity scores. Next, the entire matrix X can be used to model Y, using a high-dimensional regression technique; the model is trained on subjects excluded from the stratification. The model's predictions of Y can then be used to test balance on, and adjust for, the entire set of covariates in X. We illustrate this method by evaluating two high-school educational programs.
Keywords/Search Tags:Causal, Regression
Related items