Introduction
What follows is an example I worked up while trying to figure out how to communicate potential outcomes in a regression framework to students graphically. The discussion draws on Morgan and Winship’s 2015 book [1], especially pages 122-123. The goal is to represent the potential outcomes framework using standard regression notation, and then to discuss endogenous selection bias within that framework.
Math
Define $Y_i = \mu_0 + (\mu_1 - \mu_0) D_i + \{v_{0i} + D_i (v_{1i} - v_{0i})\}$, where $D_i$ is binary assignment to treatment for individual $i$, $\mu_0$ is the expected outcome for control, $\mu_1$ is the expected outcome for treated, and $\mu_1 - \mu_0$ is the Average Treatment Effect, or $\delta$.
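To make the equation concrete, here is a minimal simulation sketch in Python. It rests on several assumptions that are not stated above: $v_{0i}$ and $v_{1i}$ are read as mean-zero individual deviations from $\mu_0$ and $\mu_1$ (so that $Y_{0i} = \mu_0 + v_{0i}$ and $Y_{1i} = \mu_1 + v_{1i}$), the deviations are standard normal, treatment is assigned by a fair coin flip, and the values $\mu_0 = 1$ and $\mu_1 = 3$ are arbitrary. The function names `simulate` and `difference_in_means` are hypothetical, chosen only for illustration.

```python
import random


def simulate(n=10_000, mu0=1.0, mu1=3.0, seed=0):
    """Draw n individuals and build Y two ways: by switching between
    potential outcomes, and by the regression-style equation."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        # Assumed: mean-zero individual deviations from mu0 and mu1.
        v0 = rng.gauss(0.0, 1.0)
        v1 = rng.gauss(0.0, 1.0)
        # Assumed: random assignment, independent of v0 and v1.
        d = rng.randint(0, 1)

        # Potential outcomes under control and treatment.
        y0 = mu0 + v0
        y1 = mu1 + v1

        # Observed outcome via the switching form.
        y_switch = d * y1 + (1 - d) * y0
        # Observed outcome via the regression form from the text.
        y_reg = mu0 + (mu1 - mu0) * d + (v0 + d * (v1 - v0))

        rows.append((d, y_switch, y_reg))
    return rows


def difference_in_means(rows):
    """Naive estimate of the ATE: mean(Y | D=1) - mean(Y | D=0)."""
    treated = [y for d, y, _ in rows if d == 1]
    control = [y for d, y, _ in rows if d == 0]
    return sum(treated) / len(treated) - sum(control) / len(control)


if __name__ == "__main__":
    rows = simulate()
    print(difference_in_means(rows))
```

Under these assumptions the two constructions of $Y_i$ agree for every individual, and because assignment here is independent of the deviations, the difference in means should land close to $\delta = \mu_1 - \mu_0$. Letting $D_i$ depend on $v_{0i}$ or $v_{1i}$ instead would break that independence, which is the situation the selection bias discussion addresses.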