Outcome regression
In this post, we consider the task of estimating treatment effects. This is the basic problem in causal inference, and it arises in a many areas of science and engineering. As a running example, we consider the task of estimating the efficacy of a vaccine booster. We begin by mathematically defining treatment effects using the potential outcomes framework.
To keep things simple, we focus on estimating the effect of a binary treatment (e.g. booster vs no booster). We define two potential outcomes and for each subject in the study. In the running example, is the viral load in the -subject if the subject got the booster, and is the viral load if the subject did not get the booster. The effect of the treatment on the -th subject is
The fundamental challenge in causal inference is only one treatment can be assigned to a subject, so only one of and can be observed. Thus is never observed. Nevertheless, it is possible (as we shall see) to estimate the average treatment effect (ATE)
by performing randomized experiments.
In a randomized experiment, we randomly assign treatments to the subjects and record the outcomes. Let and be the treatment assignment and observed outcome of the -th subject. In the running example, indicates whether the -th subject got the booster and is the (observed) viral load in the -th subject. Mathematically, in a randomized experiment, we have
The first condition (SUTVA) relates the observed outcomes to the potential outcomes: the observed outcome of the -subject is (resp ) if (resp ). The second condition says treatments are assigned in a way that does not depend on the potential outcomes. It implies the distribution of potential outcomes in the treated and untreated groups are identical:
In practice, treatments are often assigned randomly (e.g. by flipping a coin) to satisfy this condition.
Difference-in-means
A simple estimate of the ATE in a randomized experiment is the difference between the (sample) mean outcomes in treated and untreated subjects:
where is the number of subjects assigned treatment $w\in{0,1}$. This is called the difference-in-means estimator, and it is motivated by the observation that the (sample) mean outcome in a treatment group is an unbiased estimate of the expected potential outcome in a randomized experiment:
In light of this observation, it is not hard to see that the difference-in-means estimator is unbiased:
We leave as an exercise to show that is asymptotically normal:
where and for . This result allows us to form confidence intervals and test hypothesis regarding the ATE.
Posted on December 11, 2021
from Ann Arbor, MI