Optimality of the conditional expectation
This post supplements the supervised learning slides. Please see the slides for the setup.
We wish to show that the conditional expectation is the minimum mean squared error (MSE) prediction function of from ; i.e.
First, we note that the problem of finding the minimum MSE prediction function of from is equivalent to the problem of finding the minimum MSE constant prediction of ; i.e. finding the constant such that
This is because the minimum MSE prediction function must equal at ; i.e. . Otherwise, it is possible to reduce the MSE of by replacing its value at with :
Second, we show that by solving the optimization problem: . The cost function seems complicated, but it is actually a quadratic function of :
We differentiate the cost function and find its root to deduce . Recalling from the first part, we conclude
Posted on August 30, 2021
from Ann Arbor, MI