Expectation-Maximization Algorithm
Broadly used algorithm in situations where the observed data is considered incomplete when finding Method of maximum likelihood
- standard tool for mixture models
The complete data
consists of observed data
Formulation
Suppose we have the pdfs or pmfs for the complete data
The complete-data log-likelihood could be formed for
In the first iteration, the E-step finds the expectation of the complete-data log likelihood given
The M-step is to find the value of
for all possible
On the
E-Step: Calculate
M-Step: Find
for all
Stopping Criteria
The pdf of the incomplete data is given by integrating out z and getting the marginal of x
Repeat both the E and M steps alternatively until convergence as the difference of the incomplete-data likelihood