All observation models define a likelihood for producing data \(x_n\) from some cluster-specific density with parameter \(\phi_k\):
Supported Bayesian methods require specifying a (conjugate) prior:
The links below describe the mathematical and computational details for performing standard variational optimization for supported observation models: