Groups
Minimum Description Length (MDL) picks the model that compresses the data best by minimizing L(M) + L(D|M).
KullbackโLeibler (KL) divergence measures how one probability distribution P devotes probability mass differently from a reference distribution Q.