Groups
Category
Minimum Description Length (MDL) picks the model that compresses the data best by minimizing L(M) + L(D|M).
The Information Bottleneck (IB) principle formalizes learning compact representations T that keep only the information about X that is useful for predicting Y.