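Minimum Description Length (MDL) picks the model that compresses the data best: it minimizes L(M) + L(D|M), where L(M) is the number of bits needed to describe the model and L(D|M) is the number of bits needed to describe the data given that model.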
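As a rough illustration of the L(M) + L(D|M) trade-off, the sketch below compares two models for a sequence of coin flips. Everything in it is an assumption made for the example, not part of the definition above: the function names (data_bits, mdl_scores, mdl_pick) are hypothetical, the fair-coin model is charged 0 bits for L(M), and the one-parameter biased-coin model is charged roughly (1/2) log2 n bits, a common approximation for the cost of one fitted parameter.

```python
import math


def data_bits(heads, tails, p):
    """L(D|M): bits needed to encode the flips under Bernoulli(p),
    i.e. the negative base-2 log-likelihood."""
    bits = 0.0
    if heads:
        bits -= heads * math.log2(p)
    if tails:
        bits -= tails * math.log2(1 - p)
    return bits


def mdl_scores(heads, tails):
    """Total description length L(M) + L(D|M) for each candidate model."""
    n = heads + tails
    # Model "fair": p fixed at 0.5, no free parameters.
    # Assumption: L(M) = 0 bits.
    fair = 0.0 + data_bits(heads, tails, 0.5)
    # Model "biased": p fitted to the data (one free parameter).
    # Assumption: L(M) is approximated by (1/2) * log2(n) bits.
    p_hat = heads / n
    biased = 0.5 * math.log2(n) + data_bits(heads, tails, p_hat)
    return {"fair": fair, "biased": biased}


def mdl_pick(heads, tails):
    """Return the name of the model with the shortest total description."""
    scores = mdl_scores(heads, tails)
    return min(scores, key=scores.get)


print(mdl_pick(50, 50))  # balanced flips: the extra parameter is not worth its cost
print(mdl_pick(90, 10))  # skewed flips: the fitted model compresses far better
```

With 50 heads and 50 tails the fitted parameter buys no compression, so the simpler model wins; with 90 heads and 10 tails the saving in L(D|M) far outweighs the few bits spent on L(M).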
Cross-entropy loss measures how well predicted probabilities match the true labels: it is the negative log of the probability assigned to the correct class, so confident wrong predictions are penalized heavily.
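A minimal sketch of why confident wrong predictions cost so much, assuming a single example with one true class and natural logarithms. The helper name cross_entropy and the probability values are made up for illustration.

```python
import math


def cross_entropy(probs, label):
    """Loss for one example: negative natural log of the probability
    the model assigned to the true class."""
    return -math.log(probs[label])


# The true class is index 0 in all three cases.
confident_right = cross_entropy([0.9, 0.1], 0)  # about 0.105
unsure = cross_entropy([0.5, 0.5], 0)           # about 0.693
confident_wrong = cross_entropy([0.1, 0.9], 0)  # about 2.303

print(confident_right, unsure, confident_wrong)
```

The loss grows without bound as the probability on the true class approaches zero, which is why a confidently wrong prediction is penalized far more than an uncertain one.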