Groups
Mean field variational family assumes the joint posterior over latent variables factorizes into independent pieces q(z) = ∏ q_i(z_i).
Label smoothing replaces a hard one-hot target with a slightly softened distribution to reduce model overconfidence.