Groups
Category
Mixed precision training stores and computes tensors in low precision (FP16/BF16) for speed and memory savings while keeping a master copy of weights in FP32 for accurate updates.
Multi-task loss balancing aims to automatically set each task’s weight so that no single loss dominates training.