The paper shows how to train a language model with special extra hints (privileged information) during practice so it can still do well later without any hints.
This paper shows a simple way for AI models to keep learning new things without forgetting what they already know.