Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks
BeginnerAtsuki Yamaguchi, Maggie Mi et al.Jan 6arXiv
The paper teaches language models using extra 'language homework' made from the same raw text so they learn grammar and meaning, not just next-word guessing.
#language model pretraining#causal language modeling#linguistic competence