NanoKnow is a new benchmark that checks whether a language model's answers come from what it saw during training or from extra text we give it at question time.
The paper teaches language models using extra 'language homework' made from the same raw text so they learn grammar and meaning, not just next-word guessing.