BatCoder teaches a code model to write both code and its documentation by doing a round trip: from code to docs and back to code.
This paper builds a big, fair playground (a benchmark) to test many EEG foundation models side by side under the same rules.
The paper teaches language models using extra 'language homework' made from the same raw text so they learn grammar and meaning, not just how to guess the next word.
Co2S is a new way to train segmentation models with very few labels by letting two different students (CLIP and DINOv3) learn together and correct each other.
This paper introduces NEPA, a very simple way to teach vision models by having them predict the next patch’s embedding in an image sequence, just like language models predict the next word.
Pixels are the raw stuff of images, and this paper shows you can learn great vision skills by predicting pixels directly, not by comparing fancy hidden features.
The paper tackles a paradox: visual tokenizers that get great pixel reconstructions often make worse images when used for generation.