Sci-CoE is a two-stage training method that teaches a single language model both to solve science problems and to check those solutions, using very little labeled data.
Big models are often used to grade AI answers, but they are expensive, slow, and depend heavily on carefully crafted prompts.