Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision
IntermediateWei Du, Shubham Toshniwal et al.Dec 17arXiv
Nemotron-Math is a giant math dataset with 7.5 million step-by-step solutions created in three thinking styles and with or without Python help.
#mathematical reasoning#long-context fine-tuning#multi-mode supervision