INTELLECT-3: Technical Report
Intermediate | Prime Intellect Team, Mika Senghaas et al. | Dec 18 | arXiv
INTELLECT-3 is a 106B-parameter Mixture-of-Experts model (about 12B parameters active per token) trained with large-scale reinforcement learning. It outperforms many larger models on math, coding, science, and reasoning benchmarks.
#INTELLECT-3 #prime-rl #verifiers