Pinned
we've scaled RL for a 100B+ MoE model achieving SOTA benchmark results for its size
more important than the final model checkpoint is making the frontier infra required to train models like this accessible to everyone
details on the full training recipe, our open source
00:41
Introducing INTELLECT-3: Scaling RL to a 100B+ MoE model on our end-to-end stack
Achieving state-of-the-art performance for its size across math, code and reasoning
Built using the same tools we put in your hands, from environments & evals, RL frameworks, sandboxes & more












