← Back
Scale-RL
technique
A reinforcement learning framework for scaling RL training with language models, developed as part of Meta research.
Topics
Also mentioned
(1)
Casual references without a clear endorsement
Nathan Lambert
mentioned
"I think there's a seminal paper from a Meta internship. It's called something like 'The Art of Sc..."
▶ 1:57:37