← Back

Scale-RL

technique

A reinforcement learning framework for scaling RL training with language models, developed as part of Meta research.

Also mentioned (1)

Casual references without a clear endorsement

Nathan Lambert mentioned "I think there's a seminal paper from a Meta internship. It's called something like 'The Art of Sc..." ▶ 1:57:37