← All creators
N

Nathan Lambert

37 recommendations

Showing 4 of 37 recommendations

Clear filters
OLMo created software
"I work at the Allen Institute for AI, where we've been building OLMo, which releases data and code."
RLHF Book created media
"I think a lot of what I was trying to do in this RLHF book is take post-training techniques and describe how people think about them influencing the model and what people are doing."
RLVR created technique
"Fun fact: I was on the team that came up with the term RLVR, which is from our Tulu 3 work before DeepSeek."
Tulu 3 created software
"Fun fact: I was on the team that came up with the term RLVR, which is from our Tulu 3 work before DeepSeek."