← All creators
N
Nathan Lambert
37 recommendations
Products they use or recommend
Showing 4 of 37 recommendations
Clear filters"I work at the Allen Institute for AI, where we've been building OLMo, which releases data and code."
"I think a lot of what I was trying to do in this RLHF book is take post-training techniques and describe how people think about them influencing the model and what people are doing."
"Fun fact: I was on the team that came up with the term RLVR, which is from our Tulu 3 work before DeepSeek."
"Fun fact: I was on the team that came up with the term RLVR, which is from our Tulu 3 work before DeepSeek."