← All creators
S
Sebastian Raschka
77 recommendations
Products they use or recommend
Showing 14 of 77 recommendations
Clear filters"So, I use the Codeium plugin for VS Code."
"So, I use the Codeium plugin for VS Code. You know, it's very convenient. It's just like a plugin, and then it's a chat interface that has access to your repository."
"I can give you also a hands-on example. I was training the Qwen 3 base model with RLVR on MATH-500. The base model had an accuracy of about 15%. Just 50 steps, like in a few minutes with RLVR, the model went from 15% to 50% accuracy."
"I was training the Qwen 3 base model with RLVR on MATH-500. The base model had an accuracy of about 15%."
"What I would recommend doing, or what I also do, is if I want to understand, for example, how OLMo is implemented, I would look at the weights in the model hub, the config file, and then you can see, 'Oh, they used so many layers.'"
"Sometimes it takes me a day. With OLMo 3, the challenge was RoPE for the position embeddings. They had a YaRN extension and there was some custom scaling there, and I couldn't quite match these things."
"Sometimes for pastime I play video games, like I like- Video games with puzzles, like Zelda and Metroid."
"Sometimes for pastime I play video games, like I like- Video games with puzzles, like Zelda and Metroid."
"Exa is my preferred search provider"
"I try Claude Code on the web every three to six months, which is just prompting a model to make an update to some GitHub repository that I have"
"I used that feature before, and I always feel bad because it does that every day, and I rarely check it out"
"I should say I use Composer a lot because one of the benefits it has is that it's fast"
"We had a Tesla GPU back then just for the computations. It was about 15 years ago now"
"Even back when I was a grad student, I was in a lab doing biophysical simulations, molecular dynamics, and we had a Tesla GPU back then just for the computations. It was about 15 years ago now."