Sebastian Raschka
Everything Sebastian personally uses, recommends, or has created, plus things they don't recommend, sourced from their own show and appearances on other podcasts.
Created by Sebastian
Top picks
"I try Claude Code on the web every three to six months, which is just prompting a model to make an update to some GitHub repository that I have"
"So, I use the Codeium plugin for VS Code."
"Even back when I was a grad student, I was in a lab doing biophysical simulations, molecular dynamics, and we had a Tesla GPU back then just for the computations. It was about 15 years ago now."
"So, I use the Codeium plugin for VS Code. You know, it's very convenient. It's just like a plugin, and then it's a chat interface that has access to your repository."
"What I would recommend doing, or what I also do, is if I want to understand, for example, how OLMo is implemented, I would look at the weights in the model hub, the config file, and then you can see, 'Oh, they used so many layers.'"
"I can give you also a hands-on example. I was training the Qwen 3 base model with RLVR on MATH-500. The base model had an accuracy of about 15%. Just 50 steps, like in a few minutes with RLVR, the model went from 15% to 50% accuracy."
"Sometimes it takes me a day. With OLMo 3, the challenge was RoPE for the position embeddings. They had a YaRN extension and there was some custom scaling there, and I couldn't quite match these things."
"Exa is my preferred search provider"
All products
Products & gear
Software & tools
Techniques & practices
Recent episodes
"A lot of researchers at these companies are so well-motivated, and definitely Anthropic and OpenAI culturally want to do good for the world."
"You want to add a new tab in Slack that you want to use, and I think AI will be able to do that pretty well"
"my wife the other day—she has a podcast for book discussions, a book club, and she was transferring the show notes from Spotify to YouTube, and then the links somehow broke."
"take something like Slack or Microsoft Word. I think if organizations allow it, AI could very easily implement features end-to-end"
"signing licensing deals with Black Forest Labs, which is an image generation company, or Midjourney"
"Let's throw in Mistral AI, Gemma..."
"Amazon is making Trainium"
"my wife the other day—she has a podcast for book discussions, a book club, and she was transferring the show notes from Spotify to YouTube, and then the links somehow broke."
"We see this with TikTok. You open it... I don't use TikTok, but supposedly in five minutes the algorithm gets you. It's locked in."
"For example, if you read a Substack article, I could maybe ask an LLM to give me opinions on that, but I wouldn't even know what to ask."
"We are starting to see some types of consolidation with Groq for $20 billion"
"I think there will be some other multi-billion dollar acquisitions, like Perplexity"
"ChatGPT has a memory feature, right? And so you may have a subscription and you use it for personal stuff, but I don't know if you want to use that same thing at work."
"I think when I was at Hugging Face, I was trying to get this to happen, but it was too early. It's like these open robotic models on Hugging Face"
"Let's throw in Mistral AI, Gemma..."
"gpt-oss, the open weight model by OpenAI... gpt-oss-120b is actually a very strong model and does some things that other models don't do very well."
"Actually, NVIDIA had a really cool one, Nemotron 3."
"And then you start, let's say, with your GPT-2 model and add these things."
"I would love to have tried Bing Sydney. Did that have more voice? Because it would so often go off the rails, which is historically obviously a scary way—like telling a reporter to leave his wife—is a crazy model to potentially put in general adoption."
"A lot of researchers at these companies are so well-motivated, and definitely Anthropic and OpenAI culturally want to do good for the world."