Sebastian Raschka

69 recommendations

Everything Sebastian personally uses, recommends, or has created — plus things they don't recommend — sourced from their own show and appearances on other podcasts.

Created by Sebastian

MLxtend created

Build a Large Language Model from Scratch created

ADAM Project created

Top picks

Claude Code

"I try Claude Code on the web every three to six months, which is just prompting a model to make an update to some GitHub repository that I have"

VS Code

"So, I use the Codeium plugin for VS Code."

Nvidia

"Even back when I was a grad student, I was in a lab doing biophysical simulations, molecular dynamics, and we had a Tesla GPU back then just for the computations. It was about 15 years ago now."

Codeium

"So, I use the Codeium plugin for VS Code. You know, it's very convenient. It's just like a plugin, and then it's a chat interface that has access to your repository."

OLMo

"What I would recommend doing, or what I also do, is if I want to understand, for example, how OLMo is implemented, I would look at the weights in the model hub, the config file, and then you can see, 'Oh, they used so many layers.'"

Qwen 3

"I can give you also a hands-on example. I was training the Qwen 3 base model with RLVR on MATH-500. The base model had an accuracy of about 15%. Just 50 steps, like in a few minutes with RLVR, the model went from 15% to 50% accuracy."

OLMo 3

"Sometimes it takes me a day. With OLMo 3, the challenge was RoPE for the position embeddings. They had a YaRN extension and there was some custom scaling there, and I couldn't quite match these things."

Exa

"Exa is my preferred search provider"

All products

Recent episodes

All episodes →

State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490

Jan 31, 2026 · on Lex Fridman guest · 27 recs

Show: All 69 | uses recommends mentions

All recommendations

Qwen 3 uses software

"I can give you also a hands-on example. I was training the Qwen 3 base model with RLVR on MATH-500. The base model had an accuracy of about 15%. Just 50 steps, like in a few minutes with RLVR, the model went from 15% to 50% accuracy."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 1:44:16 • Jan 2026

MATH-500 uses other

"I was training the Qwen 3 base model with RLVR on MATH-500. The base model had an accuracy of about 15%."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 1:44:20 • Jan 2026

OLMo uses software

"What I would recommend doing, or what I also do, is if I want to understand, for example, how OLMo is implemented, I would look at the weights in the model hub, the config file, and then you can see, 'Oh, they used so many layers.'"

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 2:01:42 • Jan 2026

Exa uses software

"Exa is my preferred search provider"

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 2:37:20 • Jan 2026

Tesla GPU uses product

"We had a Tesla GPU back then just for the computations. It was about 15 years ago now"

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 4:00:53 • Jan 2026

Codeium uses software

"So, I use the Codeium plugin for VS Code. You know, it's very convenient. It's just like a plugin, and then it's a chat interface that has access to your repository."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 21:59 • Jan 2026

Zelda uses other

"Sometimes for pastime I play video games, like I like- Video games with puzzles, like Zelda and Metroid."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 2:11:30 • Jan 2026

OLMo 3 uses software

"Sometimes it takes me a day. With OLMo 3, the challenge was RoPE for the position embeddings. They had a YaRN extension and there was some custom scaling there, and I couldn't quite match these things."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 2:02:21 • Jan 2026

Metroid uses other

"Sometimes for pastime I play video games, like I like- Video games with puzzles, like Zelda and Metroid."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 2:11:34 • Jan 2026

Nvidia uses product

"Even back when I was a grad student, I was in a lab doing biophysical simulations, molecular dynamics, and we had a Tesla GPU back then just for the computations. It was about 15 years ago now."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 4:00:49 • Jan 2026

VS Code uses software

"So, I use the Codeium plugin for VS Code."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 21:59 • Jan 2026

Cursor Composer uses software

"I should say I use Composer a lot because one of the benefits it has is that it's fast"

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 3:39:38 • Jan 2026

ChatGPT Pulse uses software

"I used that feature before, and I always feel bad because it does that every day, and I rarely check it out"

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 3:21:02 • Jan 2026

Claude Code uses software

"I try Claude Code on the web every three to six months, which is just prompting a model to make an update to some GitHub repository that I have"

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 2:37:55 • Jan 2026

ChatGPT recommends software

"So I suggested, 'Hey, let's try ChatGPT.' We copied the text into ChatGPT, and it fixed them. Instead of two hours going from link to link fixing that, it made that type of work much more seamless."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 1:33:29 • Jan 2026

Recursive Language Model recommends technique

"The Recursive Language Model paper, that is one of the papers that tries to kind of address the long context thing"

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 2:46:45 • Jan 2026

Substack mentions software

"For example, if you read a Substack article, I could maybe ask an LLM to give me opinions on that, but I wouldn't even know what to ask."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 1:21:04 • Jan 2026

TikTok mentions software

"We see this with TikTok. You open it... I don't use TikTok, but supposedly in five minutes the algorithm gets you. It's locked in."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 1:25:05 • Jan 2026

Spotify mentions software

"my wife the other day—she has a podcast for book discussions, a book club, and she was transferring the show notes from Spotify to YouTube, and then the links somehow broke."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 1:33:10 • Jan 2026

YouTube mentions software

"my wife the other day—she has a podcast for book discussions, a book club, and she was transferring the show notes from Spotify to YouTube, and then the links somehow broke."

From: State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490 • ▶ 1:33:10 • Jan 2026

Show all 66 recommendations

Sebastian Raschka

Created by Sebastian

Top picks

All products

Products & gear

Software & tools

Techniques & practices

Recent episodes

All recommendations

Sign in to follow creators