← All episodes

An AI state of the union: We’ve passed the inflection point & dark factories are coming

| 43 products mentioned
Watch on YouTube ai-driven software development coding agents agentic engineering test-driven development product prototyping dark factories career resilience

Simon Willison, co-creator of Django and veteran engineer with 25+ years of experience, argues that AI has crossed an inflection point in late 2024 where coding agents now reliably produce working code—shifting the bottleneck away from code generation to ideation, testing, and maintaining quality. The episode explores how this changes what software engineering means, why mid-career engineers are most at risk, and how the emerging "dark factory" pattern could enable production software built and tested entirely by AI agents without human code review. For builders, the key insight is that code is now cheap, which means the winners will be those who use this velocity advantage to ship higher-quality, more ambitious products rather than simply faster output.

Key takeaways
  • Coding agents have crossed a threshold where they "almost all of the time" do what you tell them to do—meaning you can now prototype three different solutions to a design problem and test which one users prefer, shifting product development from "build one thing well" to "explore multiple directions cheaply."
  • Test-driven development is crucial when working with coding agents because it forces agents to run and verify code; teams removing tests for speed are making a mistake, as tests actually accelerate development by preventing regressions when adding features.
  • Building a personal knowledge base of working code snippets, prototypes, and research projects (stored in GitHub or similar) lets you instruct agents to combine prior solutions—Simon maintains 193+ small HTML/JavaScript tools and 75+ research projects that agents can reference to solve new problems faster.
  • Mid-career engineers are at highest risk from AI disruption (not juniors or seniors): experienced engineers can amplify their skills with agents, interns onboard faster, but 5-15 year engineers lack both the deep expertise to leverage agents effectively and the fresh perspective of AI-native developers.
  • The dark factory pattern—where software is built and tested by agents without humans reading code—is already being tested in production; StrongDM uses swarms of simulated end-users running 24/7 testing against simulated APIs, spending $10,000/day on tokens to validate security-critical software without manual review.
  • Mid-career professionals should lean into AI as a learning amplifier: use it to prototype in unfamiliar domains you'd normally skip (like AppleScript), maintain ambitious projects, and build agency by deciding *what* to build rather than *how* to build it.

Recommendations (23)

ChatGPT
ChatGPT uses

"It used to be you'd ask ChatGPT for some code and it would spit out some code and you have to run it and test it."

Simon Willison · ▶ 0:06

Claude Code

"Anthropic came up with Claude Code back in sort of February of 2025 and it took off like crazy and a bunch of people started signing up for $200 a month accounts"

Simon Willison · ▶ 3:45

simonwillson.net
simonwillson.net recommends

"he's been sharing everything he's learning in real time through his incredible blog, simonwillson.net."

Lenny's Podcast · ▶ 2:20

Claude
Claude uses

"These days, I'll often just tell Claude, here's a URL to this thing, here's another thing. Go and read the source code and then solve this new problem. And it works so well."

Simon Willison · ▶ 1:07:27

"I never used Apple Script because Apple Script is a whole programming language you have to learn and I've been using Apple Script for like two and a half years now because ChatGPT knows Apple Script"

Simon Willison · ▶ 32:09

Mac
Mac uses

"and so now I can automate things on my Mac and that's great"

Simon Willison · ▶ 32:19

Hacker News

"I spend way too much time on Hacker News"

Simon Willison · ▶ 40:31

Claude Code for Web

"I use that one more than the one on my own computer. Partly because that's the one you can access through your phone."

Simon Willison · ▶ 48:52

Anthropic Claude app

"If you've got the Anthropic Claude app installed on iPhone, there's a code tab and you can go in there"

Simon Willison · ▶ 48:59

GPT-5.4
GPT-5.4 uses

"OpenAI came out with GPT 5.4 about 3 weeks ago. It's very very very good. I think it's on par with Claude Opus 4.6"

Simon Willison · ▶ 50:55

Claude Opus 4.6

"I think it's on par with Claude Opus 4.6 and possibly even better."

Simon Willison · ▶ 51:01

Codex
Codex uses

"OpenAI Codex and OpenAI Codex and Claude Code are almost almost indistinguishable from each other now. They're both very very good pieces of software."

Simon Willison · ▶ 51:11

Gemini
Gemini uses

"sometimes via the Gemini app. Like that that's that's a good option as well. And then I mean for image generation I'm using Gemini"

Simon Willison · ▶ 54:54

Apple Notes

"I've got 10,000 Apple notes as well that I just constantly add new things to"

Simon Willison · ▶ 1:05:09

Dropbox recommends

"The easiest one would be like a folder synced to Dropbox or something like that"

Simon Willison · ▶ 1:05:43

PDF.js
PDF.js uses

"I'd written some code which used a PDF library from Mozilla. So it's in JavaScript but it can open up a PDF and show you that PDF on the page."

Simon Willison · ▶ 1:06:51

Tesseract
Tesseract uses

"I'd also written some code that used Tesseract, which is an OCR library that can run in your browser and do actually really good OCR all in JavaScript."

Simon Willison · ▶ 1:07:00

Claude Opus

"So I told Claude Opus, I said, 'Here is the code for the OCR, the PDF thing I did. Here's the code for the OCR thing. Build a new thing that can open a PDF file and OCR every page.' And it did it."

Simon Willison · ▶ 1:07:12

"This is the most important thing when you're working with coding agents is they have to test the code. The best way to do that is to use a programming technique that we've been using for decades ca..."

Simon Willison · ▶ 1:08:57

Red/Green TDD recommends

"If you use the term red/green TDD, that's programming jargon which I didn't used to use, but it is jargon for run the test and watch them fail. The agents know what that means."

Simon Willison · ▶ 1:12:13

OpenClaw
OpenClaw uses

"I don't run it myself outside of a Docker container where I set it up to safely poke it and see what it could do. I got one running right here on my Mac Mini."

Simon Willison · ▶ 1:31:28

Docker
Docker uses

"I don't run it myself outside of a Docker container where I set it up to safely poke it"

Simon Willison · ▶ 1:31:28

Mac Mini
Mac Mini uses

"I got one running right here on my Mac Mini. Did you buy the Mac Mini for it? Yeah, I did."

Simon Willison · ▶ 1:31:33

Mentioned (20)

Pinterest
Pinterest "He co-created Django, the web framework that powers Instagram, Pinterest, Spotify, and thousands ..." ▶ 1:47
Spotify
Spotify "He co-created Django, the web framework that powers Instagram, Pinterest, Spotify, and thousands ..." ▶ 1:47
OpenAI
OpenAI "both Anthropic and OpenAI realized that code is the application like being a having these things ..." ▶ 3:36
Anthropic
Anthropic "both Anthropic and OpenAI realized that code is the application like being a having these things ..." ▶ 3:36
GPT o1
GPT o1 "OpenAI's o1 was the first model to exhibit that. And now all of the models do it." ▶ 4:14
GPT-4.1 "in November we had what I call the inflection point where GPT 5.1 and Claude Opus 4.5 came along ..." ▶ 4:36
Claude Opus 4.5
Claude Opus 4.5 "in November we had what I call the inflection point where GPT 5.1 and Claude Opus 4.5 came along ..." ▶ 4:36
Andrej Karpathy "and I like Andrej Karpathy's original definition of vibe coding" ▶ 8:55
StrongDM
StrongDM "this company called StrongDM has been pushing this and doing some really interesting experiments ..." ▶ 13:34
Jira
Jira "somebody needs to assign you access to Jira and then give you access to Slack and all of that kin..." ▶ 15:53
Slack
Slack "somebody needs to assign you access to Jira and then give you access to Slack and all of that kin..." ▶ 15:53
Okta
Okta "they built their own simulation of Slack and Jira and Okta and all of this software they were int..." ▶ 17:19
Go
Go "once they spun it up it was a little Go binary that sat there" ▶ 17:53
Playwright
Playwright "you can tell Claude Code fire up a sub agent that uses Playwright to simulate a browser" ▶ 18:37
Firefox "I think Firefox just a few days ago, maybe last week, said that they'd done a release which was a..." ▶ 19:47
Mozilla
Mozilla "Anthropic had discovered a hundred like potential vulnerabilities in Firefox and responsibly repo..." ▶ 19:57
Linear
Linear "I was talking to the founder of Linear the other day and this idea of the factory" ▶ 37:25
Cloudflare
Cloudflare "if you talk to Cloudflare and Shopify both said they were hiring a thousand interns over the cour..." ▶ 29:56
Shopify
Shopify "if you talk to Cloudflare and Shopify both said they were hiring a thousand interns over the cour..." ▶ 29:56
Thoughtworks
Thoughtworks "Thoughtworks the big IT consultancy did a offsite a few about a month ago" ▶ 29:31