Radford's Vintage Language Model Era | Generated by AI

Home 2026.06

Question: What has Alec Radford (the former OpenAI researcher) been doing in recent years?

Answer:

Quick recap of why he matters: Radford was the lead author on GPT-1, GPT-2, CLIP, and Whisper, and a core contributor to DALL-E — a technical IC who never ran a team in the executive sense but kept shipping the field forward, with his name first on the papers that defined the modern era. Here’s the recent timeline:

December 2024 — Left OpenAI. He told colleagues he was leaving to pursue research independently, and said he planned to collaborate with OpenAI and other AI developers. No new lab, no startup — just independent research, which is very on-brand for him.

March–April 2025 — Advisor at Thinking Machines Lab. He joined Mira Murati’s Thinking Machines Lab as an advisor, alongside Bob McGrew, OpenAI’s former chief research officer. Advisory role only, not full-time.

April 28, 2026 — Released Talkie, a “vintage language model.” This is the most interesting one for you, given your nanoGPT/training interests. Radford, with Nick Levine and David Duvenaud, trained a 13B-parameter model on 260 billion tokens of English text all predating January 1, 1931 — historical books, newspapers, scientific journals, patents, and case law. No web crawl, no Wikipedia, no modern code. Key technical details:

If you want to play with it hands-on: weights are on Hugging Face (Apache 2.0, base is ~53 GB so it fits comfortably on your MI300X, and a quantized -it would run on the 4070), inference library on GitHub, live chat at talkie-lm.com. Given that you’re training GPT-2-scale models from scratch, the dataset engineering side of Talkie (OCR pipelines, temporal leakage filtering — misdated documents or old texts with anachronistic editorial introductions can compromise historical fidelity) is probably the most instructive part to dig into.

References:


Back Donate