Lex Fridman Podcast
#490 – State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI
with Sebastian Raschka & Nathan Lambert
1 Feb 2026
18 min read
2h 5m
TL;DR
The AI landscape in early 2026 is defined by intense competition between US closed-weight models (OpenAI, Google, Anthropic) and Chinese open-weight alternatives (DeepSeek, Qwen, MiniMax), with no clear winner emerging. While US models currently maintain superior intelligence and speed advantages, Chinese open models are proliferating rapidly and gaining influence through accessibility, forcing differentiation through infrastructure, organizational culture, and specialized capabilities like coding and long-context retrieval.
Sebastian Raschka is a machine learning researcher and author of "Build a Large Language Model from Scratch" and "Build a Reasoning Model from Scratch." Nathan Lambert is the post-training lead at the Allen Institute for AI and author of the definitive book on Reinforcement Learning from Human Feedback. Both are widely respected researchers, engineers, educators, and influential voices on AI development and open-weight models.
Takeaways
1
Infrastructure & resources trump proprietary tech The fastest innovation spreads across labs as researchers move jobs, making technical ideas non-proprietary. Winners will be determined by GPU access, data center scale, and training budgets rather than algorithm secrecy. Google's vertical integration (TPUs, custom chips) and OpenAI's research velocity give them structural advantages unrelated to secret sauce.
2
Chinese open models reshape competitive dynamics DeepSeek's 2025 breakthrough catalyzed a wave of open-weight releases from Qwen, MiniMax, and Kimi that now threaten US API monetization. While US closed-weight models maintain intelligence/speed advantages today, Chinese firms are building influence internationally and challenging the assumption that AI users will pay subscription fees—a shift particularly significant outside North America.
3
Tool use unlocks the next LLM reliability frontier gpt-oss-120b and similar models trained with web search and code execution capabilities represent a paradigm shift beyond pure memorization. Rather than solving hallucinations through scaling, tool-augmented LLMs can ground answers in real-time data and computation, fundamentally changing how users interact with models and what reliability means.