All-In
Jensen Huang LIVE: Nvidia's Future, Physical AI, Rise of the Agent, Inference Explosion, AI PR Crisis
with Jensen Huang, CEO of Nvidia
19 Mar 2026
28 min read
2h 35m
TL;DR
Nvidia has evolved from a GPU company to an "AI factory" company through disaggregated inference and heterogeneous computing, enabling 10,000X computation increases in just two years. Jensen argues that a $50B inference factory produces lower-cost tokens despite higher upfront costs, and that agentic AI—not just generative AI—will drive massive revenue growth as agents replace human labor in knowledge work. The real PR crisis for AI isn't regulation but the risk that fear-driven policy will cause the U.S. to fall behind while other nations adopt the technology.
All-In is a weekly podcast featuring four prominent tech investors and entrepreneurs discussing the latest developments in technology, business, and culture. This special episode preempts the regular show to feature Jensen Huang, CEO of Nvidia, discussing the company's AI infrastructure strategy, physical AI, agent systems, and the inference explosion reshaping the industry.
Takeaways
1
Disaggregated inference reshapes AI economics Nvidia's shift from monolithic GPU design to disaggregated inference—splitting inference pipelines across heterogeneous chips (GPUs, CPUs, Groq LPUs, networking processors)—increases TAM by 33-50% and reduces per-token costs despite higher factory upfront costs. A $50B Vera Rubin data center with 10X throughput advantage justifies premium pricing when amortized across token volume, making token cost, not hardware cost, the relevant economic metric.
2
Agents, not chatbots, drive enterprise AI ROI The shift from generative AI (information) to agentic AI (work completion) represents a 100X+ computation increase in just two years. Enterprises pay for outcomes, not conversation; agents executing tasks with memory systems, tool integration, and code execution unlock productivity gains that justify massive token consumption, with Nvidia seeing 10,000X compute growth and engineering teams consuming $250K+ in tokens annually.
3
Open-source agents reshape computing architecture Open Claude's architecture—with memory, resource management, I/O subsystems, and skills APIs—constitutes the first true personal AI computer, establishing the operating system blueprint for agentic computing. This paradigm shift makes much existing AI regulation moot while raising new governance challenges around agent security, policy, and sensitive access that require industry coordination with policymakers.