Dec. 16, 2025

A Startup Beats Google, Power Users Break Away, and AI Gets Regulated

A six-person startup just beat Google on one of the hardest reasoning benchmarks — using Google’s own model. And inside companies, the top 5% of AI users are quietly gaining the equivalent of an extra workday every week.

In this episode, Jeff and Annie break down Poetiq’s ARC-AGI-2 win and why meta-systems — critique, refine, verify — may now matter more than picking the “best” model. They unpack OpenAI’s first State of Enterprise AI report, including the widening productivity gap between casual users and power users. Finally, they run through Quick Hits on chips, regulation, XR glasses, factuality benchmarks, the emerging AI licensing economy, and a major shift in how algorithmic bias could be judged.

🔍 In this episode:

  • Poetiq’s orchestration layer beats Gemini Deep Think on ARC-AGI-2
  • OpenAI data reveals the productivity chasm between median and power users
  • RSL 1.0, agent standards, and regulation reshape the emerging “AI internet economy”

🎧 Watch the full episode of What the AI?! on YouTube, Spotify, or Apple Podcasts → https://www.whattheai.fm

Relevant links: