About this episode
Beth Lyons and Andy Halliday break down the Gemini 3.1 Pro Preview release, comparing benchmark performance, agentic capability, cost-per-task, and reliability concerns. They discuss Google’s rapid rollout into products like AI Studio and NotebookLM, plus what they’re watching next from DeepSeek and GPT-5.3. The show also covers Apple Podcasts’ move into video, a demo/story around Post-Visit AI in healthcare, and a behind-the-scenes look at the team’s show prep and post-show analysis workflow.Key Points Discussed00:00:18 Opening, hosts, and what’s coming today00:01:04 Gemini 3.1 Pro Preview: benchmark jump and agentic index gap00:18:11 Google ecosystem rollout: AI Studio / NotebookLM and “free” access discussion00:20:25 What’s next: watching DeepSeek + GPT-5.3 / Codex 5.3 chatter00:22:00 Arc AGI-III: interactive benchmark, memory scaffolds, and “AGI” moving goalposts00:26:10 “A couple of little news items”: Apple Podcasts adds video + distro strategy00:35:47 WordPress + Claude integration talk and website experimentation00:37:03 Karl joins to share Post-Visit AI / reverse “AI scribe” healthcare agent00:45:04 Show prep workflow walkthrough (how they prep and what they share)00:49:11 Post-show analysis workflow: capturing comments, diarization, weekly follow-up00:56:26 Karl’s tool notes: Codex vs “Work max” experience building an iPhone app00:58:39 Wrap-up, reminders, and sign-offThe Daily AI Show Co Hosts: Beth Lyons, Andy Halliday, Karl Yeh