News from September 2025

Video

September 30, 2025 • Sam Witteveen

A concise walkthrough of Anthropic’s Sonnet 4.5 release, arguing it’s part of a broader “virtual collaborator” push, with improvements in coding performance, agentic capabilities (Claude Agent SDK), computer use, and long-running, plan‑driven workflows.

Sonnet 4.5 is the best coding model in the world

Video

September 30, 2025 • Theo - t3.gg

Theo reviews Anthropic’s Claude Sonnet 4.5, benchmarking its coding, agentic and safety improvements versus prior Claude and GPT models, and concludes it’s the nicest model to code with despite weaker UI generation.

Is Responsible AI Possible? with Dr. Sarah Bird

Video

September 29, 2025 • Syntax

Scott from Syntax interviews Dr. Sarah Bird at Microsoft about what responsible AI means for developers—covering privacy, security-by-default, human-in-the-loop oversight, local vs. cloud models, risks (malfunction, misuse, systemic), and how developers can build trustworthy AI.

Claude made its own language (it took 3 months)

Video

September 28, 2025 • Theo - t3.gg

Theo explores "cursed lang," a Gen Z–slang Golang-style language auto-generated by running Claude in a loop for three months, testing features like emoji identifiers, debugging builds, and reflecting on AI-driven development.

NVIDIA DGX Spark…What Went Wrong

Video

September 26, 2025 • Alex Ziskind

Alex Ziskind examines NVIDIA’s delayed DGX Spark—its promised GB10 Grace Blackwell architecture, FP4/FP8 advantages, clustering and CUDA ecosystem benefits—asking whether the still-unreleased $4,000 mini AI supercomputer is worth the wait.

The real reason Claude got dumber

Video

September 26, 2025 • Theo - t3.gg

Theo unpacks Anthropic’s postmortem on three overlapping infrastructure bugs (routing to 1M-context servers, output corruption on TPUs, and an XLA top‑K issue) that degraded Claude’s quality in August–September, and covers the fixes and process changes that followed.

Qwen just keeps winning, Qwen 3 Max is incredible

Video

September 26, 2025 • GosuCoder

GosuCoder reviews Qwen 3 Max’s latest update, showing strong coding performance, tool-calling accuracy, faster speeds, and improved UI output that challenges Anthropic’s Claude Sonnet on price and capability.

Hosting MCP-UI Servers on TanStack Start

Video

September 22, 2025 • Jack Herrington

Jack Herrington demonstrates building a simple MCP UI client in TanStack Start that renders external MCP server iframe responses securely using the MCP UI resource format and React renderer.

These might be my new favorite models (don’t sleep on Grok 4 Fast)

Video

September 21, 2025 • Theo - t3.gg

Theo explains why Grok 4 Fast is a surprisingly strong, fast, and ultra‑cost‑efficient successor to Grok 4, outperforming it on several benchmarks while being far cheaper and more practical to use.

GPT 5 - Codex is a fantastic AI model for coding, but...

Video

September 19, 2025 • GosuCoder

GosuCoder reviews GPT‑5 Codex, finding it produces solid coding/refactoring results but is often 2–3× slower than GPT‑5, recommending Codex Low for practical use.

We Tier Ranked Every AI Coding Assistant

Video

September 19, 2025 • Syntax

Syntax ranks today’s major AI coding assistants (Cursor, VS Code + Copilot, Windsurf, Kiro, Claude Code, OpenCode, Gemini, Qwen, Codex, v0, Bolt.new, Replit, Lovable, ChatGPT, and Warp), debating strengths, UX, planning modes, and real‑world reliability.

OpenAI dropped a new model built for developers

Video

September 15, 2025 • Theo - t3.gg

Theo reviews OpenAI’s new GPT-5 Codex model aimed at agentic coding, praising its token efficiency and code-review strengths while critiquing the Codex tooling’s search and UX rough edges.

I Switched! Claude Code to Cursor CLI

Video

September 15, 2025 • Jack Herrington

Jack Herrington explains why he’s switching from Claude Code to Cursor CLI, demonstrating installation, model switching, MCP integration, CLI vs desktop features, non‑interactive prompts, and practical coding workflows with caveats about beta glitches.

My AI Developer Workflow (September 2025)

Video

September 15, 2025 • Alex so yes

Alex shares his daily AI-driven developer stack and workflow—tools, optimal configuration (memory bank, rules, templates), and a live demo fixing a bug and adding a feature—showing how to plan then build to ship clean, high-quality code faster.

I tested OpenCode with the new Grok ... (incredible)

Video

September 9, 2025 • Alex so yes

Alex so yes tests OpenCode with xAI’s new Grok code model to debug a real AIDD project issue, explore the AGENTS.md standard, and show how terminal-first, provider-agnostic workflows can speed up coding while reducing IDE lock-in.

Predicting OpenAI's future via their acquisitions

Video

September 9, 2025 • Theo - t3.gg

Theo explains why OpenAI’s recent acquisitions (like Statsig and Alex) are really about importing aligned, ready‑to‑execute teams and product leadership to accelerate apps and developer tools without disrupting core research.

How to Vibecode an X Clone using Claude Code and Convex

Video

September 8, 2025 • Convex

Convex shows how to rapidly build and deploy a Twitter/X clone using Claude Code and Convex’s end-to-end type-safe queries, mutations, actions, and storage—covering auth, profiles, image uploads, tweeting, likes, retweets, bookmarks, DMs, notifications, and deployment—all in about 20–40 minutes.

Kimi K2 got a massive upgrade, possibly the best open source coding model now?

Video

September 6, 2025 • GosuCoder

This video reviews Kimi K2’s 0905 update—doubling context to 262K, improving frontend/code generation, and weighing speed, reliability, and prompt-caching costs versus providers like OpenRouter and Groq—with hands-on demos and cost/latency analysis.

BREAKING: Anthropic announces massive investment (50% of OpenAI???)

Video

September 4, 2025 • Theo - t3.gg

Analysis of Anthropic’s $13B raise at a $183B post-money valuation, comparing dilution, investor strategy, compute costs, and competitive dynamics with OpenAI and Google.

MCP-UI + TanStack = Next Gen Web

Video

September 3, 2025 • Jack Herrington

A walkthrough showing how MCP-UI enables tools to return full UI (HTML/JS/iframes) and how to build an MCP-UI server with TanStack Start, test it with the MCP-UI Inspector, and render an iframe card in Goose.

Qwen 3 235B on the Framework Desktop for Coding?

Video

September 3, 2025 • GosuCoder

A hands-on look at running local MoE coding models like Qwen 3 235B on the Framework Desktop, comparing TPS and context trade-offs, the limits of agentic workflows, and a prototype local code editor that applies AI-generated diffs directly to files.

The "right way" to vibe code (engineers, please watch)

Video

September 1, 2025 • Theo - t3.gg

Theo argues that vibe coding is valuable for quickly producing throwaway or legacy-bound code while emphasizing that you still must understand how code works; AI should augment skilled engineers, not replace them.

Jacky THIERRY