The Most Intuitive AI Coding Agent - Code faster, smarter, and stay in the flow.
Joined May 2024
- One model's blind spot is another model's easy catch. Errors that survive one model's review rarely survive three. Claude writes, a second model reviews, a third writes the tests. Multi-model verification is not about trusting any single model. It is about catching what any
- Two workflows, same task. Opus plans then Opus implements, vs Opus plans then M3 implements. Opus-only was a touch faster but cost almost 3x more. The Opus-plus-M3 combo saved about $2 per task and ran only ~3 min slower. Let a frontier model plan and a cheaper one build. Watch
- 57% of the top GitHub repos have no AGENTS.md. No context file means every AI session starts cold. No memory of the team's patterns, the architecture decisions, the things the agent should never touch. Repo Grokking closes that gap. But the AGENTS.md is still the handshake
- Same task, run spec-first this time. The cost spread is wild. M3 finished in 28 min for under $1. Opus was faster but cost about 10x more. GPT-5.5 was the quickest of the three, still far pricier than M3. Spec-driven dev changes which model makes sense. Watch the run.
- The biggest failure mode in AI-assisted engineering is not hallucination. It is context loss. An agent that does not know your codebase makes locally reasonable decisions that break the system globally. It passes the unit test and fails in production. Loading the whole codebase
- Same task: add dark mode to a website. Three models, three bills. All three shipped working code. Opus 4.8 was fastest. M3 was ~3x slower but cost half as much. GPT-5.5 ran ~3x the price of M3. Fastest is not always cheapest. Watch the run.
- Everyone can build an agent now. Securing it and shipping it to production is where the real work starts. Spec it, review every diff, keep it inside your repo's context. Build is the demo. Ship is the job.
- One task: add dark mode to a website. Three models, three very different bills. MiniMax M3 · $0.768, Opus 4.8 · $1.31, GPT-5.5 · $2.24. All three shipped working code. M3 came in at about a third of the cost of GPT-5.5 (slower, but it got there). Pick the model for the job, not the hype.
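The cost ratios behind the dark-mode comparison can be checked with a few lines of arithmetic. This is a minimal sketch that uses only the three per-run prices quoted in the post; the variable names are illustrative, and run times are left out because the posts give no exact figures for them.

```python
# Per-run prices quoted in the post for the dark-mode task (USD).
costs = {"MiniMax M3": 0.768, "Opus 4.8": 1.31, "GPT-5.5": 2.24}

# Use the cheapest model as the baseline for the comparison.
cheapest = min(costs, key=costs.get)
baseline = costs[cheapest]

# Express every model's price as a multiple of the baseline.
multiples = {name: cost / baseline for name, cost in costs.items()}

# Share of the GPT-5.5 bill that an M3 run costs.
m3_share_of_gpt = costs["MiniMax M3"] / costs["GPT-5.5"]

for name, cost in sorted(costs.items(), key=lambda kv: kv[1]):
    print(f"{name}: ${cost:.3f} ({multiples[name]:.2f}x {cheapest})")
print(f"M3 as a share of GPT-5.5: {m3_share_of_gpt:.0%}")
```

On these prices, Opus 4.8 comes out at roughly 1.71x the M3 bill and GPT-5.5 at roughly 2.92x, so an M3 run costs about 34% of a GPT-5.5 run.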