AI Workflow · Module 9
Three-Tier Model Selection
"There is no best AI model. There is only the right model for the job."
Most developers pick an AI model when they first set up their tools — and then never change it. They use the same model for formatting fixes and for architecting database schemas. The same model for generating JSDoc comments and for reviewing security-critical authentication code.
This is like choosing between a scalpel and a sledgehammer and then using one for everything. Both are tools. Neither is universally right.
The choice matters because of the three-way trade-off that defines every AI model: reasoning depth, response speed, and cost. Optimizing all three at once is impossible. Choosing which ones to optimize for each task is the skill.
The Three Tiers
| Model | Provider | Strengths |
|---|---|---|
| Claude Opus 4.6 | Anthropic | Long-context reasoning, nuanced analysis, security review |
| GPT-4o | OpenAI | Multimodal, strong code generation, broad capability |
| Gemini 2.5 Pro | 1M+ context window, deep reasoning on large codebases |
| Model | Provider | Strengths |
|---|---|---|
| Claude Sonnet 4.6 | Anthropic | Fast, strong code quality, excellent instruction following |
| Gemini 2.0 Flash | Extremely fast, good for real-time interactive coding |
| Model | Provider | Strengths |
|---|---|---|
| Claude Haiku 4.5 | Anthropic | Ultra-fast, excellent at pattern tasks, very low cost |
| Gemini 2.0 Flash Lite | Near-instant responses, ideal for autocomplete and formatting |
The Decision Framework: 2 Questions
Before picking a model, answer these two questions:
Repetitive, known patterns, no reasoning required
Standard dev work, logical reasoning needed
Multi-system reasoning, novel problems, no clear pattern
Bug here = inconvenience, easily corrected
Bug here = user impact, needs proper fix
Security, payments, data integrity — regardless of perceived simplicity
The high stakes rule is absolute: even simple changes in critical code use a Titan. A one-line change to authentication logic that looks straightforward to a Sprinter might carry a subtle security implication that only a Titan's deeper reasoning catches.
Phase-Based Allocation: Matching Models to Project Stages
For larger features, optimize model choice by phase of work:
The Progressive Escalation Rule
Don't start with a Titan for everything. Use the escalation ladder:
This "trust but verify" approach uses Titan-level intelligence only where it's genuinely needed — which is roughly 20% of tasks. The 80% that Workhorse handles correctly? You've just cut your AI costs significantly without any quality loss.
Next in AI Workflow
Part 10 — Team AI
Individual AI workflows don't scale. How to build a shared standard across your entire engineering team — so everyone gets the benefit, not just the developers who figured it out themselves.