# Smart Model Switching

Three-tier Claude routing: Haiku → Sonnet → Opus

Start with the cheapest model. Escalate only when needed. Save 50-90% on API costs.

## The Golden Rule

If a human would need more than 30 seconds of focused thinking, escalate from Haiku to Sonnet. If the task involves architecture, complex tradeoffs, or deep reasoning, escalate to Opus.

## Cost Reality

| Model | Input | Output | Relative Cost |
|-------|-------|--------|---------------|
| Haiku | \$0.25/M | \$1.25/M | 1x (baseline) |
| Sonnet | \$3.00/M | \$15.00/M | 12x |
| Opus | \$15.00/M | \$75.00/M | 60x |

Bottom line: Wro…