Model Routing System How Routing Decisions Are Made Every user prompt goes through a 9-tier classification pipeline before any AI model processes it. The system answers three questions: 1. Which model should handle this? — 9-tier cost/complexity classification 2. Is the classifier itself working? — Cascading fallback (qwen3 → kimi → deepseek → cache) 3. Can we verify the result? — Tool-level fallback + auto-evaluation The Pipeline 9-Tier Routing Table | Tier | Model | Input (per M) | Output (per M) | Handles | |------|-------|---------------|----------------|---------| | 0 | Qwen3 (local) | $…