Small (1B–7B): edge, mobile, low-resource
- Phi-4-Mini (Microsoft): best reasoning per parameter
- Gemma 3 2B/7B (Google): strong efficiency
- Qwen3.5-3B/7B: excellent multilingual

Medium (8B–30B): balanced production use
- Llama 4 8B: general-purpose workhorse
- Qwen3.5-14B: coding + math
- Mistral Small: multilingual, tool use

Large (70B+): max-capability open models
- Llama 4 405B: frontier open model
- DeepSeek-V3.2 (MoE, 671B total, 37B active): math/reasoning
- Qwen3.5-72B: top open coding/math

Coding Specialists:
- Qwen2.5-Coder-32B: #1 open coding
- DeepSeek-Coder-V2: MoE coding pow…