# Groq Reference Architecture

## Overview

Production architecture for applications built on Groq's LPU inference API. Covers model routing by latency requirements, streaming pipelines, multi-provider fallback, and the middleware layer that ties it together.

## Architecture Diagram

## Project Structure

## Instructions

### Step 1: Model Registry

### Step 2: Model Router

### Step 3: Middleware Layer

### Step 4: Fallback Chain

### Step 5: Streaming Pipeline

## Integration Patterns

| Pattern | When to Use | Groq Feature |
|---------|-------------|-------------|
| Direct completion | Simple request/response | |
| Streaming SSE | Real-…