OpenRouter Load Balancing Overview A single OpenRouter API key has rate limits (requests/minute and tokens/minute). To scale beyond those limits, distribute requests across multiple keys. OpenRouter also provides server-side load balancing via provider routing and the variant for low-latency inference. This skill covers multi-key rotation, health-based routing, circuit breakers, and concurrent request patterns. Multi-Key Round Robin Concurrent Request Processing Provider-Level Load Balancing Rate Limit Awareness Error Handling | Error | Cause | Fix | |-------|-------|-----| | 429 on all keys…