Exa Load & Scale Overview Load testing and capacity planning for Exa integrations. Key constraint: Exa's default rate limit is 10 QPS. Scaling strategies focus on caching, request queuing, parallel processing within rate limits, and search type selection for latency budgets. Prerequisites - k6 load testing tool installed - Test environment Exa API key (separate from production) - Redis for result caching Capacity Reference | Search Type | Typical Latency | Max Throughput (10 QPS) | |-------------|----------------|-------------------------| | | < 150ms | 10 req/s (600/min) | | | < 425ms | 10 r…