# Exa Performance Tuning

## Overview

Optimize Exa search API response times for production workloads. Key levers: search type selection (instant < fast < auto < neural < deep), result count reduction, content scope control, result caching, and parallel query execution.

## Latency by Search Type

| Type | Typical Latency | Use Case |
|------|----------------|----------|
| `instant` | < 150ms | Real-time autocomplete, typeahead |
| `fast` | p50 < 425ms | Speed-critical user-facing search |
| `auto` | 300-1500ms | General purpose (default) |
| `neural` | 500-2000ms | Best semantic quality |
| `deep` | 2-5s | Maximum coverage, light deep search…
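
As a rough illustration of the search type lever, the sketch below picks a type from a latency budget using the upper ends of the figures in the table above. The helper name `pick_search_type` and the `TYPICAL_LATENCY_MS` table are hypothetical and not part of any Exa SDK. The sketch also assumes that slower types return richer results, so the slowest type that fits the budget is preferred. Note that the `fast` figure is a p50 rather than a worst case.

```python
# Upper ends of the typical latencies from the table above, in milliseconds.
# The "fast" value is a p50, not a worst case, so treat it as optimistic.
TYPICAL_LATENCY_MS = [
    ("instant", 150),
    ("fast", 425),
    ("auto", 1500),
    ("neural", 2000),
    ("deep", 5000),
]


def pick_search_type(budget_ms: float) -> str:
    """Return the slowest search type whose typical latency fits budget_ms.

    Assumption: slower types give richer results, so the slowest type that
    still fits is preferred. Falls back to "instant" when nothing fits.
    """
    chosen = "instant"
    for name, latency_ms in TYPICAL_LATENCY_MS:
        if latency_ms <= budget_ms:
            chosen = name
    return chosen


if __name__ == "__main__":
    for budget in (100, 500, 1500, 3000):
        print(budget, pick_search_type(budget))
```

The returned string is meant to be passed as the search type of whatever client call you already use. Measure real latencies in your own environment before relying on these thresholds.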