# Exa Architecture Variants

## Overview

Three deployment architectures for Exa neural search at different scales. Each uses real Exa SDK methods.

## Decision Matrix

| Factor | Direct Search | Cached Search | RAG Pipeline |
|--------|---------------|---------------|--------------|
| Volume | < 1K/day | 1K-50K/day | Any volume |
| Latency | 500-2000ms | 50ms (cached) | 3-8s total |
| Use Case | Simple search UI | Content aggregation | AI answers with citations |
| Complexity | Low | Medium | High |
| Cache Required | No | Yes (Redis/LRU) | Yes |
| Exa Methods | | + cache | All methods |

In…