# Perplexity Architecture Variants

## Overview

Three validated architectures for Perplexity Sonar API at different scales. Each builds on the previous, adding caching and orchestration as volume grows.

## Decision Matrix

| Factor | Direct Widget | Cached Layer | Research Pipeline |
|--------|--------------|--------------|-------------------|
| Volume | <500/day | 500-5K/day | 5K+/day |
| Latency (p50) | 2-5s | 50ms (cached) / 2-5s (miss) | 10-30s |
| Model | | + cache | + |
| Monthly Cost | <$150 | $50-$300 | $300+ |
| Complexity | Minimal | Moderate | High |

## Instructions

### Variant 1: Direct Search Wid…