Multi-Region RAG Three drivers for multi-region: (1) latency — cut cross-ocean RTT to your users; (2) sovereignty — data must stay in EU / US-only / India; (3) resilience — survive a full region outage. Each implies different architecture. Reference Architecture Control plane (auth, tenant config, billing) can stay single-region behind the gateway; replicate only the read paths. Geo-Replicated Vector Stores Pinecone - Serverless : single-region per index today; you create one index per region and replicate from the writer region. - Pods : multi-AZ in-region; multi-region is still an app-layer…