What to Do When Qdrant Needs More Capacity

Vertical first: simpler operations, no network overhead, and good for up to 100M vectors per node, depending on dimensions and quantization.

Horizontal when: the data exceeds single-node capacity, you need fault tolerance, you need to isolate tenants, or the workload is IOPS-bound (more nodes = more independent IOPS).

Most basic distributed configuration: 3 nodes and 3 shards with replication, for zero-downtime scaling.

A minimum of 3 nodes is important for consensus and fault tolerance. With 3 nodes, you can lose 1 node without downtime. With 2 nodes, losing 1 node causes downtime for collection operati…
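
The node-count reasoning above can be checked with a little arithmetic. The sketch below assumes a majority-quorum consensus protocol (Qdrant uses Raft for cluster coordination). The function names `quorum_size` and `tolerated_failures` are illustrative helpers, not part of any Qdrant API.

```python
def quorum_size(nodes: int) -> int:
    """Smallest strict majority of a cluster with `nodes` members."""
    return nodes // 2 + 1


def tolerated_failures(nodes: int) -> int:
    """How many nodes can be lost while a majority can still be formed."""
    return nodes - quorum_size(nodes)


# Compare small cluster sizes (assumption: consensus needs a strict majority).
for n in (1, 2, 3, 5):
    print(f"{n} nodes: quorum={quorum_size(n)}, can lose {tolerated_failures(n)}")
```

Under this assumption a 2-node cluster needs both nodes to form a majority, so it tolerates no failures, while a 3-node cluster needs only 2 and can lose 1. This only covers consensus. Whether reads and writes stay available also depends on each shard having a replica on a surviving node.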