CoreWeave Production Checklist Inference Services - [ ] GPU type and count validated for model size - [ ] Autoscaling configured (KServe or HPA) - [ ] Health and readiness probes set - [ ] Resource requests AND limits specified - [ ] Node affinity targeting correct GPU class - [ ] for production (no cold starts) Storage - [ ] Model weights in PVC (not downloaded at startup) - [ ] Checkpoints saved to persistent storage - [ ] Storage class appropriate (SSD for inference, HDD for archival) Security - [ ] Secrets for model tokens and registry access - [ ] Network policies applied - [ ] Container…