Together AI Production Checklist Overview Together AI provides OpenAI-compatible inference across 100+ open-source models (Llama, Mixtral, Qwen, FLUX) plus fine-tuning and batch processing. A production integration routes completions, embeddings, or image generation through Together's API. Failures mean inference latency spikes, model availability gaps, or unexpected cost overruns from uncontrolled batch jobs. Authentication & Secrets - [ ] stored in secrets manager (not source code) - [ ] API key restricted to production workspace - [ ] Key rotation schedule documented (90-day cycle) - [ ] S…