Model Serving Endpoints FIRST : Use the parent skill for CLI basics, authentication, and profile selection. Model Serving provides managed endpoints for serving LLMs, custom ML models, and external models as scalable REST APIs. Endpoints are identified by name (unique per workspace). Endpoint Types | Type | When to Use | Key Detail | |------|-------------|------------| | Pay-per-token | Foundation Model APIs (Llama, DBRX, etc.) | Uses catalog models, simplest setup | | Provisioned throughput | Dedicated GPU capacity | Guaranteed throughput, higher cost | | Custom model | Your own MLflow model…