Universal Runtime Skills Best practices and code review checklists for the Universal Runtime - LlamaFarm's local ML inference server. Overview The Universal Runtime provides OpenAI-compatible endpoints for HuggingFace models: - Text generation (Causal LMs: GPT, Llama, Mistral, Qwen) - Text embeddings (BERT, sentence-transformers, ModernBERT) - Classification, NER, and reranking - OCR and document understanding - Anomaly detection Directory : Python : 3.11+ Key Dependencies : PyTorch, Transformers, FastAPI, llama-cpp-python Links to Shared Skills This skill extends the shared Python practices.…