Hugging Face Local Models Search the Hugging Face Hub for llama.cpp-compatible GGUF repos, choose the right quant, and launch the model with or . Default Workflow 1. Search the Hub with . 2. Open . 3. Prefer the exact HF local-app snippet and quant recommendation when it is visible. 4. Confirm exact filenames with . 5. Launch with or . 6. Fall back to plus when the repo uses custom file naming. 7. Convert from Transformers weights only if the repo does not already expose GGUF files. Quick Start Install llama.cpp Authenticate for gated repos Search the Hub Run directly from the Hub Run an exac…