Ollama Stack

Deploy a local LLM stack for offline and privacy-first workflows.

When to Use This Skill

Use this skill when:

- Setting up private/local LLM inference for development
- Building air-gapped AI environments
- Running models on personal hardware (Mac, Linux, Windows with GPU)
- Creating team-shared inference endpoints
- Prototyping before committing to cloud LLM APIs

Prerequisites

- 8 GB+ RAM (16 GB+ recommended for 7B+ models)
- For GPU acceleration: NVIDIA GPU with 6 GB+ VRAM, or Apple Silicon Mac
- Docker (for containerized deployment)
- 20 GB+ disk for model storage

Quick Start…
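
Before starting, it can help to confirm that the machine meets the prerequisites listed above. The following is a minimal preflight sketch, not part of Ollama itself: the function names `total_ram_gb` and `preflight` are hypothetical, the thresholds are copied from the Prerequisites list, and the GPU check only looks for `nvidia-smi` on the `PATH` (it does not measure VRAM or detect Apple Silicon).

```python
import os
import shutil

# Thresholds taken from the Prerequisites list above.
MIN_RAM_GB = 8
MIN_DISK_GB = 20


def total_ram_gb():
    """Return total physical RAM in GiB, or None if it cannot be determined.

    Uses os.sysconf, which is unavailable on Windows and may lack these
    names on some platforms, hence the guard.
    """
    try:
        page_size = os.sysconf("SC_PAGE_SIZE")
        page_count = os.sysconf("SC_PHYS_PAGES")
    except (AttributeError, ValueError, OSError):
        return None
    return page_size * page_count / 1024**3


def preflight(model_dir="."):
    """Check the prerequisites; model_dir is where models would be stored.

    Returns a dict mapping check name to True, False, or None (unknown).
    """
    ram = total_ram_gb()
    free_disk_gb = shutil.disk_usage(model_dir).free / 1024**3
    return {
        "ram_8gb": None if ram is None else ram >= MIN_RAM_GB,
        "disk_20gb_free": free_disk_gb >= MIN_DISK_GB,
        "docker_on_path": shutil.which("docker") is not None,
        # Only meaningful for NVIDIA setups; Apple Silicon is not detected here.
        "nvidia_smi_on_path": shutil.which("nvidia-smi") is not None,
    }


if __name__ == "__main__":
    for name, ok in preflight().items():
        status = "unknown" if ok is None else ("ok" if ok else "missing")
        print(f"{name}: {status}")
```

A failed `docker_on_path` or `nvidia_smi_on_path` check is not necessarily a blocker: Docker is only needed for containerized deployment, and GPU acceleration is optional.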