Faster Whisper

Local speech-to-text using faster-whisper, a CTranslate2 reimplementation of OpenAI's Whisper that runs up to 4-6x faster with the same accuracy. With GPU acceleration, expect roughly 20x realtime transcription (a 10-minute audio file in about 30 seconds).

When to Use

Use this skill when you need to:

- Transcribe audio/video files: meetings, interviews, podcasts, lectures, YouTube videos
- Convert speech to text locally: no API costs, works offline (after model download)
- Batch process multiple audio files: efficient for large collections
- Generate subtitles/captions: word-level timestamps…