When to Use

- User wants to transcribe an audio file to text
- User provides an audio file path and asks for transcription
- User says "转录" (transcribe), "识别" (recognize), "transcribe", or "语音转文字" (speech to text)

When NOT to Use

- User wants to synthesize speech from text (use )
- User wants to create a podcast or explainer (use or )

Purpose

Transcribe audio files to text using , which runs fully offline via local speech recognition models. No API key required. Supports Chinese, English, Japanese, Korean, and Cantonese (sensevoice model) or English-only (whisper model).

Run for current CLI options and supported flags.

Hard Constraints…