When to Use - User wants to convert text to spoken audio - User asks for "read aloud", "TTS", "text to speech", "voice narration" - User says "朗读", "配音", "语音合成" - User wants multi-speaker scripted audio or dialogue When NOT to Use - User wants a podcast-style discussion with topic exploration (use ) - User wants an explainer video with visuals (use ) - User wants to generate an image (use ) Purpose Convert text into natural-sounding speech audio. Two paths: 1. Quick mode ( ): Single voice, low-latency, sync. For casual chat, reading snippets, instant audio. 2. Script mode ( ): Multi-speaker,…