HyperFrames Media Preprocessing Three CLI commands that produce assets for compositions: (speech), (timestamps), and (transparent video). Each downloads a model on first run and caches it under . Drop the output into the project, then reference it from the composition HTML — see the skill for the audio/video element conventions. Text-to-Speech ( ) Generate speech audio locally with Kokoro-82M. No API key. Voice Selection Match voice to content. Default is . | Content type | Voice | Why | | ----------------- | --------------------- | ----------------------------- | | Product demo | / | Warm, p…