
Mochi-1

Mochi-1 is a text-to-video generator that turns descriptive prompts into smooth MP4 clips with customizable length, frame rate, and style. You control the frame count (30–170), frame rate (10–60 FPS), guidance scale (1–10), and image prompt strength, which sets how much an image prompt influences the result relative to the text. Clip length is the frame count divided by the frame rate, so 120 frames at 24 FPS yields a 5-second clip. For best results, use a cinematic frame rate (24–30 FPS), 30–70 frames for short clips or 100–150 for longer sequences, a guidance scale of 3–5 for natural-looking output, and an image prompt strength of 0.3–0.5. Set a fixed seed for reproducibility. Expect to iterate: ambiguous prompts or extreme settings can reduce coherence. Mochi-1 is well suited to storytelling, marketing visuals, prototypes, educational content, and social media clips.
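As a rough illustration of how these settings relate, the Python sketch below checks values against the ranges quoted above and derives the clip length. The class name `MochiSettings`, its field names, and the defaults are hypothetical and are not part of any Mochi-1 API. Image prompt strength is left out because its full valid range is not stated here.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Ranges quoted in the description above.
FRAMES_RANGE = (30, 170)
FPS_RANGE = (10, 60)
GUIDANCE_RANGE = (1.0, 10.0)


def _check(name: str, value: float, bounds: Tuple[float, float]) -> None:
    """Raise ValueError if value falls outside the inclusive bounds."""
    low, high = bounds
    if not low <= value <= high:
        raise ValueError(f"{name}={value} is outside the range {low}-{high}")


@dataclass(frozen=True)
class MochiSettings:
    """Hypothetical settings container; names and defaults are illustrative."""
    prompt: str
    frames: int = 60
    fps: int = 24
    guidance_scale: float = 4.0
    seed: Optional[int] = None  # set a fixed value for reproducible output

    def __post_init__(self) -> None:
        # Validate every numeric setting as soon as the object is created.
        _check("frames", self.frames, FRAMES_RANGE)
        _check("fps", self.fps, FPS_RANGE)
        _check("guidance_scale", self.guidance_scale, GUIDANCE_RANGE)

    @property
    def duration_seconds(self) -> float:
        # Clip length is the frame count divided by the frame rate.
        return self.frames / self.fps


settings = MochiSettings(
    prompt="Close-up of a chameleon's eye, with its scaly skin changing color.",
    frames=120,
    fps=24,
    guidance_scale=4.0,
    seed=42,
)
print(settings.duration_seconds)  # 5.0
```

Validating at construction time means an out-of-range value, such as 200 frames, fails before any generation request is made.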

Parameter-Controlled Text-to-Video
Cinematic Composition
Consistent MP4 Output

Output Example

Used Prompt

Close-up of a chameleon's eye, with its scaly skin changing color. Ultra high resolution 4k.