Back to Models

Mochi-1

Text to Video
Style Transfer

Mochi-1 is a text-to-video generator that turns descriptive prompts into smooth MP4 clips with customizable length, frame rate, and style. You control frames (30–170), FPS (10–60), guidance scale (1–10), and image-vs-text influence to balance fidelity and creativity. For best results, use cinematic FPS (24–30), 30–70 frames for short clips or 100–150 for longer sequences, guidance 3–5 for natural outputs, and an image prompt strength of 0.3–0.5. Set a fixed seed for reproducibility. Expect iteration: ambiguous prompts or extreme settings can reduce coherence. Mochi-1 is ideal for storytelling, marketing visuals, prototypes, education content, and social media clips.

Text-Driven Video
Adjustable FPS Control
Seed Reproducibility
Mochi-1

Output Example

Used Prompt

Close-up of a chameleon's eye, with its scaly skin changing color. Ultra high resolution 4k.