Back to Models
PikaPika

Pika | v2.1 | Text to Video

Video
Text to Video
Enhance / Upscale
Video to Video

This AI turns natural language prompts and images into cinematic short videos with smooth motion and strong visual coherence. Built on latent diffusion, it iteratively denoises a prompt‑conditioned latent space to control camera moves, scene dynamics, and subtle object motion. It supports motion prompts like slow pans and push-ins, generates up to 1080p at 24–30 fps, and excels at 4–10 second clips ideal for social media and marketing. Best results come from concise, descriptive prompts and high-quality source images. Start with subtle motion, iterate on phrasing, and layer short clips for longer sequences. Expect occasional artifacts with large or complex movements.

Motion Prompts
Temporal Coherence
Subtle Parallax
Pika | v2.1 | Text to Video

Output Example

Used Prompt

A young woman with dark pink hair, dressed modestly in a long coat and scarf, walks slowly along a quiet sunlit road while holding a small suitcase in her right hand. The warm breeze moves strands of her hair and her coat slightly as she takes calm, steady steps. The camera follows from a slight angle behind her, capturing the gentle motion of her walk and the soft light reflecting off her pink hair. The scene feels cinematic, serene, and realistic, evoking a sense of quiet departure and peaceful determination.