Kling AI

Kling O1 | Reference Image to Video

Video
Image to Video
Animate Photo
Text to Video

Turn static images into short, cinematic videos while keeping characters, products, and key details consistent across every frame. This reference-driven image-to-video tool lets you supply multiple element images (a front view plus additional angles), style references, and an optional start frame, then control motion and look with a text prompt. Describe camera moves (pan, dolly, orbit), lighting, and scene composition in the prompt to get smooth, film-like results. Start with a 5-second HD test, refine the prompt, then scale up to 10 seconds or a higher resolution. Ideal for multi-character storytelling, brand-consistent product shots, and fast previsualization, where identity stability and visual continuity matter.

Multi-Reference Consistency
Image To Video

Output Example

Used Prompt

Take @Image1 as the start frame. Begin with a high-angle aerial shot of the full luxury yacht cruising over crystal-clear turquoise water. The camera gently descends toward the vessel, gliding smoothly along its sunlit decks and polished railings. As it reaches the main walkway, transition into @Element1, revealing the woman standing on the side deck with her back turned, looking out over the ocean.
The camera continues forward in a seamless movement until the woman slowly turns her head toward the camera. Match the style, lighting, color palette, and overall aesthetic of @Image2 as her face comes into full view.
Maintain fluid momentum as the camera transitions into a gradual zoom-out, revealing @Element2, the vintage Polaroid camera she is holding. End the sequence with the camera stabilized in a cinematic, softly lit final composition, capturing the elegance of the woman, the yacht scenery, and the warm golden-hour atmosphere.
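
The prompt above refers to its inputs through placeholders such as @Image1 and @Element2. Before submitting a long prompt, it can help to confirm that every placeholder has a matching upload. The Python sketch below is one way to do that locally. It is not part of any Kling AI API: the function names are hypothetical, and it assumes placeholders are numbered from 1 in upload order, which the page does not state explicitly.

```python
import re

# Placeholder syntax as it appears in the example prompt: @Image<N> or @Element<N>.
PLACEHOLDER = re.compile(r"@(Image|Element)(\d+)")


def find_placeholders(prompt: str) -> list[str]:
    """Return distinct placeholders in order of first appearance."""
    seen: list[str] = []
    for match in PLACEHOLDER.finditer(prompt):
        token = match.group(0)
        if token not in seen:
            seen.append(token)
    return seen


def check_references(prompt: str, num_images: int, num_elements: int) -> list[str]:
    """Return placeholders with no matching upload.

    Assumption (not confirmed by the source): numbering starts at 1,
    so @Image2 requires at least two uploaded images.
    """
    limits = {"Image": num_images, "Element": num_elements}
    missing: list[str] = []
    for match in PLACEHOLDER.finditer(prompt):
        kind, index = match.group(1), int(match.group(2))
        in_range = 1 <= index <= limits[kind]
        if not in_range and match.group(0) not in missing:
            missing.append(match.group(0))
    return missing


if __name__ == "__main__":
    prompt = (
        "Take @Image1 as the start frame. Transition into @Element1. "
        "Match the style of @Image2. Zoom out, revealing @Element2."
    )
    print(find_placeholders(prompt))
    print(check_references(prompt, num_images=1, num_elements=2))
```

With two images and two elements uploaded, `check_references` returns an empty list for the example prompt; with only one image it reports `@Image2` as unresolved.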