Try Kling O1 - Image to Video in the Workbench
Run this model interactively, tune parameters, and compare outputs.
kling-video-o1-image-to-video
Kling O1 - Image to Video is a diffusion-based multimodal video generation model designed for turning images (optionally combined with text and other visual references) into short, coherent video clips.
It excels in image-to-video generation where a single image or a small set of reference images is transformed into cinematic clips with stable characters, controlled motion, and consistent environments, while respecting prompts for style, mood, and camera movement.
Some other noteworthy features of Kling O1 - Image to Video include support for multiple image references (up to 7 in some workflows) to refine character identity and scene details, and controls such as start/end frame specification and motion/camera behavior guidance to achieve smoother, more intentional transitions and storytelling shots.
| Metric | Value |
|---|---|
| Parameter Count | Unknown |
| Mixture of Experts | Unknown |
| Context Length | Unknown |
| Multilingual | No |
| Quantized* | Unknown |
Example request
- Sync
- Async
- Async with SSE
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
- Minimal
- Basic parameters
- All parameters
Fetch model details
The models endpoint returns the full model object, including itsjson_request_schema.
Request parameters
Required parameters
| Field | Type | Default | Description |
|---|---|---|---|
prompt | string | "Zoom into the ox, while it is walking forward on the road, change the season from winter to summer as you zoom in, cinematic" | Text description of what you want to generate, or the instruction on how to edit the given image. |
input_image | string | "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/ox_zoom_out_1926_1076.png" | Image to use as reference. Must be jpeg, png, gif, or webp. Format: uri. |
Optional parameters
| Field | Type | Default | Description |
|---|---|---|---|
tail_image_url | string | — | Image to use as reference. Must be jpeg, png, gif, or webp. Format: uri. |
duration | integer | 5 | Video duration in seconds One of: 5, 10. |