Try Kling O3 4K - Reference to Video in the Workbench
Run this model interactively, tune parameters, and compare outputs.
kling-video-o3-4k-reference-to-video
Part of the Kling 3.0/o3 family on fal.ai, this 4K reference-to-video model generates native 4K (3840x2160) video from reference images and text prompts with no upscaling step. It excels at preserving identity, layout, and stylistic detail while adding cinematic motion, camera movement, and scene progression. Supports element and style references (combined limit of 7), optional start and end frame anchoring, multi-shot generation up to 15 seconds, and optional native synchronized audio in English and Chinese. Optimized for advertising, branded content, and production-grade scene work where 4K detail is required.
Example request
- Sync
- Async
- Async with SSE
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
- Minimal
- Basic parameters
- All parameters
Fetch model details
The models endpoint returns the full model object, including itsjson_request_schema.
Request parameters
Optional parameters
| Field | Type | Default | Description |
|---|---|---|---|
start_image_url | string | — | The first frame of the video. The model will try to extend the contents of this frame. Format: uri. |
tail_image_url | string | — | The last frame of the video. Requires start frame to be configured. The model will try to fill in between the frames. Format: uri. |
input_image | array<string> | — | Reference images for style/appearance. Use @Image1, @Image2, etc. in the prompt to refer to them. Maximum 4 total (elements + reference images) when using video. |
elements | array<object> | — | Optional element references. Use @Element1, @Element2, etc. in the prompt to refer to them. |
negative_prompt | string | — | Text describing what to avoid in the generated video. |
aspect_ratio | string | "16:9" | Video aspect ratio One of: 9:16, 1:1, 16:9. |
generate_audio | boolean | false | Whether to generate native audio for the video. Supports Chinese and English voice output. |