Skip to main content

Try Kling O3 Pro - Reference to Video in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: kling-video-o3-pro-reference-to-video Part of the Kling 3.0/o3 family exclusively available on fal.ai, this reference-to-video model transforms static reference images into dynamic video sequences. It excels at preserving image details like identity, layout, and text while adding realistic motion, camera movements, and scene progression based on cinematic prompts. Supports multi-shot generation, explicit motion instructions, flexible durations up to 15 seconds, and native audio when specified. Optimized for advertising, branded content, and realistic scene extensions with smooth transitions and narrative continuity.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "kling-video-o3-pro-reference-to-video"
}'

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/kling-video-o3-pro-reference-to-video

Request parameters

Optional parameters

FieldTypeDefaultDescription
start_image_urlstringThe first frame of the video. The model will try to extend the contents of this frame. Format: uri.
tail_image_urlstringThe last frame of the video. Requires start frame to be configured. The model will try to fill in between the frames. Format: uri.
input_imagearray<string>Reference images for style/appearance. Use @Image1, @Image2, etc. in the prompt to refer to them. Maximum 4 total (elements + reference images) when using video.
elementsarray<object>Optional element references. Use @Element1, @Element2, etc. in the prompt to refer to them.
negative_promptstringText describing what to avoid in the generated video.
aspect_ratiostring"16:9"Video aspect ratio One of: 9:16, 1:1, 16:9.
generate_audiobooleanfalseWhether to generate native audio for the video. Supports Chinese and English voice output.