Try Kling O3 4K - Reference to Video in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: kling-video-o3-4k-reference-to-video

Part of the Kling 3.0/o3 family on fal.ai, this 4K reference-to-video model generates native 4K (3840x2160) video from reference images and text prompts with no upscaling step. It excels at preserving identity, layout, and stylistic detail while adding cinematic motion, camera movement, and scene progression. Supports element and style references (combined limit of 7), optional start and end frame anchoring, multi-shot generation up to 15 seconds, and optional native synchronized audio in English and Chinese. Optimized for advertising, branded content, and production-grade scene work where 4K detail is required.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
This request blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation. See the video generation reference for more details.
curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "kling-video-o3-4k-reference-to-video"
}'
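
The request above sends only the model ID. As a minimal sketch, the same blocking call can be made from Python with the standard library, adding optional fields from the Request parameters table. The helper names `build_payload` and `generate_video` are illustrative and not part of any Oxen SDK, the 30-minute timeout is an assumed value, and the response is assumed to be JSON. Required fields such as the prompt are not listed in this section, so pass them through `extra` using the names from the model's json_request_schema.

```python
import json
import urllib.request

API_URL = "https://hub.oxen.ai/api/ai/videos/generate"
MODEL_ID = "kling-video-o3-4k-reference-to-video"


def build_payload(extra=None, **optional):
    """Return the JSON body: the model ID plus every field that is not None."""
    payload = {"model": MODEL_ID}
    # Optional fields left as None are omitted so the server applies its defaults.
    payload.update({k: v for k, v in optional.items() if v is not None})
    # `extra` carries fields not covered here (names come from the schema).
    payload.update(extra or {})
    return payload


def generate_video(payload, api_key, timeout=1800):
    """POST the payload and return the decoded response (assumed to be JSON).

    The call blocks until the video is ready, hence the long (assumed) timeout.
    """
    request = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A typical call would be `generate_video(build_payload(aspect_ratio="9:16", generate_audio=True), os.environ["OXEN_API_KEY"])`, with `os` imported by the caller.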

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/kling-video-o3-4k-reference-to-video
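
A Python sketch of the same lookup follows. The nesting of the returned model object is not shown here, so the hypothetical helper `find_key` searches the decoded response for `json_request_schema` instead of assuming a fixed path. `fetch_request_schema` is likewise an illustrative name.

```python
import json
import urllib.request

MODELS_URL = "https://hub.oxen.ai/api/ai/models/"


def find_key(node, key):
    """Depth-first search of nested dicts/lists; return the first value for `key`, else None."""
    if isinstance(node, dict):
        if key in node:
            return node[key]
        children = node.values()
    elif isinstance(node, list):
        children = node
    else:
        return None
    for child in children:
        found = find_key(child, key)
        if found is not None:
            return found
    return None


def fetch_request_schema(model_id, api_key):
    """GET the model object and return its json_request_schema, or None if absent."""
    request = urllib.request.Request(
        MODELS_URL + model_id,
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        model = json.load(response)
    return find_key(model, "json_request_schema")
```

Searching by key keeps the sketch independent of the response envelope. Once the actual structure is known, a direct lookup is simpler.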

Request parameters

Optional parameters

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| start_image_url | string | | The first frame of the video. The model will try to extend the contents of this frame. Format: uri. |
| tail_image_url | string | | The last frame of the video. Requires start_image_url to be set. The model will try to fill in between the two frames. Format: uri. |
| input_image | array<string> | | Reference images for style/appearance. Use @Image1, @Image2, etc. in the prompt to refer to them. Maximum 4 total (elements + reference images) when using video. |
| elements | array<object> | | Optional element references. Use @Element1, @Element2, etc. in the prompt to refer to them. |
| negative_prompt | string | | Text describing what to avoid in the generated video. |
| aspect_ratio | string | "16:9" | Video aspect ratio. One of: 9:16, 1:1, 16:9. |
| generate_audio | boolean | false | Whether to generate native audio for the video. Supports Chinese and English voice output. |
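
Because a blocking request can take several minutes, it may be worth checking the documented constraints locally before sending. The sketch below is a hypothetical client-side check, not the server's validation. It takes the prompt as a separate argument because the prompt field name is not listed in this section, and it assumes @Image and @Element references are numbered from 1. The default limit of 7 comes from the model overview; pass `max_references=4` for the "when using video" case.

```python
import re

ALLOWED_ASPECT_RATIOS = {"9:16", "1:1", "16:9"}


def validate_params(params, prompt="", max_references=7):
    """Return a list of problems found; an empty list means every check passed."""
    problems = []

    # aspect_ratio defaults to "16:9" when omitted.
    ratio = params.get("aspect_ratio", "16:9")
    if ratio not in ALLOWED_ASPECT_RATIOS:
        problems.append(f"aspect_ratio {ratio!r} is not one of 9:16, 1:1, 16:9")

    # The last frame can only be anchored when a first frame is given.
    if params.get("tail_image_url") and not params.get("start_image_url"):
        problems.append("tail_image_url requires start_image_url")

    images = params.get("input_image") or []
    elements = params.get("elements") or []
    total = len(images) + len(elements)
    if total > max_references:
        problems.append(f"{total} references exceed the limit of {max_references}")

    # Every @ImageN / @ElementN in the prompt should point at a supplied entry
    # (assumption: numbering starts at 1).
    for kind, items in (("Image", images), ("Element", elements)):
        for number in re.findall(rf"@{kind}(\d+)", prompt):
            if not 1 <= int(number) <= len(items):
                problems.append(f"@{kind}{number} has no matching entry")

    return problems
```

For example, `validate_params({"tail_image_url": "https://example.com/last.png"})` reports that a start frame is missing, while a payload with only `aspect_ratio` set to "1:1" passes.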