Skip to main content

Try Kling 3.0 Pro: Motion Control in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: kling-video-v3-pro-motion-control Transfer movements from a reference video to any character image. Cost-effective mode for motion transfer, perfect for portraits and simple animations.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "kling-video-v3-pro-motion-control",
  "prompt": "<prompt>",
  "input_image": "https://hub.oxen.ai/api/repos/elau/assets/file/main/bloxy/bloxy_cropped_512x512.png",
  "input_video": "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/winter_summer_ox.mp4",
  "character_orientation": "video"
}'

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/kling-video-v3-pro-motion-control

Request parameters

Required parameters

FieldTypeDefaultDescription
promptstringText prompt for the scene or character.
input_imagestringReference image URL. The characters, backgrounds, and other elements in the generated video are based on this reference image. Characters should have clear body proportions, avoid occlusion, and occupy more than 5% of the image area. Format: uri.
input_videostringReference video URL (mp4, mov). The character actions in the generated video will be consistent with this reference video. Should contain a realistic style character with entire body or upper body visible, including head, without obstruction. Duration limit depends on character_orientation: 10s max for ‘image’, 30s max for ‘video’. Format: uri.
character_orientationstring"video"Controls whether the output character’s orientation matches the reference image or video. ‘video’: orientation matches reference video - better for complex motions (max 30s). ‘image’: orientation matches reference image - better for following camera movements (max 10s). One of: image, video.

Optional parameters

FieldTypeDefaultDescription
keep_original_soundbooleantrueWhether to keep the original sound from the reference video.
elementsarray<object>Optional element for facial consistency binding. Upload a facial element to enhance identity preservation in the generated video. Only 1 element is supported. Reference in prompt as @Element1. Element binding is only supported when character_orientation is ‘video’.