Skip to main content

Try Kling O1 - Image to Video in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: kling-video-o1-image-to-video Kling O1 - Image to Video is a diffusion-based multimodal video generation model designed for turning images (optionally combined with text and other visual references) into short, coherent video clips. It excels in image-to-video generation where a single image or a small set of reference images is transformed into cinematic clips with stable characters, controlled motion, and consistent environments, while respecting prompts for style, mood, and camera movement. Some other noteworthy features of Kling O1 - Image to Video include support for multiple image references (up to 7 in some workflows) to refine character identity and scene details, and controls such as start/end frame specification and motion/camera behavior guidance to achieve smoother, more intentional transitions and storytelling shots.
MetricValue
Parameter CountUnknown
Mixture of ExpertsUnknown
Context LengthUnknown
MultilingualNo
Quantized*Unknown
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "kling-video-o1-image-to-video",
  "prompt": "Zoom into the ox, while it is walking forward on the road, change the season from winter to summer as you zoom in, cinematic",
  "input_image": "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/ox_zoom_out_1926_1076.png"
}'

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/kling-video-o1-image-to-video

Request parameters

Required parameters

FieldTypeDefaultDescription
promptstring"Zoom into the ox, while it is walking forward on the road, change the season from winter to summer as you zoom in, cinematic"Text description of what you want to generate, or the instruction on how to edit the given image.
input_imagestring"https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/ox_zoom_out_1926_1076.png"Image to use as reference. Must be jpeg, png, gif, or webp. Format: uri.

Optional parameters

FieldTypeDefaultDescription
tail_image_urlstringImage to use as reference. Must be jpeg, png, gif, or webp. Format: uri.
durationinteger5Video duration in seconds One of: 5, 10.