Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.oxen.ai/llms.txt

Use this file to discover all available pages before exploring further.

Try WAN 2.7 - Image to Video in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: wan-v2-7-image-to-video WAN 2.7 image-to-video animates a static image into video with enhanced motion smoothness, superior scene fidelity, and greater visual coherence over previous versions. Supports first-frame-to-video, first-and-last-frame-to-video, and video continuation from a starting clip with an optional last frame. Output up to 1080P with durations from 2-15 seconds, plus optional driving audio for lip-sync and action timing.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "wan-v2-7-image-to-video",
  "prompt": "The camera slowly zooms in while the subject comes to life, cinematic lighting."
}'

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/wan-v2-7-image-to-video

Request parameters

Required parameters

FieldTypeDefaultDescription
promptstring"The camera slowly zooms in while the subject comes to life, cinematic lighting."Text prompt describing the desired video. Supports Chinese and English. Max 5000 characters.

Optional parameters

FieldTypeDefaultDescription
input_imagestringURL of the first frame image. Required for first-frame-to-video and first-and-last-frame-to-video. Formats: JPEG, JPG, PNG, BMP, WEBP. Resolution 240-8000 px per side, aspect ratio 1:8 to 8:1, max 20 MB. Mutually exclusive with input_video. Format: uri.
end_image_urlstringOptional URL of the last frame image for first-and-last-frame-to-video. Same format constraints as the first frame image. Format: uri.
input_videostringURL of an input video for continuation (first_clip). Formats: MP4, MOV. Duration 2-10s, resolution 240-4096 px per side, aspect ratio 1:8 to 8:1, max 100 MB. Mutually exclusive with input_image. Format: uri.
audio_urlstringOptional driving audio URL. Used for lip-sync and action timing. Supports WAV and MP3. Duration 2-30s, max 15 MB. If shorter than the video, the remainder is silent; if longer, it is truncated. Format: uri.
resolutionstring"1080P"Output video resolution tier. One of: 720P, 1080P.
durationinteger5Output video duration in seconds (2-15). Range: 2 – 15.
negative_promptstringContent to avoid in the video. Supports Chinese and English. Max 500 characters.
prompt_extendbooleantrueWhether the model rewrites short prompts to improve quality. Adds processing time.
watermarkbooleanfalseAdds an ‘AI Generated’ watermark to the bottom-right corner.
seedintegerRandom seed for reproducibility (0-2147483647). Range: 0 – 2147483647.