Skip to main content

Try WAN 2.7 - Reference to Video in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: wan-v2-7-reference-to-video WAN 2.7 reference-to-video generates video from reference images or videos using the R2V model with enhanced motion smoothness, superior scene fidelity, and greater visual coherence. It supports single-character performances, multi-character interactions, and multi-shot narration. Output up to 1080p resolution with durations from 2-10 seconds.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "wan-v2-7-reference-to-video",
  "prompt": "A person walking through a beautiful garden, cinematic style."
}'

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/wan-v2-7-reference-to-video

Request parameters

Required parameters

FieldTypeDefaultDescription
promptstring"A person walking through a beautiful garden, cinematic style."Text prompt describing the desired video. Max 5000 characters.

Optional parameters

FieldTypeDefaultDescription
input_imagearray<string>Reference image URLs for character/object appearance. Pass multiple images for multi-subject generation. Max 20 MB each.
input_videosarray<string>Reference video URLs for character/object appearance and motion. Pass multiple videos for multi-subject generation. Max 100 MB each.
aspect_ratiostring"16:9"Aspect ratio of the generated video. One of: 16:9, 9:16, 1:1, 4:3, 3:4.
resolutionstring"1080p"Output video resolution tier. One of: 720p, 1080p.
durationinteger5Output video duration in seconds (2-10). One of: 2, 3, 4, 5, 6, 7, 8, 9, 10.
negative_promptstringContent to avoid in the video. Max 500 characters.
multi_shotsbooleanfalseWhen true, enables intelligent multi-shot segmentation. When false, generates a single continuous shot.
seedintegerRandom seed for reproducibility (0-2147483647).