Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.oxen.ai/llms.txt

Use this file to discover all available pages before exploring further.

Try WAN 2.7 - Text to Video in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: wan-v2-7-text-to-video WAN 2.7 text-to-video generates video from a text prompt with enhanced motion smoothness, superior scene fidelity, and greater visual coherence over previous versions. Supports 720P and 1080P output, durations from 2-15 seconds, multiple aspect ratios (16:9, 9:16, 1:1, 4:3, 3:4), multi-shot generation controlled via natural language in the prompt, and optional driving audio.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "wan-v2-7-text-to-video",
  "prompt": "A kitten running in a meadow, cinematic lighting, smooth camera movement."
}'

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/wan-v2-7-text-to-video

Request parameters

Required parameters

FieldTypeDefaultDescription
promptstring"A kitten running in a meadow, cinematic lighting, smooth camera movement."Text prompt describing the desired video. Supports Chinese and English. Max 500 characters. Use ‘generate a single-shot video’ or ‘generate a multi-shot video’ (or timestamped shot descriptions like ‘Shot 1 [0-3s] Wide angle: …’) to control shot structure.

Optional parameters

FieldTypeDefaultDescription
aspect_ratiostring"16:9"Aspect ratio of the generated video. One of: 16:9, 9:16, 1:1, 4:3, 3:4.
resolutionstring"1080P"Output video resolution tier. One of: 720P, 1080P.
durationinteger5Output video duration in seconds (2-15). Range: 2 – 15.
negative_promptstringContent to avoid in the video. Supports Chinese and English. Max 500 characters.
audio_urlstringURL of driving audio. Supports WAV and MP3. Duration: 2-30s. Max 15 MB. If omitted, the model auto-generates matching background audio. Format: uri.
prompt_extendbooleantrueWhether the model rewrites short prompts to improve quality. Adds processing time.
watermarkbooleanfalseAdds an ‘AI-generated’ watermark to the bottom-right corner.
seedintegerRandom seed for reproducibility (0-2147483647). Range: 0 – 2147483647.