Skip to main content

Try Wan2.1 1.3B - Text to Video in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: wan-ai-wan2-1-t2v-1-3b-diffusers Wan-AI/Wan2.1-T2V-1.3B-Diffusers is a text-to-video diffusion model. It excels in generating 480P videos from text prompts efficiently on consumer-grade GPUs, requiring only 8.19GB of VRAM, while maintaining competitive video quality. Some other noteworthy features of Wan-AI/Wan2.1-T2V-1.3B-Diffusers include multilingual support (English and Chinese), image-to-video conversion, aspect ratio control, visual text rendering inside videos, prompt enhancement, and the ability to add sound effects or background music to generated videos.
MetricValue
Parameter Count1.3 billion
Mixture of ExpertsNo
Context LengthUnknown
MultilingualYes
Quantized*No
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "wan-ai-wan2-1-t2v-1-3b-diffusers",
  "prompt": "A beautiful landscape painting of a serene lake with mountains in the background and an ox in the foreground."
}'

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/wan-ai-wan2-1-t2v-1-3b-diffusers

Request parameters

Required parameters

FieldTypeDefaultDescription
promptstring"A beautiful landscape painting of a serene lake with mountains in the background and an ox in the foreground."Prompt for generated image

Optional parameters

FieldTypeDefaultDescription
heightinteger480Height of the video Range: 1 – 480.
widthinteger640Width of the video Range: 1 – 640.
negative_promptstring" "Negative prompt for generated image
num_inference_stepsinteger30Number of diffusion steps to take Range: 1 – 100.
num_framesinteger81Number of frames of video to generate Range: 1 – 120.
guidance_scalenumber5.0Guidance for generated video. Lower values can give more realistic videos. Range: 0 – 10.
seedintegerRandom seed. Set for reproducible generation