Try WAN 2.7 - Image to Video in the Workbench
Run this model interactively, tune parameters, and compare outputs.
wan-v2-7-image-to-video
WAN 2.7 image-to-video animates a static image into video, with smoother motion, higher scene fidelity, and better visual coherence than previous versions. It supports first-frame-to-video, first-and-last-frame-to-video, and video continuation from a starting clip with an optional last frame. Output goes up to 1080P with durations of 2 to 15 seconds, and optional driving audio can guide lip-sync and action timing.
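The three modes are selected by which inputs a request carries. The following is a minimal sketch of that selection, using the field names from the parameter table (input_image, end_image_url, input_video). The function name, the returned labels, and the rule that a request with neither an image nor a video is rejected are assumptions made for illustration, not documented API behavior.

```python
from typing import Optional


def infer_mode(input_image: Optional[str] = None,
               end_image_url: Optional[str] = None,
               input_video: Optional[str] = None) -> str:
    """Return which generation mode a set of inputs selects (illustrative)."""
    # The parameter table states these two fields are mutually exclusive.
    if input_image and input_video:
        raise ValueError("input_image and input_video are mutually exclusive")
    if input_video:
        # Continuation from a starting clip; a last frame is optional.
        return "video-continuation"
    if input_image:
        if end_image_url:
            return "first-and-last-frame-to-video"
        return "first-frame-to-video"
    # Assumption: with neither input, no documented mode applies.
    raise ValueError("provide either input_image or input_video")
```

For example, `infer_mode(input_image="https://example.com/first.png")` selects first-frame-to-video, and adding `end_image_url` switches it to first-and-last-frame-to-video.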
Example request
- Sync
- Async
- Async with SSE
The Sync request blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation. See the video generation reference for more details.
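Because generation can take several minutes, an asynchronous client usually submits the job and then checks on it periodically. The sketch below shows only the waiting loop. It deliberately does not call any real endpoint: `fetch_status` is a hypothetical callable you would supply, and the `status` key and the "completed" and "failed" values are assumptions, since this page does not describe the async response shape. Check the video generation reference for the actual fields.

```python
import time
from typing import Callable, Dict, Iterable


def wait_for_video(fetch_status: Callable[[], Dict],
                   done: Iterable[str] = ("completed",),
                   failed: Iterable[str] = ("failed",),
                   interval_s: float = 15.0,
                   timeout_s: float = 20 * 60,
                   sleep: Callable[[float], None] = time.sleep) -> Dict:
    """Poll fetch_status() until a terminal status is seen or time runs out."""
    done_set, failed_set = set(done), set(failed)
    deadline = time.monotonic() + timeout_s
    while True:
        job = fetch_status()          # hypothetical: returns the job as a dict
        status = job.get("status")    # assumed key name
        if status in done_set:
            return job
        if status in failed_set:
            raise RuntimeError(f"generation failed: {job}")
        if time.monotonic() >= deadline:
            raise TimeoutError("video not ready before timeout")
        sleep(interval_s)
```

The default timeout of 20 minutes leaves some headroom over the typical 5-15 minute generation time. The `sleep` argument is injectable so the loop can be exercised in tests without real delays.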
Fetch model details
The models endpoint returns the full model object, including its json_request_schema.
Request parameters
Required parameters
| Field | Type | Default | Description |
|---|---|---|---|
| prompt | string | "The camera slowly zooms in while the subject comes to life, cinematic lighting." | Text prompt describing the desired video. Supports Chinese and English. Max 5000 characters. |
Optional parameters
| Field | Type | Default | Description |
|---|---|---|---|
| input_image | string | — | URL of the first frame image. Required for first-frame-to-video and first-and-last-frame-to-video. Formats: JPEG, JPG, PNG, BMP, WEBP. Resolution 240-8000 px per side, aspect ratio 1:8 to 8:1, max 20 MB. Mutually exclusive with input_video. Format: uri. |
| end_image_url | string | — | Optional URL of the last frame image for first-and-last-frame-to-video. Same format constraints as the first frame image. Format: uri. |
| input_video | string | — | URL of an input video for continuation (first_clip). Formats: MP4, MOV. Duration 2-10 s, resolution 240-4096 px per side, aspect ratio 1:8 to 8:1, max 100 MB. Mutually exclusive with input_image. Format: uri. |
| audio_url | string | — | Optional driving audio URL. Used for lip-sync and action timing. Supports WAV and MP3. Duration 2-30 s, max 15 MB. If shorter than the video, the remainder is silent; if longer, it is truncated. Format: uri. |
| resolution | string | "1080P" | Output video resolution tier. One of: 720P, 1080P. |
| duration | integer | 5 | Output video duration in seconds. Range: 2-15. |
| negative_prompt | string | — | Content to avoid in the video. Supports Chinese and English. Max 500 characters. |
| prompt_extend | boolean | true | Whether the model rewrites short prompts to improve quality. Adds processing time. |
| watermark | boolean | false | Adds an ‘AI Generated’ watermark to the bottom-right corner. |
| seed | integer | — | Random seed for reproducibility. Range: 0-2147483647. |
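The limits in the tables above can be checked on the client before a request is sent, which avoids waiting on a job that would be rejected. The following is a minimal sketch under stated assumptions: `build_params` and `audio_coverage` are hypothetical helper names, the non-empty prompt rule is assumed from the field being required, and the returned dict covers only the model parameters, not whatever request envelope the API expects. URL fields such as input_image or audio_url are passed through unchecked.

```python
from typing import Any, Dict, Optional, Tuple

RESOLUTIONS = ("720P", "1080P")
MAX_SEED = 2147483647


def build_params(prompt: str, *, resolution: str = "1080P", duration: int = 5,
                 negative_prompt: Optional[str] = None,
                 seed: Optional[int] = None, **urls: str) -> Dict[str, Any]:
    """Validate the documented ranges and return a parameter dict."""
    if not prompt:  # assumption: a required prompt must be non-empty
        raise ValueError("prompt is required")
    if len(prompt) > 5000:
        raise ValueError("prompt exceeds 5000 characters")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {RESOLUTIONS}")
    if not 2 <= duration <= 15:
        raise ValueError("duration must be 2-15 seconds")
    if negative_prompt is not None and len(negative_prompt) > 500:
        raise ValueError("negative_prompt exceeds 500 characters")
    if seed is not None and not 0 <= seed <= MAX_SEED:
        raise ValueError("seed must be in 0-2147483647")
    if "input_image" in urls and "input_video" in urls:
        raise ValueError("input_image and input_video are mutually exclusive")

    params: Dict[str, Any] = {"prompt": prompt, "resolution": resolution,
                              "duration": duration, **urls}
    if negative_prompt is not None:
        params["negative_prompt"] = negative_prompt
    if seed is not None:
        params["seed"] = seed
    return params


def audio_coverage(audio_s: float, video_s: float) -> Tuple[float, float]:
    """Return (seconds of audio used, seconds of trailing silence)."""
    used = min(audio_s, video_s)  # longer audio is truncated to the video
    return used, video_s - used   # shorter audio leaves a silent remainder
```

For instance, a 3-second audio clip on a 5-second video gives `audio_coverage(3, 5) == (3, 2)`: three seconds of driven motion followed by two silent seconds.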