Try Seedance 2.0 - Image to Video in the Workbench
Run this model interactively, tune parameters, and compare outputs.
bytedance-seedance-2-0-image-to-video
ByteDance Seedance 2.0 (Pro) animates a starting frame from a text motion prompt, with optional end-frame control for transitions. Supports 480p, 720p, or 1080p output, durations from 4-15 seconds, and synchronized audio.
Example request
- Sync
- Async
- Async with SSE
A Sync request blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation. See the video generation reference for more details.
- Minimal
- Basic parameters
- All parameters
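The original request samples are not shown here, so the following is only a sketch of what the three request bodies might look like, built from the parameter tables on this page. The image URLs and prompt text are placeholders, and the endpoint, authentication, and how the model ID is passed are deliberately left out because this page does not state them.

```python
import json

# Placeholder image URLs for illustration only; substitute your own hosted images.
START_FRAME = "https://example.com/start.png"
END_FRAME = "https://example.com/end.png"

# Minimal: only the two required fields.
minimal = {
    "prompt": "The camera slowly pushes in as leaves drift past.",
    "input_image": START_FRAME,
}

# Basic parameters: the required fields plus commonly tuned options.
basic = {
    **minimal,
    "resolution": "1080p",
    "duration": 8,
    "aspect_ratio": "16:9",
}

# All parameters: every field documented in the tables on this page.
all_params = {
    **basic,
    "input_image_has_face": False,
    "tail_image_url": END_FRAME,
    "tail_image_url_has_face": False,
    "generate_audio": True,
    "watermark": False,
}

for name, payload in (("minimal", minimal), ("basic", basic), ("all", all_params)):
    print(f"--- {name} ---")
    print(json.dumps(payload, indent=2))
```

Any field left out of the body falls back to the default listed in the optional parameters table.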
Fetch model details
The models endpoint returns the full model object, including its json_request_schema.
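Once the model object has been fetched, its json_request_schema can be inspected programmatically. The sketch below assumes the schema follows ordinary JSON Schema conventions (a required list and a properties map), which this page does not confirm. The sample_model dictionary is a hand-written stand-in, not real API output, and its id key is an assumption.

```python
from typing import Any


def summarize_schema(model: dict[str, Any]) -> dict[str, list[str]]:
    """Split the fields of a model's json_request_schema into required and optional."""
    schema = model["json_request_schema"]
    properties = schema.get("properties", {})
    required = list(schema.get("required", []))
    # Anything declared under properties but not listed as required is optional.
    optional = [name for name in properties if name not in required]
    return {"required": required, "optional": optional}


# Hand-written stand-in for part of a model object (assumed shape, not real output).
sample_model = {
    "id": "bytedance-seedance-2-0-image-to-video",
    "json_request_schema": {
        "type": "object",
        "required": ["prompt", "input_image"],
        "properties": {
            "prompt": {"type": "string"},
            "input_image": {"type": "string", "format": "uri"},
            "resolution": {
                "type": "string",
                "enum": ["480p", "720p", "1080p"],
                "default": "720p",
            },
        },
    },
}

summary = summarize_schema(sample_model)
print(summary)
```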
Request parameters
Required parameters
| Field | Type | Default | Description |
|---|---|---|---|
| prompt | string | — | The text prompt describing the desired motion and action for the video. |
| input_image | string | — | The URL of the starting frame image. Formats: JPEG, PNG, WebP. Max 30 MB. Format: uri. |
Optional parameters
| Field | Type | Default | Description |
|---|---|---|---|
| input_image_has_face | boolean | false | Set to true when the start frame contains a real human face. Content filters may block the request if this is not enabled. |
| tail_image_url | string | — | Optional URL of the end-frame image. When provided, the generated video transitions from the start frame to this end frame. Format: uri. |
| tail_image_url_has_face | boolean | false | Set to true when the end frame contains a real human face. Content filters may block the request if this is not enabled. |
| resolution | string | "720p" | Video resolution. 480p for faster generation, 720p for balance, 1080p for highest quality. One of: 480p, 720p, 1080p. |
| duration | integer | -1 | Duration of the video in seconds. Supports 4 to 15 seconds, or -1 to let the model decide based on the prompt. One of: -1, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15. |
| aspect_ratio | string | "adaptive" | The aspect ratio of the generated video. Use 16:9 for landscape, 9:16 for portrait/vertical, 1:1 for square, 21:9 for ultrawide cinematic, or adaptive to let the model decide. One of: adaptive, 21:9, 16:9, 4:3, 1:1, 3:4, 9:16. |
| generate_audio | boolean | false | Whether to generate synchronized audio. |
| watermark | boolean | false | Whether to add an ‘AI generated’ watermark to the output. |
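Because a Sync request can take several minutes, it may be worth checking a request body locally before sending it. The following is a minimal sketch of such a check, using only the allowed values and defaults listed in the tables above. The validate_request helper is hypothetical and not part of any SDK, and the service's own validation may differ.

```python
RESOLUTIONS = {"480p", "720p", "1080p"}
DURATIONS = {-1, *range(4, 16)}  # -1 lets the model decide; otherwise 4 to 15 seconds
ASPECT_RATIOS = {"adaptive", "21:9", "16:9", "4:3", "1:1", "3:4", "9:16"}
BOOLEAN_FIELDS = (
    "input_image_has_face",
    "tail_image_url_has_face",
    "generate_audio",
    "watermark",
)
KNOWN_FIELDS = {
    "prompt", "input_image", "tail_image_url",
    "resolution", "duration", "aspect_ratio", *BOOLEAN_FIELDS,
}


def validate_request(payload: dict) -> list[str]:
    """Return a list of problems found in a request body (empty means it looks valid)."""
    errors = []
    for field in ("prompt", "input_image"):
        value = payload.get(field)
        if not isinstance(value, str) or not value:
            errors.append(f"{field} is required and must be a non-empty string")
    # Fall back to the documented defaults for omitted fields.
    if payload.get("resolution", "720p") not in RESOLUTIONS:
        errors.append("resolution must be one of 480p, 720p, 1080p")
    duration = payload.get("duration", -1)
    # bool is a subclass of int in Python, so exclude it explicitly.
    if type(duration) is not int or duration not in DURATIONS:
        errors.append("duration must be -1 or an integer from 4 to 15")
    if payload.get("aspect_ratio", "adaptive") not in ASPECT_RATIOS:
        errors.append("aspect_ratio is not one of the documented values")
    for field in BOOLEAN_FIELDS:
        if field in payload and not isinstance(payload[field], bool):
            errors.append(f"{field} must be a boolean")
    for field in sorted(set(payload) - KNOWN_FIELDS):
        errors.append(f"unknown field: {field}")
    return errors


ok = validate_request({"prompt": "A slow pan", "input_image": "https://example.com/a.png"})
bad = validate_request({"prompt": "A slow pan", "duration": 3, "aspect_ratio": "auto"})
print(ok)
print(bad)
```

The second call reports three problems: the missing input_image, a duration outside the 4 to 15 range, and auto, which is not among the listed aspect_ratio values (the documented value is adaptive).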