Try Seedance 2.0 - Image to Video in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: bytedance-seedance-2-0-image-to-video

ByteDance Seedance 2.0 (Pro) animates a starting frame from a text motion prompt, with optional end-frame control for transitions. Supports 480p, 720p, 1080p, or 4k output, durations from 4-15 seconds, and synchronized audio.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
This request blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation. See the video generation reference for more details.
curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "bytedance-seedance-2-0-image-to-video",
  "prompt": "Slow cinematic dolly-in as warm golden-hour light sweeps across the scene, drifting dust and a gentle camera push, 35mm film grain, shallow depth of field.",
  "input_image": "https://hub.oxen.ai/api/repos/elau/assets/file/main/bloxy/bloxy_cropped_512x512.png"
}'
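
For readers who prefer Python over cURL, the same blocking request can be sketched with only the standard library. The function names `build_generate_request` and `generate_video`, the 30-minute timeout, and the assumption that the endpoint answers with a JSON body are illustrative choices, not part of the documented API; the URL, headers, and payload fields are taken from the cURL example above.

```python
import json
import urllib.request

API_URL = "https://hub.oxen.ai/api/ai/videos/generate"

# Same payload as the cURL example above.
payload = {
    "model": "bytedance-seedance-2-0-image-to-video",
    "prompt": (
        "Slow cinematic dolly-in as warm golden-hour light sweeps across "
        "the scene, drifting dust and a gentle camera push, 35mm film "
        "grain, shallow depth of field."
    ),
    "input_image": (
        "https://hub.oxen.ai/api/repos/elau/assets/file/main/"
        "bloxy/bloxy_cropped_512x512.png"
    ),
}


def build_generate_request(payload, api_key):
    """Build (but do not send) the POST request for video generation."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def generate_video(payload, api_key, timeout=1800):
    """Send the blocking request and return the parsed response.

    Assumption: the endpoint returns a JSON body. The timeout is a
    hypothetical value chosen to exceed the typical 5-15 minute wait.
    """
    request = build_generate_request(payload, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `generate_video(payload, os.environ["OXEN_API_KEY"])` (after `import os`) sends the request and waits for the result. Keeping the request construction separate from the network call makes it possible to inspect the outgoing request without spending a generation.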

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/bytedance-seedance-2-0-image-to-video
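
A rough Python equivalent of this lookup, again using only the standard library. The helper names `build_model_request` and `fetch_model` and the 30-second timeout are hypothetical, and the sketch assumes the endpoint returns JSON; where `json_request_schema` sits inside the response is not specified here, so the function simply returns the whole parsed body.

```python
import json
import urllib.request

MODELS_URL = "https://hub.oxen.ai/api/ai/models"


def build_model_request(model_id, api_key):
    """Build (but do not send) the GET request for a model's details."""
    return urllib.request.Request(
        f"{MODELS_URL}/{model_id}",
        headers={"Authorization": f"Bearer {api_key}"},
    )


def fetch_model(model_id, api_key, timeout=30):
    """Fetch the model object and return the parsed response.

    Assumption: the response body is JSON.
    """
    request = build_model_request(model_id, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```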

Request parameters

Required parameters

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| prompt | string | "Slow cinematic dolly-in as warm golden-hour light sweeps across the scene, drifting dust and a gentle camera push, 35mm film grain, shallow depth of field." | The text prompt describing the desired motion and action for the video. |
| input_image | string | | The URL of the starting frame image. Formats: JPEG, PNG, WebP. Max 30 MB. Format: uri. |

Optional parameters

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| input_image_has_face | boolean | false | Turn this on when the start frame contains a real human face. Content filters may block the request if this is not enabled. |
| tail_image_url | string | | Optional URL of the end-frame image. When provided, the generated video transitions from the start frame to this end frame. Format: uri. |
| tail_image_url_has_face | boolean | false | Turn this on when the end frame contains a real human face. Content filters may block the request if this is not enabled. |
| resolution | string | "720p" | Video resolution. 480p for faster generation, 720p for balance, 1080p for high quality, 4k for maximum detail. One of: 480p, 720p, 1080p, 4k. |
| duration | integer | -1 | Duration of the video in seconds. Supports 4 to 15 seconds, or -1 (auto) to let the model decide based on the prompt. One of: -1, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15. |
| aspect_ratio | string | "16:9" | The aspect ratio of the generated video. Use 16:9 for landscape, 9:16 for portrait/vertical, 1:1 for square, 21:9 for ultrawide cinematic, or adaptive to let the model decide. One of: adaptive, 21:9, 16:9, 4:3, 1:1, 3:4, 9:16. |
| generate_audio | boolean | false | Whether to generate synchronized audio. |
| watermark | boolean | false | Whether to add an ‘AI generated’ watermark to the output. |
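
The allowed values listed above can be checked on the client before a long-running request is sent. The following is a minimal sketch: `build_payload` and its validation rules are hypothetical helpers written from the parameter descriptions on this page, not part of any Oxen client library, and the server remains the authority on what it accepts.

```python
MODEL_ID = "bytedance-seedance-2-0-image-to-video"

# Allowed values copied from the parameter descriptions on this page.
RESOLUTIONS = {"480p", "720p", "1080p", "4k"}
DURATIONS = {-1, *range(4, 16)}  # -1 lets the model decide
ASPECT_RATIOS = {"adaptive", "21:9", "16:9", "4:3", "1:1", "3:4", "9:16"}
BOOLEAN_FIELDS = {
    "input_image_has_face",
    "tail_image_url_has_face",
    "generate_audio",
    "watermark",
}
OPTIONAL_FIELDS = BOOLEAN_FIELDS | {
    "tail_image_url",
    "resolution",
    "duration",
    "aspect_ratio",
}


def build_payload(prompt, input_image, **options):
    """Return a request payload, raising ValueError on invalid options."""
    unknown = set(options) - OPTIONAL_FIELDS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    if "resolution" in options and options["resolution"] not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(RESOLUTIONS)}")
    if "duration" in options and options["duration"] not in DURATIONS:
        raise ValueError("duration must be -1 or an integer from 4 to 15")
    if "aspect_ratio" in options and options["aspect_ratio"] not in ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {sorted(ASPECT_RATIOS)}")
    for field in BOOLEAN_FIELDS & set(options):
        if not isinstance(options[field], bool):
            raise ValueError(f"{field} must be a boolean")
    return {
        "model": MODEL_ID,
        "prompt": prompt,
        "input_image": input_image,
        **options,
    }
```

For example, `build_payload("Slow pan left", "https://example.com/start.png", duration=8, resolution="1080p", generate_audio=True)` returns a dictionary ready to serialize as the request body, while `duration=3` raises `ValueError` before any request is made. The example URL is a placeholder.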