
Try Seedance 2.0 Fast - Reference to Video in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: bytedance-seedance-2-0-fast-reference-to-video

ByteDance Seedance 2 Fast reference-to-video generates video from a text prompt guided by reference images, videos, and/or audio. This is the enterprise fast tier with lower latency and cost. Reference media are addressed in the prompt as @Image1, @Image2, @Video1, @Video2, @Audio1, etc.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
The request below blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation. See the video generation reference for more details.
curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "bytedance-seedance-2-0-fast-reference-to-video",
  "prompt": "<prompt>"
}'
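
The same call can be sketched in Python using only the standard library. The endpoint, headers, and the `model` and `prompt` fields come from the cURL example above; the helper names, the 20-minute timeout, and the decision to return the response body untouched are assumptions, since the response layout is not described on this page.

```python
import json
import urllib.request

API_URL = "https://hub.oxen.ai/api/ai/videos/generate"
MODEL_ID = "bytedance-seedance-2-0-fast-reference-to-video"


def build_generate_request(prompt, api_key, **optional_params):
    """Build (but do not send) the POST request from the cURL example.

    optional_params are merged into the JSON body as-is, e.g.
    resolution="480p" or duration="5".
    """
    payload = {"model": MODEL_ID, "prompt": prompt, **optional_params}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def generate_video(prompt, api_key, timeout_s=1200, **optional_params):
    """Send the request and return the decoded JSON body.

    The call blocks until the video is ready, so the timeout is generous
    (assumed value). The response is returned unmodified because its
    layout is not documented here.
    """
    request = build_generate_request(prompt, api_key, **optional_params)
    with urllib.request.urlopen(request, timeout=timeout_s) as response:
        return json.load(response)
```

A call would look like `generate_video("<prompt>", os.environ["OXEN_API_KEY"], resolution="480p")`, with `os` imported by the caller. Splitting request construction from sending keeps the payload easy to inspect before committing to a multi-minute blocking call.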

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/bytedance-seedance-2-0-fast-reference-to-video
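
A rough Python equivalent of the lookup above is sketched here. The URL and the `json_request_schema` field name come from this page; where that field sits inside the returned object is not stated, so the hypothetical `find_key` helper searches the whole response rather than assuming a fixed path.

```python
import json
import urllib.request

MODELS_URL = "https://hub.oxen.ai/api/ai/models/"


def build_model_request(model_id, api_key):
    """Build the GET request for a single model object."""
    return urllib.request.Request(
        MODELS_URL + model_id,
        headers={"Authorization": f"Bearer {api_key}"},
    )


def find_key(node, key):
    """Depth-first search for `key` in nested dicts and lists.

    Returns the first value found, or None if the key is absent.
    """
    if isinstance(node, dict):
        if key in node:
            return node[key]
        children = list(node.values())
    elif isinstance(node, list):
        children = node
    else:
        return None
    for child in children:
        found = find_key(child, key)
        if found is not None:
            return found
    return None


def fetch_request_schema(model_id, api_key):
    """Fetch the model object and return its json_request_schema, if any."""
    request = build_model_request(model_id, api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        model = json.load(response)
    return find_key(model, "json_request_schema")
```

Reading the schema at runtime, rather than hard-coding the parameter list, should keep a client in step with the server if allowed values change.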

Request parameters

Required parameters

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| prompt | string | | The text prompt used to generate the video. Use @Image1, @Video1, @Audio1, etc. to refer to reference media. |
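
Because the prompt addresses reference media by tag, it can help to check before sending that every tag has a matching file. The sketch below assumes tags are numbered from 1 and follow the order of the corresponding input array; the numbering from 1 matches the @Image1 examples, but the ordering is an assumption. `find_unmatched_references` is a hypothetical helper, not part of the API.

```python
import re

# Matches tags such as @Image1, @Video2, @Audio3.
REFERENCE_PATTERN = re.compile(r"@(Image|Video|Audio)(\d+)")


def find_unmatched_references(prompt, n_images=0, n_videos=0, n_audios=0):
    """Return the tags in `prompt` that have no corresponding input file."""
    available = {"Image": n_images, "Video": n_videos, "Audio": n_audios}
    unmatched = []
    for kind, index in REFERENCE_PATTERN.findall(prompt):
        # Tags are assumed to be 1-based: @Image1 is the first image.
        if not 1 <= int(index) <= available[kind]:
            unmatched.append(f"@{kind}{index}")
    return unmatched
```

For example, a prompt mentioning @Video2 with only one reference video supplied would be reported as `["@Video2"]`.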

Optional parameters

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| input_images | array<string> | | Reference images to guide video generation. Refer to them in the prompt as @Image1, @Image2, etc. Supported formats: JPEG, PNG, WebP. Max 30 MB per image. Up to 9 images. Total files across all modalities must not exceed 12. |
| input_videos | array<string> | | Reference videos to guide video generation. Refer to them in the prompt as @Video1, @Video2, etc. Supported formats: MP4, MOV. Up to 3 videos; combined duration must be between 2 and 15 seconds, and total size must be under 50 MB. Each video must be between ~480p (640x640) and ~720p (834x1112) in resolution. |
| input_audios | array<string> | | Reference audio files to guide video generation. Refer to them in the prompt as @Audio1, @Audio2, etc. Supported formats: MP3, WAV. Up to 3 files; combined duration must not exceed 15 seconds. Max 15 MB per file. If audio is provided, at least one reference image or video is required. |
| resolution | string | "720p" | Video resolution: 480p for faster generation, 720p for better quality. One of: 480p, 720p. |
| duration | string | "auto" | Duration of the video in seconds. Supports 4 to 15 seconds, or auto to let the model decide based on the prompt. One of: auto, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15. |
| generate_audio | boolean | true | Whether to generate synchronized audio for the video, including sound effects, ambient sounds, and lip-synced speech. The cost of video generation is the same whether or not audio is generated. |
| aspect_ratio | string | "auto" | The aspect ratio of the generated video. Use 16:9 for landscape, 9:16 for portrait/vertical, 1:1 for square, 21:9 for ultrawide cinematic, or auto to let the model decide. One of: auto, 21:9, 16:9, 4:3, 1:1, 3:4, 9:16. |
| seed | integer | | Random seed for reproducibility. Note that results may still vary slightly even with the same seed. |
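
Several of the limits above can be checked on the client before a long-running request is submitted. The following is a minimal sketch, not an official validator: `validate_params` is a hypothetical helper, it covers only counts and enumerated values (not file sizes, durations, or formats), and it assumes numeric durations are sent as strings because the field type is listed as string.

```python
RESOLUTIONS = {"480p", "720p"}
DURATIONS = {"auto"} | {str(seconds) for seconds in range(4, 16)}  # "4".."15"
ASPECT_RATIOS = {"auto", "21:9", "16:9", "4:3", "1:1", "3:4", "9:16"}
MAX_FILES = {"input_images": 9, "input_videos": 3, "input_audios": 3}
MAX_TOTAL_FILES = 12


def validate_params(params):
    """Return a list of problems found in a request body (empty if none)."""
    errors = []
    if not isinstance(params.get("prompt"), str):
        errors.append("prompt is required and must be a string")

    # Per-modality and overall file-count limits.
    counts = {}
    for field, limit in MAX_FILES.items():
        counts[field] = len(params.get(field, []))
        if counts[field] > limit:
            errors.append(f"{field}: at most {limit} files, got {counts[field]}")
    if sum(counts.values()) > MAX_TOTAL_FILES:
        errors.append(f"at most {MAX_TOTAL_FILES} reference files in total")
    if counts["input_audios"] and not (counts["input_images"] or counts["input_videos"]):
        errors.append("input_audios requires at least one reference image or video")

    # Enumerated string parameters.
    enums = (
        ("resolution", RESOLUTIONS),
        ("duration", DURATIONS),
        ("aspect_ratio", ASPECT_RATIOS),
    )
    for field, allowed in enums:
        if field in params and params[field] not in allowed:
            errors.append(f"{field}: expected one of {sorted(allowed)}")

    if "generate_audio" in params and not isinstance(params["generate_audio"], bool):
        errors.append("generate_audio must be a boolean")
    seed = params.get("seed")
    # bool is a subclass of int in Python, so reject it explicitly.
    if seed is not None and (isinstance(seed, bool) or not isinstance(seed, int)):
        errors.append("seed must be an integer")
    return errors
```

An empty list means the body passed these checks; the server remains the authority and may still reject a request for reasons not covered here.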