bytedance-depth-anything-video
Estimate temporally consistent depth maps from video using Video Depth Anything.
Example request
- Sync
- Async
- Async with SSE
A sync request blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation. See the video generation reference for more details.
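Because a job can run for many minutes, an async client usually submits the request and then checks on it periodically instead of holding a connection open. The sketch below shows only that polling loop. Everything API-specific is an assumption: `fetch_status` is a hypothetical callable you would implement against the real status endpoint, the `status` field and the terminal state names `completed` and `failed` are placeholders, and the 20-minute timeout is simply a margin above the typical 5-15 minutes quoted above.

```python
import time
from typing import Any, Callable, Dict

# Hypothetical terminal states; the real API may use different names.
TERMINAL_STATES = {"completed", "failed"}


def poll_until_done(
    fetch_status: Callable[[], Dict[str, Any]],
    interval_s: float = 10.0,
    timeout_s: float = 20 * 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Call fetch_status repeatedly until it reports a terminal state.

    fetch_status is supplied by the caller and is expected to return a
    dict with a "status" key (an assumed response shape).
    """
    deadline = time.monotonic() + timeout_s
    while True:
        job = fetch_status()
        if job.get("status") in TERMINAL_STATES:
            return job
        if time.monotonic() >= deadline:
            raise TimeoutError("job did not finish before the timeout")
        sleep(interval_s)
```

Passing `sleep` as a parameter keeps the loop easy to exercise with a fake status source before wiring it to a real endpoint.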
- Minimal
- All parameters
Fetch model details
The models endpoint returns the full model object, including its json_request_schema.
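Once the model object has been fetched, its json_request_schema can be used to discover which fields are required and what the defaults are. The sketch below assumes the schema follows the usual JSON Schema layout with `required` and `properties` keys, which this page does not confirm. `SAMPLE_MODEL` is a hand-written, abbreviated stand-in rather than a real response, and the helper names are illustrative.

```python
from typing import Any, Dict, List

# Hand-written stand-in for a model object; NOT a real API response.
SAMPLE_MODEL: Dict[str, Any] = {
    "json_request_schema": {
        "required": ["video_url"],
        "properties": {
            "video_url": {"type": "string", "format": "uri"},
            "depth_model": {"type": "string", "default": "VDA-Large"},
            "colormap": {"type": "string", "default": "grayscale"},
        },
    }
}


def required_fields(model: Dict[str, Any]) -> List[str]:
    """Return the field names listed under the schema's "required" key."""
    schema = model.get("json_request_schema") or {}
    return list(schema.get("required", []))


def optional_defaults(model: Dict[str, Any]) -> Dict[str, Any]:
    """Map each non-required field to its default (None if unspecified)."""
    schema = model.get("json_request_schema") or {}
    required = set(schema.get("required", []))
    properties = schema.get("properties", {})
    return {
        name: spec.get("default")
        for name, spec in properties.items()
        if name not in required
    }
```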
Request parameters
Required parameters
| Field | Type | Default | Description |
|---|---|---|---|
| video_url | string | — | URL of the input video to estimate depth for. Format: uri. |
Optional parameters
| Field | Type | Default | Description |
|---|---|---|---|
| depth_model | string | "VDA-Large" | Depth estimation model size. VDA-Large = best quality, VDA-Small = fastest. One of: VDA-Small, VDA-Base, VDA-Large. |
| colormap | string | "grayscale" | Colormap for depth visualization. One of: grayscale, turbo, inferno, magma, viridis. |
| resolution | string | "auto" | Output resolution. Auto preserves input resolution up to 1080p. One of: auto, 360p, 480p, 720p, 1080p. |
| max_frames | integer | — | Maximum number of frames to process. Leave unset to process all frames. |
| output_fps | number | — | Output video FPS. Leave unset to use the input frame rate. |
| side_by_side | boolean | false | If true, output a side-by-side original and depth comparison video. |
| include_raw_depths | boolean | false | If true, exports raw float32 depths as an NPZ file. |
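The parameter tables translate directly into a small client-side check that catches invalid values before a long-running job is submitted. The sketch below builds only the parameter object. How the model identifier, authentication, and endpoint URL are attached is not shown in this section, so they are left out, and `build_payload` is an illustrative helper name rather than part of any SDK.

```python
from typing import Any, Dict

# Allowed values copied from the optional-parameters table.
ALLOWED_VALUES = {
    "depth_model": {"VDA-Small", "VDA-Base", "VDA-Large"},
    "colormap": {"grayscale", "turbo", "inferno", "magma", "viridis"},
    "resolution": {"auto", "360p", "480p", "720p", "1080p"},
}
# Optional fields without a fixed set of allowed values.
FREE_FIELDS = {"max_frames", "output_fps", "side_by_side", "include_raw_depths"}


def build_payload(video_url: str, **options: Any) -> Dict[str, Any]:
    """Validate options against the documented fields and return a dict.

    Options set to None are dropped, so the server-side default applies.
    """
    if not video_url:
        raise ValueError("video_url is required")
    unknown = set(options) - set(ALLOWED_VALUES) - FREE_FIELDS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    for field, allowed in ALLOWED_VALUES.items():
        value = options.get(field)
        if value is not None and value not in allowed:
            raise ValueError(f"{field} must be one of {sorted(allowed)}")
    payload: Dict[str, Any] = {"video_url": video_url}
    payload.update({k: v for k, v in options.items() if v is not None})
    return payload


# Minimal request: only the required field (placeholder URL).
minimal = build_payload("https://example.com/clip.mp4")

# Fuller request: a faster model, a color map, and a side-by-side output.
full = build_payload(
    "https://example.com/clip.mp4",
    depth_model="VDA-Small",
    colormap="turbo",
    resolution="720p",
    side_by_side=True,
)
```

Leaving max_frames and output_fps out of the payload matches the table's guidance to leave them unset when all frames and the input frame rate should be used.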