Depth Anything Video

Try Depth Anything Video in the Workbench

Run this model interactively, tune parameters, and compare outputs.

Model ID: bytedance-depth-anything-video Estimate temporally consistent depth maps from video using Video Depth Anything.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.

Sync
Async
Async with SSE

This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.

Minimal
All parameters

curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "bytedance-depth-anything-video",
  "video_url": "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/winter_summer_ox.mp4"
}'

curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "bytedance-depth-anything-video",
  "video_url": "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/winter_summer_ox.mp4",
  "depth_model": "VDA-Large",
  "colormap": "grayscale",
  "resolution": "auto",
  "side_by_side": false,
  "include_raw_depths": false
}'

See the async queue reference for more details.

Minimal
All parameters

# Enqueue, capture the generation id.
GEN_ID=$(curl -s -X POST https://hub.oxen.ai/api/ai/queue \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "bytedance-depth-anything-video",
  "video_url": "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/winter_summer_ox.mp4"
}' | jq -r '.generations[0].generation_id')

# Poll until the generation reaches a terminal status.
while true; do
  STATUS=$(curl -s -H "Authorization: Bearer $OXEN_API_KEY" \
    "https://hub.oxen.ai/api/ai/queue/$GEN_ID" | jq -r '.status')
  echo "Status: $STATUS"
  case $STATUS in succeeded|failed|cancelled) break;; esac
  sleep 5
done

# Print the result.
curl -s -H "Authorization: Bearer $OXEN_API_KEY" \
  "https://hub.oxen.ai/api/ai/queue/$GEN_ID" | jq .

# Enqueue, capture the generation id.
GEN_ID=$(curl -s -X POST https://hub.oxen.ai/api/ai/queue \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "bytedance-depth-anything-video",
  "video_url": "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/winter_summer_ox.mp4",
  "depth_model": "VDA-Large",
  "colormap": "grayscale",
  "resolution": "auto",
  "side_by_side": false,
  "include_raw_depths": false
}' | jq -r '.generations[0].generation_id')

# Poll until the generation reaches a terminal status.
while true; do
  STATUS=$(curl -s -H "Authorization: Bearer $OXEN_API_KEY" \
    "https://hub.oxen.ai/api/ai/queue/$GEN_ID" | jq -r '.status')
  echo "Status: $STATUS"
  case $STATUS in succeeded|failed|cancelled) break;; esac
  sleep 5
done

# Print the result.
curl -s -H "Authorization: Bearer $OXEN_API_KEY" \
  "https://hub.oxen.ai/api/ai/queue/$GEN_ID" | jq .

See the async queue reference for more details.

Minimal
All parameters

# Enqueue, capture the generation id.
GEN_ID=$(curl -s -X POST https://hub.oxen.ai/api/ai/queue \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "bytedance-depth-anything-video",
  "video_url": "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/winter_summer_ox.mp4"
}' | jq -r '.generations[0].generation_id')

# Stream the SSE channel, grab the data line that follows a
# media_generation_completed event for our id, and pretty-print it.
curl -sN -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/events \
  | awk -v id="$GEN_ID" '
    /^event: media_generation_completed$/ { expect=1; next }
    /^data: / && expect {
      payload = substr($0, 7)
      if (index(payload, "\"generation_id\":\"" id "\"")) { print payload; exit }
      expect = 0
    }
  ' | jq .

# Enqueue, capture the generation id.
GEN_ID=$(curl -s -X POST https://hub.oxen.ai/api/ai/queue \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "bytedance-depth-anything-video",
  "video_url": "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/winter_summer_ox.mp4",
  "depth_model": "VDA-Large",
  "colormap": "grayscale",
  "resolution": "auto",
  "side_by_side": false,
  "include_raw_depths": false
}' | jq -r '.generations[0].generation_id')

# Stream the SSE channel, grab the data line that follows a
# media_generation_completed event for our id, and pretty-print it.
curl -sN -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/events \
  | awk -v id="$GEN_ID" '
    /^event: media_generation_completed$/ { expect=1; next }
    /^data: / && expect {
      payload = substr($0, 7)
      if (index(payload, "\"generation_id\":\"" id "\"")) { print payload; exit }
      expect = 0
    }
  ' | jq .

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.

curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/bytedance-depth-anything-video

Request parameters

Required parameters

Field	Type	Default	Description
`video_url`	`string`	—	URL of the input video to estimate depth for. Format: uri.

Optional parameters

Field	Type	Default	Description
`depth_model`	`string`	`"VDA-Large"`	Depth estimation model size. VDA-Large = best quality, VDA-Small = fastest. One of: VDA-Small, VDA-Base, VDA-Large.
`colormap`	`string`	`"grayscale"`	Colormap for depth visualization. One of: grayscale, turbo, inferno, magma, viridis.
`resolution`	`string`	`"auto"`	Output resolution. Auto preserves input resolution up to 1080p. One of: auto, 360p, 480p, 720p, 1080p.
`max_frames`	`integer`	—	Maximum number of frames to process. Leave unset to process all frames.
`output_fps`	`number`	—	Output video FPS. Leave unset to use the input frame rate.
`side_by_side`	`boolean`	`false`	If true, output a side-by-side original and depth comparison video.
`include_raw_depths`	`boolean`	`false`	If true, exports raw float32 depths as an NPZ file.

Inference API

Documentation Index

Try Depth Anything Video in the Workbench

​Example request

​Fetch model details

​Request parameters

​Required parameters

​Optional parameters

Example request

Fetch model details

Request parameters

Required parameters

Optional parameters