Skip to main content

Try Segment Anything 3 - Video in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: sam-3-video Segment Anything 3 - Video is a vision model designed for video object segmentation and tracking. It excels in detecting, segmenting, and tracking objects across video frames using text prompts, points, boxes, masks, or exemplars, with memory mechanisms that propagate predictions while handling occlusions and re-appearances. Some other noteworthy features of Segment Anything 3 - Video include real-time streaming processing, interactive refinement across frames, and support for concept-driven detection in complex scenes.
MetricValue
Parameter CountUnknown
Mixture of ExpertsUnknown
Context LengthUnknown
MultilingualNo
Quantized*Unknown
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
curl -X POST https://hub.oxen.ai/api/ai/videos/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "sam-3-video",
  "input_video": "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/winter_summer_ox.mp4",
  "prompt": "the ox horns"
}'

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/sam-3-video

Request parameters

Required parameters

FieldTypeDefaultDescription
input_videostring"https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/winter_summer_ox.mp4"Video to use as reference. Format: uri.
promptstring"the ox horns"Text description of what you want to segment out of the video.

Optional parameters

FieldTypeDefaultDescription
apply_maskbooleantrueApply the mask on the image.