Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.oxen.ai/llms.txt

Use this file to discover all available pages before exploring further.

Try DeepSeek V4 Flash in the Workbench

Run this model interactively, tune parameters, and compare outputs.
Model ID: deepseek-v4-flash DeepSeek V4 Flash is the efficiency variant of the V4 family. With 284B total parameters and 13B active per token, it targets fast, cheap inference while keeping the family’s strong reasoning and tool-use behaviour. Pairs naturally with V4 Pro: route the hard prompts to Pro, everyday traffic to Flash.
MetricValue
Parameter Count284 billion (13 billion active)
Mixture of ExpertsYes
Active Parameter Count13 billion
Context Length1,048,576 tokens
MultilingualYes
Tool UseYes
Structured OutputsYes

Example request

Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
curl -X POST https://hub.oxen.ai/api/ai/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
  "model": "deepseek-v4-flash",
  "messages": [
    {
      "role": "user",
      "content": "Hello, what can you do?"
    }
  ]
}'

Fetch model details

The models endpoint returns the full model object, including its json_request_schema.
curl -H "Authorization: Bearer $OXEN_API_KEY" https://hub.oxen.ai/api/ai/models/deepseek-v4-flash

Request parameters

This model follows the standard OpenAI chat completions request body. See the chat completions reference for the full parameter list.