Deploy a checkpoint from a fine-tune job
Deploy a specific training checkpoint to make it available for inference. Pass retry=true to tear down any existing deployment for this checkpoint before creating a new one.
Documentation Index
Fetch the complete documentation index at: https://docs.oxen.ai/llms.txt
Use this file to discover all available pages before exploring further.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Query Parameters
If true, cleanup the existing deployment and model for this checkpoint before creating a new one.
Body
Deployment configuration
Request payload to deploy a specific checkpoint from a fine-tune job.
Deployment type: 'fast' for faster inference instances, 'normal' for standard instances.
fast, normal Response
Deployment created response
Standard wrapper for checkpoint deployment responses.
ID of the created Baseten deployment.
Fine-tune job the deployed checkpoint belongs to.
Model record created for this checkpoint deployment.
High-level status string (for example, 'success').
Human-readable status message (for example, 'resource_found').