Oxen.ai gives you the tools and infrastructure to build the best model for your use case.

The platform makes it easy to spin up serverless GPU infrastructure to train models or run inference. When you are done, the datasets, models weights, and code are all versioned and stored in an Oxen.ai repository for easy iteration, comparison, and collaboration.

βœ… Features

Whether you are iterating on your prompts while using a closed source model or fine-tuning an open source model, Oxen.ai gives you the tools to focus on what matters most: building great models.

  • ⚑️ Inference - Quickly iterate on prompts and models
  • πŸš€ Fine-Tuning - Go from dataset to deployable model in a few clicks
  • πŸ“Š Datasets - Build datasets for training, fine-tuning, or evaluating models
  • πŸ”¬ Evaluation - Find the best model and prompt given your dataset and use case
  • πŸ““ Notebooks - Write custom code to interact with your datasets and models
  • πŸ’Ύ Version Control - Sync your datasets, model weights, and code with a collaborative hub

With Oxen’s open source data version control system, you have access to the raw model weights, datasets, and code through the command line interface, python library or HTTP API. This makes it seamless to integrate prompts, models, and datasets into your existing workflows.

⚑ Quickly Iterate on Models

Whether you are making your first LLM call or need to deploy a fine-tuned model, Oxen.ai gives you the flexibility to swap models through a unified Model API. The interface is OpenAI compatible and supports foundation models from Anthropic, Google, Meta, and OpenAI. See the list of supported models to get started.

Closed source models not working for your use case? Fine-tune your own model, optimizing it for accuracy, speed, or cost, and deploy it to the same interface in minutes.

πŸš€ Fine-Tune Models

The best models are the ones that understand your context and continue to learn from your data over time.

Go from dataset to model in a few clicks with Oxen.ai’s fine-tuning tooling. Select a dataset, define your inputs and outputs, and let Oxen.ai do the grunt work. Oxen saves model weights to it’s version store tying model weights to the dataset and code that was used to train them.

Once the model has been fine-tuned, you can easily deploy the model behind an inference endpoint and start the evaluation loop over again.

πŸ“Š Build Datasets

Quality datasets are the difference between prototypes and production models. Collaborate on multi-modal datasets used for training, fine-tuning, or evaluating models. Backed by Oxen.ai’s version control, you’ll never worry about remembering what data a model was trained or evaluated on.

Learn how to interface with datasets in the Oxen Python Library or more about supported dataset types and formats here.

πŸ”¬ Evaluate Models

Find the best model and prompt for your use case. Leverage your own datasets to build custom evaluations. Evaluation results are versioned and saved as datasets in the repository for easy performance tracking over time.

πŸ““ Write Notebooks

Spin up a Marimo Notebook on a GPU or CPU in seconds. Use the Oxen Python Library to interact with your datasets and models. Write custom code to process your data, visualize distributions, compute metrics, and even train models.

πŸ’Ύ Version Control

The through line of Oxen.ai is that all model weights, datasets, and code are versioned and can be stored in a single repository. This makes it easy to track changes, compare models, and share datasets with your team. You can interact with the repository through the command line interface, python library, or web interface.

We built the version control system to be blazing fast, open source, and extensible for anyone to build upon. It can be used to version any type of data, not just machine learning datasets. It scales up to monorepos with millions of files and terabytes of data.

🌾 Why Build Oxen?

Oxen was built by a team of machine learning engineers, who have spent countless hours in their careers managing datasets and training models. We have used many different tools, but none of them were as easy to use and as ergonomic as we would like.

Production grade AI applications are constantly juggling models, datasets, and code, and it’s easy to get lost. If you have every been stuck syncing massive model weights, datasets, or aligning them with the code that was used to train them, we feel your pain.

Oxen is the tool we wish we had.

πŸ‚ Why the name Oxen?

β€œOxen” comes from the fact that we will plow, maintain, and version your data like a good farmer tends to their fields 🌾. During the agricultural revolution, the plow and offloading work to Oxen helped people specialize and start working on other important societal tasks. Let Oxen take care of the grunt work of your infrastructure so you can focus on solving the higher-level problems that matter to your product.