Oxen.ai home page
Search or ask...
Support
Sign Up
Sign Up
Search...
Navigation
📝 Notebooks
⚖️ LLM as a Judge Evaluation
Documentation
Python API
HTTP API
Documentation
Blog
GitHub
Get Started
🐂 What is Oxen?
⚒️ Installation
🐮 Learn The Basics
💻 Command Line Interface
🐍 Python
📡 Oxen Server
Key Features
🔥 Performance
🔎 Explore Datasets
🏷️ Labeling Workflows
🤝 Collaboration
📝 Notebooks
🍃 Marimo Notebooks
🗺️ Explore, Process, and Version Data
🏷️ Build a Custom Labeling Tool
🔎 Compute Text Embeddings
🧪 Generate Synthetic Datasets
🏋️♀️ Fine-Tune an LLM
⚖️ LLM as a Judge Evaluation
🕵️♂️ Evaluation w/ Human in the Loop
🚀 Model Inference
Other Concepts
Dataset Diffs
Comparing Models
Schemas
Remote Repositories
Workspaces
🐂 Feature Updates
📝 Notebooks
⚖️ LLM as a Judge Evaluation
How to build a LLM as a judge evaluation workflow.
Coming soon!
🏋️♀️ Fine-Tune an LLM
🕵️♂️ Evaluation w/ Human in the Loop