⚖️ LLM as a Judge Evaluation
How to build an LLM-as-a-judge evaluation workflow.
Coming soon!