    ⚖️ LLM as a Judge Evaluation

    How to build an LLM-as-a-judge evaluation workflow.

    Coming soon!
