
    ⚖️ LLM as a Judge Evaluation

    How to build an LLM-as-a-judge evaluation workflow.

    Coming soon!
