Agent as Judge

On this page

Create a Test Suite
Update config.json
Create a Test in a Suite
Update start.py
Run a Test Suite

Lytix supports evaluating agentic workflows. This means not only do we evaluate the input/output of the flow, but also can pass in a data source (e.g. repository) to further evaluate the output.

🚨 Prerequisite Login & setup your lytix CLI here.

Create a Test Suite

The first step is to create a test suite. This is a group of tests that will be run together.

lytix agent-test create-suite --suiteName "suite0"

Update `config.json`

After creating the test suite, you’ll need to update the config.json file to define what repository you’d like to evaluate.

{
  "repository": {
    "remote": "github",
    "branch": "main",
    "repository": "Lytix-Labs/optimodel"
  }
}

Note We currently only support public GitHub repositories. Please reach out to support@lytix.com if you’d like to evaluate a private repository.

Create a Test in a Suite

lytix agent-test add-test --testName "test0" --suiteName "suite0"

Update `start.py`

To collect test data, we want to allow full flexibility. Thus, we create a start.py file in the folder of the test. This file will be executed to collect the data. The only requirement is that the start.py file prints the following JSON object to stdout:

{
  "messages": ["..."],
  "output": "...",
  "sources": ["..."]
}

Where messages is an array of {role: "user" | "assistant", content: "..."} and sources is an array of file paths that contain the data we want to evaluate the output against.

Note Currently we only support a single user message.

Run a Test Suite

lytix agent-test run --suiteName "suite0"

Manually Importing Events User Overview

Get Started

API Keys

Proxy

Image & Video

Async Logging

Testing

Users

Metadata

Workflows

Datasets

Caching

Playground

Alerts

Bot Evaluation

Prompts

Integrations

Guardrails

Custom Errors

Custom LLM Tracing

CLI

Agent as Judge

Create a Test Suite

Update `config.json`

Create a Test in a Suite

Update `start.py`

Run a Test Suite

Get Started

API Keys

Proxy

Image & Video

Async Logging

Testing

Users

Metadata

Workflows

Datasets

Caching

Playground

Alerts

Bot Evaluation

Prompts

Integrations

Guardrails

Custom Errors

Custom LLM Tracing

CLI

​Create a Test Suite

​Update config.json

​Create a Test in a Suite

​Update start.py

​Run a Test Suite

Create a Test Suite

Update `config.json`

Create a Test in a Suite

Update `start.py`

Run a Test Suite