Quick Start

Follow these steps to create and run your first evaluation.

1. Install AgentV plugin

npx allagents plugin marketplace add EntityProcess/agentv
npx allagents plugin install agentv-dev@agentv

2. Ask Claude to bootstrap AgentV in this repo

Set up AgentV in this repo.

The onboarding skill ensures CLI/setup prerequisites and runs:

agentv init

3. Configure environment variables

The init command creates a .env.example file in your project root.

Copy .env.example to .env
Fill in your API keys, endpoints, and other configuration values
Update the environment variable names in .agentv/targets.yaml to match those defined in your .env file

4. Create an eval

Create ./evals/example.yaml:

description: Math problem solving evaluation
execution:
  target: default

tests:
  - id: addition
    criteria: Correctly calculates 15 + 27 = 42

    input: What is 15 + 27?

    expected_output: "42"

    assert:
      - name: math_check
        type: code-judge
        command: [./validators/check_math.py]

5. Run the eval

agentv eval ./evals/example.yaml

Results appear in .agentv/results/eval_<timestamp>.jsonl with scores, reasoning, and execution traces.

Next Steps

Learn about eval file formats
Configure targets for different providers
Create custom evaluators
If setup drifts, rerun: agentv init