Skip to content

Quick Start

Follow these steps to create and run your first evaluation.

Terminal window
npx allagents plugin marketplace add EntityProcess/agentv
npx allagents plugin install agentv-dev@agentv

2. Ask Claude to bootstrap AgentV in this repo

Section titled “2. Ask Claude to bootstrap AgentV in this repo”
Set up AgentV in this repo.

The onboarding skill ensures CLI/setup prerequisites and runs:

Terminal window
agentv init

The init command creates a .env.example file in your project root.

  1. Copy .env.example to .env
  2. Fill in your API keys, endpoints, and other configuration values
  3. Update the environment variable names in .agentv/targets.yaml to match those defined in your .env file

Create ./evals/example.yaml:

description: Math problem solving evaluation
execution:
target: default
tests:
- id: addition
criteria: Correctly calculates 15 + 27 = 42
input: What is 15 + 27?
expected_output: "42"
assert:
- name: math_check
type: code-judge
command: [./validators/check_math.py]
Terminal window
agentv eval ./evals/example.yaml

Results appear in .agentv/results/eval_<timestamp>.jsonl with scores, reasoning, and execution traces.