This example shows an accuracy evaluation that runs the provided agent on the provided input, then scores the agent's answer against the expected output.
Create a Python file named accuracy_with_tools.py
from typing import Optional

from agno.agent import Agent
from agno.eval.accuracy import AccuracyEval, AccuracyResult
from agno.models.openai import OpenAIResponses
from agno.tools.calculator import CalculatorTools

evaluation = AccuracyEval(
    name="Tools Evaluation",
    model=OpenAIResponses(id="gpt-5.2"),
    agent=Agent(
        model=OpenAIResponses(id="gpt-5.2"),
        tools=[CalculatorTools()],
    ),
    input="What is 10!?",
    expected_output="3628800",
)

result: Optional[AccuracyResult] = evaluation.run(print_results=True)
assert result is not None and result.avg_score >= 8
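The expected_output value in the evaluation can be sanity-checked without calling any model. The following is a minimal sketch using only the Python standard library; the variable name `expected` is illustrative and not part of Agno.

```python
import math

# Compute 10! locally to confirm the string passed as expected_output.
expected = str(math.factorial(10))
print(expected)  # 3628800
```

Checking the reference answer this way helps rule out a wrong expected value as the cause of a low score.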
Set up your virtual environment
uv venv --python 3.12
source .venv/bin/activate
Install dependencies
uv pip install -U openai agno
Export your OpenAI API key
export OPENAI_API_KEY="your_openai_api_key_here"
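Before running the script, it can help to confirm that the exported key is visible to Python. This is a small sketch, assuming the OpenAI client reads the OPENAI_API_KEY environment variable as exported above; the helper name `api_key_is_set` is hypothetical and not part of Agno or the OpenAI SDK.

```python
import os
from typing import Mapping


def api_key_is_set(env: Mapping[str, str] = os.environ) -> bool:
    """Return True if OPENAI_API_KEY is present and non-empty in env."""
    # Treat a missing key or a whitespace-only value as "not set".
    return bool(env.get("OPENAI_API_KEY", "").strip())


if __name__ == "__main__":
    print("OPENAI_API_KEY set:", api_key_is_set())
```

The check only reports whether the variable is present; it does not verify that the key is valid.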
Run the evaluation
python accuracy_with_tools.py