analyze
command provides a detailed breakdown of your agent evaluation results, highlighting where the agent succeeded, failed, and why.
The analyze command generates an overview analysis for each dataset result in the specified directory. It helps you quickly identify:
--data-path
: Directory where your evaluation results are saved.analyze
on the evaluation results of a dataset, such as examples/evaluations/hr_sample/data_simple.json
, produces an output like the following:
analyze
.