Run Evaluation

Compare multiple models against a dataset with scoring.

Multi-Model Evaluation
Select one or more models to evaluate against a dataset.

No models configured. Add models in the Models tab.

Setup Status
Configure all components to run comprehensive evaluations.

Models

0 models configured

Not configured

Datasets

0 datasets available

Not configured

Scorers

0 scorers configured

Not configured

Complete the setup by configuring 3 more components to run evaluations.