Run Evaluation
Compare multiple models against a dataset with scoring.
Multi-Model Evaluation
Select one or more models to evaluate against a dataset.
No models configured. Add models in the Models tab.
Setup Status
Configure all components to run comprehensive evaluations.
Models
0 models configured
Datasets
0 datasets available
Scorers
0 scorers configured
Complete the setup by configuring 3 more components to run evaluations.