View evaluation

Overview

In H2O Eval Studio, you can view evaluations in interactive dashboards that compare LLMs based on metrics calculated by one or more evaluators. From an evaluation's dashboard, you can explore visualizations of the results, obtain an HTML report, and download a zip archive with the evaluation results.

Instructions

To view an evaluation, follow these steps:

  1. In the left navigation menu, click Evaluations.
  2. On the Evaluations page, click the name of the evaluation you want to view. The evaluation's dashboard appears.

The evaluation dashboard

The first tab of the page, Evaluation, features an interactive visualization of evaluator result metrics called the Evaluation Eye. Dashboard visualizations can help you understand the evaluation results for a given metric and the LLMs being compared.

The evaluation leaderboard

The Leaderboards tab compares LLMs based on metrics calculated by a single evaluator. For more information on evaluators in H2O Eval Studio, see Evaluators.

The Leaderboards tab displays evaluator result metrics as the following visualizations:

  • Radar plot (in cases where more than one metric is produced by the evaluator)
  • Bar chart (in cases where there are three or fewer metrics produced by the evaluator)
  • Heatmap

Leaderboard visualizations can help you understand the evaluation results for a given metric and the LLMs being compared.
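The visualization rules listed above can be sketched as a small Python function. This is only an illustration of the conditions as stated on this page: the function name `leaderboard_visualizations` is hypothetical and is not part of H2O Eval Studio, and the check that an evaluator produces at least one metric is an assumption.

```python
def leaderboard_visualizations(metric_count: int) -> list[str]:
    """Return the visualizations listed on this page for a given metric count.

    Hypothetical helper that mirrors the documented rules; it is not an
    H2O Eval Studio API.
    """
    # Assumption: an evaluator always produces at least one metric.
    if metric_count < 1:
        raise ValueError("metric_count must be at least 1")

    charts = []
    if metric_count > 1:
        # Radar plot: more than one metric produced by the evaluator.
        charts.append("radar plot")
    if metric_count <= 3:
        # Bar chart: three or fewer metrics produced by the evaluator.
        charts.append("bar chart")
    # Heatmap: listed without any condition.
    charts.append("heatmap")
    return charts


if __name__ == "__main__":
    for count in (1, 2, 4):
        print(count, leaderboard_visualizations(count))
```

Under these rules, an evaluator that produces two or three metrics would be shown with all three visualizations.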

Obtain an HTML report of a dashboard

To view an HTML report of a dashboard, click the Show Report button. This report provides in-depth information about potential problems with the model, evaluation parameters, the evaluated models, and more.

Download a zip archive with evaluation results

To download a zip archive with evaluation results, click the Download Report button.
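Once the archive is downloaded, you may want to inspect it outside the browser. The following is a minimal sketch that uses only Python's standard `zipfile` module. The file name `evaluation_results.zip` and the output directory are hypothetical, and this page does not describe the layout of the archive, so the sketch only lists and extracts whatever files it contains.

```python
import zipfile
from pathlib import Path


def list_archive_contents(archive_path):
    """Return the sorted names of all entries stored in a zip archive."""
    with zipfile.ZipFile(archive_path) as archive:
        return sorted(archive.namelist())


def extract_archive(archive_path, target_dir):
    """Extract the archive into target_dir and return that directory as a Path."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(target)
    return target


if __name__ == "__main__":
    # Hypothetical file name; use the name of the archive you downloaded.
    archive_name = "evaluation_results.zip"
    if zipfile.is_zipfile(archive_name):
        for entry in list_archive_contents(archive_name):
            print(entry)
        extract_archive(archive_name, "evaluation_results")
    else:
        print(f"{archive_name} was not found or is not a zip archive")
```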

