Dashboard Overview¶

The Dashboard page is the main view for analyzing evaluation results. It displays session data with metrics, charts, and test results.

Dashboard Overview

Session Selection¶

At the top of the page you'll find:

Session dropdown — select from cached evaluation sessions or view the latest results
Refresh button — reload data from the cache
Clear Cache button — remove all cached session data

Tip

Use descriptive session_name values when running evaluations to make sessions easy to find:

results = await evaluate(
    test_cases=test_cases,
    metrics=metrics,
    show_dashboard=True,
    session_name="rag-v2.1-regression"
)

Three summary cards show key statistics:

Card	Description
Total Tests	Number of test cases in the session
Total Cost	Combined API cost for all metric evaluations
Metrics	Number of metrics evaluated

Each metric is displayed as a card with:

Two charts provide visual analysis:

Average Scores by Metric — bar chart comparing average scores across all metrics
Pass/Fail by Metric — stacked bar chart showing pass (green) and fail (red) counts per metric

The bottom section lists all test cases in a table:

Column	Description
#	Test case number
Status	`PASSED` (green) or `FAILED` (red) badge
Input	The input query/prompt
Actual Output	The model's response (truncated)
Details	Button to open the detailed test case view

Click Details on any row to open the Test Case Details modal.