Skip to content
GitHubDiscord

Overview

A dataset is a collection of conversations used to evaluate your agents. We allow manual test creation for fine-grained control, but since generative AI agents can encounter an infinite number of test cases, automated test case generation is often necessary, especially when you don’t have any test conversations to import.

In this section, we will walk you through how to create test cases and datasets using the Hub interface. In general, we cover four different ways to create datasets:

graph LR
    A[Create Dataset] --> B{Source}
    B --> C([<a href="/hub/ui/datasets/manual" target="_self">Create Manually</a>])
    B --> D([<a href="/hub/ui/datasets/import" target="_self">Import Existing</a>])
    B --> E([<a href="/hub/ui/datasets/knowledge-base" target="_self">Knowledge Base Tests</a>])
    B --> F([<a href="/hub/ui/datasets/scenario" target="_self">Scenario Tests</a>])
    B --> G([<a href="/hub/ui/scan" target="_self">From Scan</a>])
    C --> H[<a href="/hub/ui/annotate" target="_self">Review Test Cases</a>]
    D --> H
    E --> H
    F --> H
    G --> H