Skip to content
GitHubDiscord

Overview

The annotation workflow in Giskard Hub enables you to continuously improve your test cases and evaluation metrics through an iterative, collaborative process.

Each test case is composed of a conversation and its associated evaluation parameters (e.g., an expected answer, rules that the agent must respect, etc.).

The annotation workflow follows a task-oriented approach with two distinct personas and workflows:

  1. Distribute tasks - Organize your review work by creating and assigning tasks to team members
  2. Review test results - Business workflow for reviewing evaluation results and understanding failures
  3. Modify test cases - Product owner workflow for refining test cases and validation rules

This section guides you through the complete task-oriented workflow from task distribution to test case refinement.

The annotation workflow involves two personas with distinct workflows:

Business Persona (Review Workflow):

  • Reviews test results from evaluation runs or tasks
  • Understands check results and failure reasons
  • Reviews conversation flow and metadata
  • Takes action: closes tasks if results are acceptable, or assigns modification work

Product Owner Persona (Modification Workflow):

  • Modifies test cases based on review feedback
  • Drafts/undrafts test cases
  • Enables/disables checks
  • Modifies check requirements
  • Validates checks and structures test cases

Now that you understand the task-oriented annotation workflow, explore the specific workflows:

  • Start with task distribution - Learn how to create and manage tasks to organize your review work task management
  • Review test results - Follow the business workflow to review evaluation results review test results
  • Modify test cases - Follow the product owner workflow to refine test cases and checks modify test cases