Introduction

The Giskard Hub Python SDK (giskard-hub) gives you programmatic access to everything the Hub can do: define agents, build evaluation datasets, run evaluations and vulnerability scans, schedule recurring runs, and manage your entire quality workflow from code.

pip install giskard-hub

from giskard_hub import HubClient

hub = HubClient()  # reads GISKARD_HUB_BASE_URL and GISKARD_HUB_API_KEY from env

project = hub.projects.list()[0]
evaluation = hub.evaluations.create(
    agent_id="<agent-id>",
    dataset_id="<dataset-id>",
    project_id=project.id,
    name="My first evaluation",
)

Where to start

Quickstart Install the SDK, set up authentication, and run your first evaluation in minutes.

Core Concepts Understand how Projects, Agents, Datasets, Evaluations, and Scans fit together.

How-to Guides Step-by-step guides for every feature: datasets, evaluations, scans, scheduling, and more.

API Reference Full reference for every resource, method, and type exported by the SDK.

Key capabilities

Evaluate — run your LLM agent against datasets of test cases with configurable checks (LLM judge, embedding similarity, rule-based)
Scan — automated red-teaming covering the OWASP LLM Top 10 and additional threat categories
Generate — auto-generate test cases from scenarios, knowledge bases, or playground conversations
Schedule — set up recurring evaluations (daily, weekly, monthly) for continuous quality monitoring
Track — create tasks from failed results, annotate test cases with comments, and audit every change

Migrating from v2?

If you’re upgrading from v2.x, see the Migration Guide for a complete list of breaking changes and before/after code examples.

Need help?

Documentation: Browse the How-to Guides for step-by-step walkthroughs of every feature
Examples: Check our GitHub repository for runnable examples and notebooks
Community: Join our Discord for support and discussions
Enterprise: Need on-premise deployment or dedicated support? Contact us