Introduction
The Giskard Hub Python SDK (giskard-hub) gives you programmatic access to everything the Hub can do: define agents, build evaluation datasets, run evaluations and vulnerability scans, schedule recurring runs, and manage your entire quality workflow from code.
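The quick example on this page builds its client from two environment variables, `GISKARD_HUB_BASE_URL` and `GISKARD_HUB_API_KEY`. A small pre-flight check can surface a missing variable before any request is made. The sketch below is not part of the SDK: `missing_hub_env` and `REQUIRED_ENV_VARS` are hypothetical names used for illustration, and only the two variable names come from this page.

```python
import os
from typing import List, Mapping, Optional

# Variable names taken from the quick example on this page.
REQUIRED_ENV_VARS = ("GISKARD_HUB_BASE_URL", "GISKARD_HUB_API_KEY")


def missing_hub_env(env: Optional[Mapping[str, str]] = None) -> List[str]:
    """Return the required variable names that are unset or blank.

    Hypothetical helper, not an SDK function. Pass a mapping to check
    something other than the real process environment.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_ENV_VARS if not env.get(name, "").strip()]


if __name__ == "__main__":
    missing = missing_hub_env()
    if missing:
        print("Set these variables before creating HubClient: " + ", ".join(missing))
    else:
        print("Hub environment looks complete.")
```

Taking the environment as an optional argument keeps the check easy to test without touching the real process environment.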
pip install giskard-hub

from giskard_hub import HubClient

hub = HubClient()  # reads GISKARD_HUB_BASE_URL and GISKARD_HUB_API_KEY from env

project = hub.projects.list()[0]
evaluation = hub.evaluations.create(
    agent_id="<agent-id>",
    dataset_id="<dataset-id>",
    project_id=project.id,
    name="My first evaluation",
)

Where to start
- Quickstart: Install the SDK, set up authentication, and run your first evaluation in minutes.
- Core Concepts: Understand how Projects, Agents, Datasets, Evaluations, and Scans fit together.
- How-to Guides: Step-by-step guides for every feature: datasets, evaluations, scans, scheduling, and more.
- API Reference: Full reference for every resource, method, and type exported by the SDK.
Key capabilities
- Evaluate: run your LLM agent against datasets of test cases with configurable checks (LLM judge, embedding similarity, rule-based)
- Scan: automated red-teaming covering the OWASP LLM Top 10 and additional threat categories
- Generate: auto-generate test cases from scenarios, knowledge bases, or playground conversations
- Schedule: set up recurring evaluations (daily, weekly, monthly) for continuous quality monitoring
- Track: create tasks from failed results, annotate test cases with comments, and audit every change
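To make two of the check types named above more concrete, here is a generic sketch of what a rule-based check and an embedding-similarity score compute. It is plain Python and does not use the SDK. The function names and parameters are hypothetical, and the Hub's own checks may be defined and configured differently.

```python
import math
from typing import Sequence


def rule_based_check(
    answer: str,
    must_contain: Sequence[str] = (),
    must_not_contain: Sequence[str] = (),
) -> bool:
    """Pass only if every required phrase appears and no forbidden phrase does.

    Illustrative only: matching is a case-insensitive substring test.
    """
    text = answer.lower()
    has_required = all(phrase.lower() in text for phrase in must_contain)
    has_forbidden = any(phrase.lower() in text for phrase in must_not_contain)
    return has_required and not has_forbidden


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    answer = "Refunds are accepted within 30 days."
    print(rule_based_check(answer, must_contain=["30 days"], must_not_contain=["no refunds"]))
    # Toy vectors standing in for real embeddings of an answer and a reference.
    print(cosine_similarity([1.0, 0.0], [1.0, 0.0]))
```

In an embedding-similarity check, the score would typically be compared against a threshold chosen for the use case. An LLM-judge check has no simple closed form like these two, so it is left out of the sketch.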
Migrating from v2?
If you’re upgrading from v2.x, see the Migration Guide for a complete list of breaking changes and before/after code examples.
Need help?
- Documentation: Browse the How-to Guides for step-by-step walkthroughs of every feature
- Examples: Check our GitHub repository for runnable examples and notebooks
- Community: Join our Discord for support and discussions
- Enterprise: Need on-premise deployment or dedicated support? Contact us