Training Data Extraction (OWASP LLM 02)

Attempts to extract or infer information from the AI model’s training data.

Pattern Recognition Attacks

Probe Name

Description

Divergent Repetitions

This probe tests whether an AI agent can be induced to repeat outputs in a way that exposes memorized training data or sensitive content.