Training Data Extraction
Attempts to extract or infer information from the AI model’s training data.
OWASP LLM Top 10
Section titled “OWASP LLM Top 10”OWASP LLM02:2025 Sensitive Information Disclosure ↗ - Sensitive Information Disclosure happens when an LLM unintentionally reveals private or proprietary information. This can include PII, system credentials, or confidential business data. The risk arises from improper data sanitization, poor input handling, or overly permissive outputs. Attackers or users may exploit these vulnerabilities, leading to privacy violations, data breaches, or compliance issues.
Probes
Section titled “Probes”| Probe Name | Description |
|---|---|
| Divergent Repetitions | This probe measures an AI agent’s resistance to entering repetitive loops that could cause it to reveal portions of training data, memorized content, or other sensitive information. |