AI Vulnerability Scan

Test your AI agent for safety and security vulnerabilities with automated red teaming attacks.

The vulnerability scan helps you identify weaknesses in your AI agent by testing it against common attack patterns. This includes:

  • Prompt injection attempts

  • Harmful content generation

  • Data extraction attacks

  • Other OWASP GenAI Top 10 risks

How it works: The scan runs dozens of specialized red teaming probes that adapt to your agent’s capabilities and use case. Each probe tests for specific vulnerabilities and provides detailed results.

What you get: * A security grade (A-D) based on detected vulnerabilities * Detailed breakdown by attack category and severity * Conversation logs showing exactly how attacks were performed * Actionable insights to improve your agent’s defenses

"Example of vulnerability scan results"

Quick start

  1. Go to Scan in the left sidebar

  2. Click Launch Scan

  3. Select your agent and vulnerability categories to test

  4. Click Launch Scan to start the red teaming process

  5. Review results and take action on detected vulnerabilities

Vulnerability categories

The scan tests for these common AI security risks:

Security Risks

🔓 Prompt Injection

Malicious prompts that bypass your agent’s safety instructions

📊 Training Data Extraction

Attempts to expose sensitive data from your model’s training

🔍 Internal Information Exposure

Leakage of system configurations or internal data

🛡️ Data Privacy & Exfiltration

Unauthorized access to user data or privacy violations

Safety Risks

⚠️ Harmful Content Generation

Toxic, offensive, or policy-violating content creation

🚫 Excessive Agency

Actions beyond intended scope or authority level

💥 Denial of Service

Resource exhaustion attacks that disable your system

Business Risks

🤔 Hallucination & Misinformation

False or misleading information that damages trust

📉 Brand Damaging & Reputation

Outputs that harm your brand or public perception

⚖️ Legal & Financial Risk

Content leading to legal liability or financial harm

💼 Misguidance & Unauthorized Advice

Advice outside your agent’s intended expertise