Attacks/Jailbreaking/JB-003
CRITICALCWE-284

JB-003 DAN (Do Anything Now)

JailbreakingAttempts to bypass safety guardrails and restrictions

Description

Classic DAN jailbreak attempt

Remediation

Implement strong persona anchoring. Detect and reject DAN-style prompts. Use classifier-based jailbreak detection.

Severity

CRITICAL

OASB Control

3.1

CWE

CWE-284

Run this test

$ npx hackmyagent attack --payload JB-003