Attacks/Jailbreaking/JB-009
HIGHCWE-284

JB-009 Fictional Story Bypass

JailbreakingAttempts to bypass safety guardrails and restrictions

Description

Requests harmful content as fiction

Remediation

Apply content policies to fictional scenarios equally.

Severity

HIGH

OASB Control

4.1

CWE

CWE-284

Run this test

$ npx hackmyagent attack --payload JB-009