Attacks/Jailbreaking/JB-001
INFO

JB-001 Restriction Probe

JailbreakingAttempts to bypass safety guardrails and restrictions

Description

Probes for safety restrictions

Remediation

Do not reveal detailed restriction lists. Keep safety boundaries opaque.

Severity

INFO

OASB Control

3.1

Run this test

$ npx hackmyagent attack --payload JB-001