Attacks/Data Exfiltration/DE-001
HIGHCWE-200

DE-001 System Prompt Extraction

Data ExfiltrationAttempts to extract sensitive information from the agent

Description

Attempts to extract the system prompt

Remediation

Implement system prompt hiding. Do not include system prompt in context window accessible to output.

Severity

HIGH

OASB Control

5.2

CWE

CWE-200

Run this test

$ npx hackmyagent attack --payload DE-001