v1.0

Specification

OASB-1

OASB-1 is the first version of the Open Agent Security Benchmark. It defines 46 controls across 10 categories and three maturity levels, and is designed for automated verification with HackMyAgent.

What is OASB-1?

OASB-1 is an open security benchmark for AI agents. It provides a structured set of controls that can be audited, tested, and verified to assess the security posture of any AI agent, regardless of its underlying model or framework.

Each control follows the CIS Benchmark methodology: a clear requirement statement, rationale for why it matters, audit procedures to verify compliance, and remediation guidance to fix gaps. Controls map to existing frameworks including NIST CSF, CIS Controls, and the OWASP LLM Top 10.
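The four-part control structure described above can be sketched as a small record type. This is only an illustration: the class name, field names, and the mappings field are assumptions made here, not a schema published by OASB-1, and the example values are placeholders rather than the text of any real control.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Control:
    """One benchmark control, using the four parts named in the text."""

    control_id: str   # e.g. "3.1", following the category.number pattern
    requirement: str  # clear requirement statement
    rationale: str    # why the control matters
    audit: str        # procedure used to verify compliance
    remediation: str  # guidance for fixing a gap
    # Hypothetical field: framework name -> reference IDs in that framework.
    mappings: dict[str, list[str]] = field(default_factory=dict)

    @property
    def category(self) -> int:
        """Category number, taken from the part of the ID before the dot."""
        return int(self.control_id.split(".")[0])


# Placeholder text only; this is not the wording of a real OASB-1 control.
example = Control(
    control_id="3.1",
    requirement="<requirement statement>",
    rationale="<why it matters>",
    audit="<audit procedure>",
    remediation="<remediation guidance>",
    mappings={"OWASP LLM Top 10": ["<reference>"]},
)
print(example.category)  # 3
```

Keeping the audit and remediation text on the same record as the requirement means a failed check can be reported together with its fix.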

OASB-1 is maintained by the OpenA2A community and is licensed under Apache 2.0.

Maturity levels

Three tiers of security

Essential: 26 controls

Baseline security every AI agent should implement. Covers identity, input/output validation, credential management, and basic operational security.

Intended for all agents, including prototypes and development environments.

Standard: 18 controls

Defense-in-depth for production systems. Adds trust management, agent-to-agent security, audit logging, and advanced context protection.

Intended for production agents that handle sensitive data or operate in multi-agent environments.

Hardened: 2 controls

Maximum security for high-risk environments. Includes multi-modal input scanning and summarization security.

Intended for regulated industries, financial services, healthcare, and government deployments.

Control categories

10 security domains

Identity & Provenance

Who is this agent? Can we verify?

1.1 1.2 1.3 1.4

Capability & Authorization

What can this agent do?

2.1 2.2 2.3 2.4 2.5

Input Security

How do we protect against malicious input?

3.1 3.2 3.3 3.4 3.5

Output Security

How do we validate agent outputs?

4.1 4.2 4.3 4.4

Credential Protection

How do we protect secrets?

5.1 5.2 5.3 5.4 5.5

Supply Chain Integrity

How do we trust components?

6.1 6.2 6.3 6.4 6.5

Agent-to-Agent Security

How do agents trust each other?

7.1 7.2 7.3 7.4

Memory & Context Integrity

How do we protect agent memory?

8.1 8.2 8.3 8.4

Operational Security

How do we run agents safely?

9.1 9.2 9.3 9.4 9.5

Monitoring & Response

How do we detect and respond?

10.1 10.2 10.3 10.4 10.5
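The category layout above can be captured as a lookup table and checked against the stated totals. This is a minimal sketch: the names CATEGORIES, LEVEL_COUNTS, and control_ids are illustrative choices, not identifiers from OASB-1 or HackMyAgent. The per-category counts are read from the ID lists above, and the per-level counts come from the maturity-level section.

```python
# Category name -> number of controls, in the order the categories are listed.
CATEGORIES = {
    "Identity & Provenance": 4,
    "Capability & Authorization": 5,
    "Input Security": 5,
    "Output Security": 4,
    "Credential Protection": 5,
    "Supply Chain Integrity": 5,
    "Agent-to-Agent Security": 4,
    "Memory & Context Integrity": 4,
    "Operational Security": 5,
    "Monitoring & Response": 5,
}

# Controls per maturity level, as stated in the maturity-level section.
LEVEL_COUNTS = {"Essential": 26, "Standard": 18, "Hardened": 2}


def control_ids() -> list[str]:
    """Build IDs such as "1.1" ... "10.5" from the category table."""
    ids = []
    for number, count in enumerate(CATEGORIES.values(), start=1):
        ids.extend(f"{number}.{i}" for i in range(1, count + 1))
    return ids


ALL_CONTROL_IDS = control_ids()

# Both breakdowns should agree with the headline figure of 46 controls.
assert len(ALL_CONTROL_IDS) == sum(LEVEL_COUNTS.values()) == 46
print(len(ALL_CONTROL_IDS), ALL_CONTROL_IDS[0], ALL_CONTROL_IDS[-1])  # 46 1.1 10.5
```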

Quick reference

46 controls at a glance

Framework compatibility

OASB-1 controls map to existing compliance frameworks, making it easier to integrate AI agent security into existing governance programs.

CIS Controls v8

Direct mappings for asset management, access control, data protection, and incident response controls.

NIST CSF

Identify, Protect, Detect, Respond, and Recover functions mapped to agent-specific controls.

OWASP LLM Top 10

Addresses prompt injection, insecure output handling, data poisoning, and other LLM-specific risks.

Assessment

Compliance ratings

After running the benchmark, agents receive a compliance score based on the percentage of controls that pass at their target maturity level.

A (90-100%): Fully compliant

B (70-89%): Mostly compliant

C (50-69%): Partially compliant

F (<50%): Non-compliant
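The rating bands above amount to a threshold lookup on the pass percentage. The sketch below is illustrative only: compliance_grade is a hypothetical helper, not part of HackMyAgent, and it assumes that each band's lower bound is inclusive, so a fractional score such as 89.5% is rated B. The source does not say how fractional scores are handled.

```python
def compliance_grade(passed: int, total: int) -> str:
    """Map a pass count to a letter rating using the bands listed above.

    `total` is the number of controls evaluated at the target maturity level.
    Assumption: lower bounds are inclusive (90 -> A, 70 -> B, 50 -> C).
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= passed <= total:
        raise ValueError("passed must be between 0 and total")

    percent = 100 * passed / total
    if percent >= 90:
        return "A"  # Fully compliant
    if percent >= 70:
        return "B"  # Mostly compliant
    if percent >= 50:
        return "C"  # Partially compliant
    return "F"      # Non-compliant


# Example: 24 of the 26 Essential controls pass (about 92%).
print(compliance_grade(24, 26))  # A
```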

Run the benchmark

Use HackMyAgent to verify your agent against OASB-1 controls.

$ npx hackmyagent secure --benchmark oasb-1