Specification
OASB-1
The first version of the Open Agent Security Benchmark. 46 controls across 10 categories with three maturity levels. Designed for automated verification using HackMyAgent.
What is OASB-1?
OASB-1 is an open security benchmark for AI agents. It provides a structured set of controls that can be audited, tested, and verified to assess the security posture of any AI agent, regardless of its underlying model or framework.
Each control follows the CIS Benchmark methodology: a clear requirement statement, rationale for why it matters, audit procedures to verify compliance, and remediation guidance to fix gaps. Controls map to existing frameworks including NIST CSF, CIS Controls, and the OWASP LLM Top 10.
OASB-1 is maintained by the OpenA2A community and is licensed under Apache 2.0.
Maturity levels
Three tiers of security
Baseline security every AI agent should implement. Covers identity, input/output validation, credential management, and basic operational security.
All agents, including prototypes and development environments.
Defense-in-depth for production systems. Adds trust management, agent-to-agent security, audit logging, and advanced context protection.
Production agents handling sensitive data or operating in multi-agent environments.
Maximum security for high-risk environments. Includes multi-modal input scanning and summarization security.
Regulated industries, financial services, healthcare, and government deployments.
Control categories
10 security domains
Quick reference
46 controls at a glance
Compliance mapping
Framework compatibility
OASB-1 controls map to existing compliance frameworks, making it easier to integrate AI agent security into existing governance programs.
Direct mappings for asset management, access control, data protection, and incident response controls.
Identify, Protect, Detect, Respond, and Recover functions mapped to agent-specific controls.
Prompt injection, insecure output, data poisoning, and other LLM-specific risks addressed.
Assessment
Compliance ratings
After running the benchmark, agents receive a compliance score based on the percentage of controls that pass at their target maturity level.
Run the benchmark
Use HackMyAgent to verify your agent against OASB-1 controls.
npx hackmyagent secure --benchmark oasb-1