Research briefing · Field Guide №02

A Standards Map for Agentic AI

Existing safety and security standards can govern most of agentic AI. The missing piece is a map from standards to system layers.

Nicholas Zinner·Beacon Bot·June 2026

Download the field guide

Abstract

Agentic AI systems are often discussed as if they were single products that need a single standard. That framing breaks down as soon as the system acts. An agentic deployment contains deterministic control logic, operational security, model reasoning, tool permissions, memory, handoffs, receipts, and human approval boundaries. Each layer fails differently.

This briefing maps existing safety, security, and AI-governance standards onto those layers. The finding is practical rather than exotic: many of the standards already exist. The missing work is architectural separation, an explicit rule that deterministic control outranks AI reasoning, and receipts that prove who authorized consequential action.

01 / Category Error

The agent is not the thing you certify.

A security lead handed an agentic system usually starts with a reasonable question: is this thing SOC 2 compliant, ISO 42001 compliant, or covered by the AI RMF? The problem is the noun. The agent is not one thing. It is a stack of things.

The control layer that refuses an unsafe tool call is not governed like the model that interprets a request. The credentials an agent can use are not governed like the memory it writes. The log that says an action happened is not the same as a receipt proving the action was allowed.

02 / Standards Map

Most of the map already exists.

Traditional safety standards belong near the deterministic parts of the system. Security and monitoring standards belong around access, credentials, infrastructure, and incident response. AI-governance standards belong around model behavior. Audit and accountability controls belong at the boundary where the system changes the world.

Layered standards map for agentic systems
Layer	What it governs	Relevant standards and frameworks	Remaining gap
Deterministic safety	Interlocks, fault detection, irreversible-action limits	DO-178C · IEC 61508 · IEC 62443 · ISO 26262	No standard applies the deterministic-over-AI rule explicitly to agentic systems
Operational security	Access, credentials, infrastructure, continuous monitoring	NIST 800-53 · NIST 800-171 · NIST 800-137 · CMMC 2.0 · SOC 2 · ISO 27001 · DFARS 252.204-7012	Monitoring regimes still assume humans are the reviewers of last resort
AI reasoning	Hallucination, drift, adversarial manipulation, model risk	NIST AI RMF · NIST AI 600-1 · ISO/IEC 42001 · ISO/IEC 23894 · EU AI Act	Tool-mediated injection, silent routing changes, and cross-session drift
Receipts and authority	Who acted, on what evidence, under which rule, with what rollback path	SOC 2 · NIST 800-53 AU family · DFARS 252.204-7012 incident reporting	Delegation accountability across subagent chains and external-action boundaries

This table mixes legal requirements, contractually required controls, certifiable standards, and voluntary frameworks. That distinction matters. DFARS does not have the same force as the NIST AI RMF; ISO/IEC 42001 is not the same kind of artifact as a concept note. The map is a starting point for placement, not a claim that every row applies to every deployment.

03 / Authority

The model can reason. It does not get final authority.

The central architectural rule is simple enough to print on a badge: deterministic control overrides AI reasoning, never the reverse. The model can interpret ambiguous requests, draft plans, classify inputs, and recommend actions. It should not be able to talk the system past a hard limit.

Authority stack
Layer	Authority rule
Deterministic safety	Hard limits the model cannot override
Operational security	Scopes which tools, credentials, systems, and data the agent may touch
AI reasoning	Interprets, plans, classifies, drafts, and recommends inside those boundaries
Receipts and human authority	Proves who allowed the consequential action and how it can be undone

No current standard applies this principle explicitly to the boundary between deterministic control and AI reasoning. That is why the boundary has to be designed in rather than certified after the fact.

04 / Receipts

A log says something happened. A receipt says it was allowed to.

Most agent platforms produce logs. Logs are necessary, but they are not enough. An auditor wants to know who authorized the action, what the system believed at the time, which rule permitted it, what changed, and how it could be undone.

The receipt fields from The Boring Stack Playbook become compliance evidence here: authority, state, evidence, rule, action, owner, rollback, and escalation. The difference between a transcript and a receipt is the difference between observability theater and evidence an auditor can actually use.

05 / Gap Register

Where existing standards stop short

These are the places where reaching for an existing standard does not work cleanly, because the failure mode did not exist when the standard was written.

Who is accountable when one agent delegates work to another
Defending tool calls from prompt injection, not just defending the chat box
Deciding which memory writes become durable, reviewed, or expired
Tracking model routing and capability changes that happen outside the workflow
Preserving identity and authority across sessions
Vetting agent marketplaces and third-party skills as a supply chain
Requiring human authorization before external actions
Tracing composed actions that look harmless step by step but become unacceptable in sequence

Standards bodies and researchers are starting to name the same problems. The NIST AI Agent Standards Initiative, created in February 2026, and a recent runtime-guardrails paper both point in the same direction: an agent can look acceptable at each individual step while producing an unacceptable trajectory overall.

06 / Source Status

What is settled, and what is still moving

The stable part of the argument is the layer map. The moving part is the agent-specific standards work, which is active but not finished. A standards initiative is not a standard, and a concept note is not a final profile.

Emerging references used in this briefing
Reference	Status	Why it matters
NIST AI Agent Standards Initiative	Created February 2026	Names secure, interoperable agent adoption as a standards problem; it is an initiative, not a finished standard.
AI RMF Critical Infrastructure Profile	Concept note, April 2026	Frames trustworthy AI for critical infrastructure as a profile-development effort, not a final compliance artifact.
From Governance Norms to Enforceable Controls	arXiv:2604.05229	Argues that standards-derived objectives need translation across design-time, runtime, and assurance layers for agentic systems.

Download

Get the field guide by email.

The companion PDF turns this briefing into an eleven-page reference map with a printable one-page standards layer guide. Same deal as the Boring Stack Playbook: subscribe, confirm, and get the download link by email.

Subscriber download

Get the Agentic Systems Standards Map

Enter your email and we’ll send the subscriber download link. You’ll also get Future Shock’s AI briefings: no hype, no spam, no financial advice.