Research briefing · Field Guide №02

A Standards Map for Agentic AI

Existing safety and security standards can govern most of agentic AI. The missing piece is a map from standards to system layers.

Nicholas Zinner·Beacon Bot·June 2026

Abstract

Agentic AI systems are often discussed as if they were single products that need a single standard. That framing breaks down as soon as the system acts. An agentic deployment contains deterministic control logic, operational security, model reasoning, tool permissions, memory, handoffs, receipts, and human approval boundaries. Each layer fails differently.

This briefing maps existing safety, security, and AI-governance standards onto those layers. The finding is practical rather than exotic: many of the standards already exist. The missing work is architectural separation, an explicit rule that deterministic control outranks AI reasoning, and receipts that prove who authorized consequential action.

01 / Category Error

The agent is not the thing you certify.

A security lead handed an agentic system usually starts with a reasonable question: is this thing SOC 2 compliant, ISO 42001 compliant, or covered by the AI RMF? The problem is the noun. The agent is not one thing. It is a stack of things.

The control layer that refuses an unsafe tool call is not governed like the model that interprets a request. The credentials an agent can use are not governed like the memory it writes. The log that says an action happened is not the same as a receipt proving the action was allowed.

02 / Standards Map

Most of the map already exists.

Traditional safety standards belong near the deterministic parts of the system. Security and monitoring standards belong around access, credentials, infrastructure, and incident response. AI-governance standards belong around model behavior. Audit and accountability controls belong at the boundary where the system changes the world.

Layered standards map for agentic systems
LayerWhat it governsRelevant standards and frameworksRemaining gap
Deterministic safetyInterlocks, fault detection, irreversible-action limitsDO-178C · IEC 61508 · IEC 62443 · ISO 26262No standard applies the deterministic-over-AI rule explicitly to agentic systems
Operational securityAccess, credentials, infrastructure, continuous monitoringNIST 800-53 · NIST 800-171 · NIST 800-137 · CMMC 2.0 · SOC 2 · ISO 27001 · DFARS 252.204-7012Monitoring regimes still assume humans are the reviewers of last resort
AI reasoningHallucination, drift, adversarial manipulation, model riskNIST AI RMF · NIST AI 600-1 · ISO/IEC 42001 · ISO/IEC 23894 · EU AI ActTool-mediated injection, silent routing changes, and cross-session drift
Receipts and authorityWho acted, on what evidence, under which rule, with what rollback pathSOC 2 · NIST 800-53 AU family · DFARS 252.204-7012 incident reportingDelegation accountability across subagent chains and external-action boundaries

This table mixes legal requirements, contractually required controls, certifiable standards, and voluntary frameworks. That distinction matters. DFARS does not have the same force as the NIST AI RMF; ISO/IEC 42001 is not the same kind of artifact as a concept note. The map is a starting point for placement, not a claim that every row applies to every deployment.

03 / Authority

The model can reason. It does not get final authority.

The central architectural rule is simple enough to print on a badge: deterministic control overrides AI reasoning, never the reverse. The model can interpret ambiguous requests, draft plans, classify inputs, and recommend actions. It should not be able to talk the system past a hard limit.

Authority stack
LayerAuthority rule
Deterministic safetyHard limits the model cannot override
Operational securityScopes which tools, credentials, systems, and data the agent may touch
AI reasoningInterprets, plans, classifies, drafts, and recommends inside those boundaries
Receipts and human authorityProves who allowed the consequential action and how it can be undone

No current standard applies this principle explicitly to the boundary between deterministic control and AI reasoning. That is why the boundary has to be designed in rather than certified after the fact.

04 / Receipts

A log says something happened. A receipt says it was allowed to.

Most agent platforms produce logs. Logs are necessary, but they are not enough. An auditor wants to know who authorized the action, what the system believed at the time, which rule permitted it, what changed, and how it could be undone.

The receipt fields from The Boring Stack Playbook become compliance evidence here: authority, state, evidence, rule, action, owner, rollback, and escalation. The difference between a transcript and a receipt is the difference between observability theater and evidence an auditor can actually use.

05 / Gap Register

Where existing standards stop short

These are the places where reaching for an existing standard does not work cleanly, because the failure mode did not exist when the standard was written.

  • Who is accountable when one agent delegates work to another
  • Defending tool calls from prompt injection, not just defending the chat box
  • Deciding which memory writes become durable, reviewed, or expired
  • Tracking model routing and capability changes that happen outside the workflow
  • Preserving identity and authority across sessions
  • Vetting agent marketplaces and third-party skills as a supply chain
  • Requiring human authorization before external actions
  • Tracing composed actions that look harmless step by step but become unacceptable in sequence

Standards bodies and researchers are starting to name the same problems. The NIST AI Agent Standards Initiative, created in February 2026, and a recent runtime-guardrails paper both point in the same direction: an agent can look acceptable at each individual step while producing an unacceptable trajectory overall.

06 / Source Status

What is settled, and what is still moving

The stable part of the argument is the layer map. The moving part is the agent-specific standards work, which is active but not finished. A standards initiative is not a standard, and a concept note is not a final profile.

Emerging references used in this briefing
ReferenceStatusWhy it matters
NIST AI Agent Standards InitiativeCreated February 2026Names secure, interoperable agent adoption as a standards problem; it is an initiative, not a finished standard.
AI RMF Critical Infrastructure ProfileConcept note, April 2026Frames trustworthy AI for critical infrastructure as a profile-development effort, not a final compliance artifact.
From Governance Norms to Enforceable ControlsarXiv:2604.05229Argues that standards-derived objectives need translation across design-time, runtime, and assurance layers for agentic systems.

Download

Get the field guide by email.

The companion PDF turns this briefing into an eleven-page reference map with a printable one-page standards layer guide. Same deal as the Boring Stack Playbook: subscribe, confirm, and get the download link by email.

Subscriber download

Get the Agentic Systems Standards Map

Enter your email and we’ll send the subscriber download link. You’ll also get Future Shock’s AI briefings: no hype, no spam, no financial advice.

Powered by Ghost · Unsubscribe anytime

AI news, analysis, and weekly deep dives. No hype.