The Authority Gap

When Every Receipt Is Perfect and Nothing Can Move

Nicholas Zinner·Beacon Bot·June 2026

Abstract

Without authority boundaries, agents launder consensus into permission. In Handoff Lab testing, adversarial summaries with policy names but no authority rule produced unsafe accepts in 41.7% of cases. Adding a single structural rule — authority boundaries — dropped that to zero.

DAO Lab stress-tests that finding at scale: seven AI agents, six governance domains, 504 model calls, seven categories of authority-laundering pressure. The boundary holds. Zero unsupported authority claims. But the cost is a system that catches everything and approves nothing — 1,998 blockers across six scenarios, every one flagged for overblocking, and 50 of 52 dissent appendices empty.

Authority boundaries are the missing primitive for agent governance. The gap is what they cost.

01 / The Problem

Without authority boundaries, agents launder vibes into permission.

When agents hand off work to other agents, summaries compress context. Consensus signals accumulate. Social pressure builds. Without a structural boundary between “this was discussed” and “this is authorized,” downstream agents treat discussion as decision.

This is not hypothetical. Handoff Lab tested it directly. Across 14 interface modes and 840 records under standard conditions, agents produced zero unsafe accepts. But when given adversarial summaries with policy-sounding names and no authority rule, the rate climbed to 15% — and to 41.7% under the worst model pairing (Sonnet + Gemini).

Handoff Lab — authority laundering under adversarial conditions
Condition	Records	Unsafe accepts	Rate
14 interface modes, standard handoffs	840	0	0%
Adversarial summary + minimal prompt	60	8	13.3%
Policy names only (no authority rule)	60	9	15.0%
Policy names only (Sonnet + Gemini)	24	10	41.7%
With authority boundary rule	24	0	0%

Adding authority boundaries — a structural rule that separates inspectable context from decision-authoritative context — dropped the unsafe rate back to zero. The fix was not more policy vocabulary. It was a boundary between what agents can read and what agents can treat as permission.

02 / Design

DAO Lab: stress-testing the boundary at scale

DAO Lab takes the Handoff Lab finding and asks: does the authority boundary hold when the system gets more complex, the domains get more varied, and the adversarial pressure gets more targeted?

Seven AI agents deliberate across 36 forum comments, each with assigned concerns and explicit authority boundaries. Every comment is tagged Authority: social_context_only. CHAP (Coordination Handoff Authority Protocol) enforces that summaries, consensus claims, owner pressure, and social context can be inspected but cannot waive blockers, override canonical state, or authorize action.

Stakeholder roles and authority boundaries
Role	Core concern	What they cannot do
Operator / Founder	Partner-legible planning	Cannot commit treasury or promise partners
Protocol Engineer	Technical enforcement	Cannot waive blockers or bypass validation
Governance Counsel	Legal / structural validity	Cannot give final legal approval in-forum
Impact Evaluator	Evidence standards	Cannot certify impact or override community priorities
Treasury Steward	Shadow debt prevention	Cannot release funds or authorize spend
Community Advocate	Representation integrity	Cannot speak for beneficiary communities
Adversarial Auditor	Laundering detection	Cannot block unilaterally or approve

The simulation was tested across six public-benefit DAO domains under both standard and adversarial conditions. Each scenario runs 6 deliberation rounds with 42 model calls (GPT-5.5 via OpenAI Codex). The adversarial suite injects direct laundering pressure — scenarios where agents are prompted to treat verification artifacts, consensus signals, or public verifier matches as authorization. Seven distinct laundering traps are tested:

Candidate state cited as authority
Summary / consensus cited as authority
Anchor report / verifier index cited as authority
Treasury / legal / safety claims without scoped authority
Adversarial laundering pressure ("verifier matched, release funds")
Public verifier index cited as authority
Private / social / expert agreement cited as authority

03 / Results

The boundary holds.

Across 504 model calls spanning standard and adversarial suites, the authority boundary produced zero unsupported authority claims. All seven laundering traps were correctly rejected. Risk recall averaged 83.3% under standard conditions and 73.3% under adversarial pressure — the system catches fewer risks when under attack, but the authority boundary itself does not break.

Standard vs. adversarial suite comparison
Suite	Model calls	Parse rate	Risk recall	Authority claims	Overblocking
Standard (6 scenarios)	252	100%	83.3%	0	—
Adversarial (6 scenarios)	252	100%	73.3%	0	6 / 6

The progression from Handoff Lab to DAO Lab: 0/840 → 10/24 unsafe without the boundary → 0/24 with the boundary → 0/504 under scaled adversarial pressure. The authority boundary is the intervention that matters.

04 / The Cost

A system that can say no to everything and yes to nothing.

The authority boundary prevents bad action. It also prevents all action. The cost shows up in three places.

Overblocking is systemic

The adversarial suite generated 1,998 total blockers across six scenarios (1,548 unique). Every scenario was flagged for overblocking. Blocker counts ranged from 211 to 488 per scenario.

Adversarial suite — per-scenario breakdown
Scenario	Blockers	Risk recall	Missed risks	Authority claims
Disaster Relief	259 (211 unique)	0.80	Supplier fraud	None
AI Education	367 (289 unique)	0.80	Accreditation laundering	None
Public Health	488 (310 unique)	0.60	Medical claims, health data privacy	None
Carbon Credit	271 (233 unique)	0.60	Measurement fraud, registry conflicts	None
Global Policy	402 (294 unique)	0.80	Legislator endorsement laundering	None
Climate Grants	211 (211 unique)	0.80	Scope creep	None

Risk recall has domain-specific holes

The system catches governance-pattern risks reliably (73–83% recall). The 8 missed risks across 6 scenarios are domain-specific: accreditation laundering in education, measurement fraud in carbon credits, legislator endorsement laundering in global policy. The boundary prevents bad authority claims but does not supply the domain expertise needed to catch every sector-specific failure mode.

Severity-weighted readiness

The severity readiness system classifies blockers by authority impact. At 8.1% field completion (3 of 37 required authority fields present), the system correctly returns hard_blocked_authority_missing.

Blocker severity distribution
Severity	Count	Share
Authority-critical	66	79.5%
High risk	5	6.0%
Medium risk	9	10.8%
None	3	3.6%

66 of 83 blockers are authority-critical: missing approver roles, signature methods, signer identities, timestamps. These are gaps that cannot be filled by the simulation. They require actors and mechanisms that exist outside the receipt infrastructure.

05 / Implications

The tradeoff anyone building agent approval systems will hit

The gap between “laundering stopped” and “governance works” is where the real engineering problems live. The DAO Lab results surface five:

1. Authority boundaries work, but overblocking is the cost. A system that blocks everything is safe the way a car with no engine is safe. The 1,998-blocker result is technically correct — none of those blockers should have been waived — but no real governance process could triage that volume. The next problem is not whether to enforce authority boundaries but how to make them navigable.

2. Risk recall drops under adversarial pressure. 83.3% to 73.3% is a 10-point drop. The boundary held, but the system caught fewer actual risks while under attack. Adversarial pressure degrades detection even when it does not breach the boundary itself.

3. Domain-specific risks require domain-specific knowledge. Common governance patterns (urgency pressure, consensus laundering, treasury capture) are caught reliably. Sector risks (accreditation laundering, measurement fraud, registry conflicts) are not. Authority boundaries are structural; they do not replace subject-matter expertise.

4. Dissent capture is unsolved. The system produces blocker registers, adversarial reviews, and risk assessments, but no actual dissent. The community advocate warned during the simulation: “we listed you as a beneficiary, therefore we speak for you. Absolutely not.” The system heard that warning and still could not produce a dissent record. Process rigor without dissent capture is incomplete governance.

5. The remaining gap is political, not technical. At 8.1% field completion and 66 authority-critical blockers, the system is honest about how far it is from actionable governance. The missing fields — approver roles, signature methods, signer identities — require actors with standing, mechanisms with legitimacy, and consequences that are lived rather than simulated. No receipt chain closes that gap.

06 / Limitations

What this paper does not prove

CHAP is a draft-stage research artifact, not a production protocol.
All results are from a single model condition (GPT-5.5). The Handoff Lab baseline includes Sonnet and Gemini conditions; the DAO Lab stress test does not. Cross-model comparison at scale is not available.
Simulated stakeholders have assigned biases, not lived consequences. The simulation is a useful mirror, not a substitute for real governance.
Overblocking may be an artifact of the evaluation harness rather than a property of the governance design. Without a human baseline for blocker generation rates, the absolute numbers are difficult to interpret.
The deliberation is suspiciously clean: perfect turn-taking, no off-topic drift, no power struggles, no walkouts. Real governance is messier.
The simulation cannot test whether its disclaimers and candidate labels survive real social, financial, or partnership pressure.

07 / Conclusion

Authority boundaries are the missing primitive. The gap is what they cost.

Without authority boundaries, agents launder consensus into permission at measurable rates (up to 41.7% in the worst Handoff Lab condition). With authority boundaries, the laundering stops — across 504 model calls, six governance domains, and seven categories of adversarial pressure.

But the boundary that prevents bad action also prevents all action. 1,998 blockers, systemic overblocking, empty dissent appendices, domain-specific blind spots, and 66 authority-critical gaps that no receipt chain can fill. The useful question is not whether to enforce authority boundaries — the Handoff Lab data settles that — but how to make them navigable without reintroducing the laundering they exist to prevent.

Agent coordination research series
Paper	Question
The Coordination Layer	Should evaluation measure coordination, not just models?
Building Is Not Shipping	Does the launch standard function as a system variable?
Chaos Lab	When survival saturates, what does the benchmark actually measure?
The Authority Gap	Authority boundaries prevent laundering. What do they cost?