The lexicon

The language of production agents.

The vocabulary for the layer between AI capability and institutional trust. 25 terms, each defined by how it fails and how you fix it.

Governance & authority

▸ Action Lineage

The append-only record of which agent did what, on whose authority, with what inputs, and what changed.

▸ Agent Assurance

The discipline of proving an agent should have done what it did. Monitoring tells you something happened; assurance proves it was allowed, bounded, and correct.

▸ Attestation

External or human verification treated as a first-class evidence mode.

▸ Authority Envelope

The explicit set of actions, resources, and scopes an agent is allowed to touch. The boundary. Everything outside it is denied by default.

▸ Coordination

Making multiple agents, human and machine, work together without chaos: trust, accountability, verification, protocol.

▸ Determinism at Edges

Bound uncertainty at the boundaries of a system so it does not leak into everything downstream.

▸ Principal Chain

The chain of on-whose-behalf an action runs: user to agent to sub-agent, every action resolving back to a human principal.

▸ Revocation

The ability to instantly kill an agent's authority. The off switch. If you cannot revoke in one move, you do not control the agent.

▸ Theory of Constraints

Find the one constraint that governs the whole system, because improving anything else is noise.

▸ Trust Artifact

The durable receipt you can show leadership, security, customers, or a regulator. Proof that survives after the agent acts.

Context

▸ Context Plane

The addressable layer where an agent gets what it needs to know: records, typed links, permissioned views. The difference between an agent that guesses and one that knows.

▸ Organizational Legibility

Making an organization addressable, so agents and humans can actually act inside it.

▸ Outcome Graph

Capability modeled as a position in a graph of achieved outcomes, not a label or a level.

Evals

▸ Eval Loop

The repeatable harness that scores agent behavior against known-good outcomes. The thing that turns seems fine into a number you can watch.

▸ Failure Taxonomy

A named map of how agents break: exfiltration, corruption, cost blowup, loop, drift, silent-wrong. You cannot monitor what you cannot name.

Observability

▸ Decision Record

A durable, reviewable record of a choice: what was decided, why, and what it produced.

▸ Receipts Over Claims

A working posture: prove it with an artifact, then say it. Evidence beats assertion every time.

▸ Remediation Loop

What happens after a failure is seen: trace to eval to policy to fix to proof. Closing the loop is what turns monitoring into assurance.

▸ Runtime Receipt

Live proof that a tool or surface works right now, not that it worked at last deploy.

▸ Trace

The replayable record of a single agent run: prompts, context, tool calls, outputs. The black box recorder.

Org & practice

▸ Agent Civilization

The end state: hundreds or thousands of agents, governed, observable, and owned. The alternative to the shadow workforce.

▸ Agent Guild

The org structure that governs agents: charter, review council, ownership ladder, rituals. How you go from agent anarchy to civilization.

▸ Dark Agent

A semi-autonomous AI process that takes real actions on real systems but is not treated like a production system: no owner, no logs, no policy, no tests.

▸ Governance X-Ray

A fast assessment that surfaces every agent in your org, its owner, its blast radius, and its risk tier. The first thing you run.

▸ Operator Sovereignty

Preserving independent judgment, attention, and identity while working with AI.