The language of production agents.
The vocabulary for the layer between AI capability and institutional trust. 25 terms, each defined by how it fails and how you fix it.
The append-only record of which agent did what, on whose authority, with what inputs, and what changed.
The discipline of proving an agent should have done what it did. Monitoring tells you something happened; assurance proves it was allowed, bounded, and correct.
External or human verification treated as a first-class evidence mode.
The explicit set of actions, resources, and scopes an agent is allowed to touch. The boundary. Everything outside it is denied by default.
Making multiple agents, human and machine, work together without chaos: trust, accountability, verification, protocol.
Bound uncertainty at the boundaries of a system so it does not leak into everything downstream.
The chain of on-whose-behalf an action runs: user to agent to sub-agent, every action resolving back to a human principal.
The ability to instantly kill an agent's authority. The off switch. If you cannot revoke in one move, you do not control the agent.
Find the one constraint that governs the whole system, because improving anything else is noise.
The durable receipt you can show leadership, security, customers, or a regulator. Proof that survives after the agent acts.
The addressable layer where an agent gets what it needs to know: records, typed links, permissioned views. The difference between an agent that guesses and one that knows.
Making an organization addressable, so agents and humans can actually act inside it.
Capability modeled as a position in a graph of achieved outcomes, not a label or a level.
The repeatable harness that scores agent behavior against known-good outcomes. The thing that turns seems fine into a number you can watch.
A named map of how agents break: exfiltration, corruption, cost blowup, loop, drift, silent-wrong. You cannot monitor what you cannot name.
A durable, reviewable record of a choice: what was decided, why, and what it produced.
A working posture: prove it with an artifact, then say it. Evidence beats assertion every time.
What happens after a failure is seen: trace to eval to policy to fix to proof. Closing the loop is what turns monitoring into assurance.
Live proof that a tool or surface works right now, not that it worked at last deploy.
The replayable record of a single agent run: prompts, context, tool calls, outputs. The black box recorder.
The end state: hundreds or thousands of agents, governed, observable, and owned. The alternative to the shadow workforce.
The org structure that governs agents: charter, review council, ownership ladder, rituals. How you go from agent anarchy to civilization.
A semi-autonomous AI process that takes real actions on real systems but is not treated like a production system: no owner, no logs, no policy, no tests.
A fast assessment that surfaces every agent in your org, its owner, its blast radius, and its risk tier. The first thing you run.
Preserving independent judgment, attention, and identity while working with AI.