If unsure, ask. Never guess. — AI Agent Pre-Execution Checklist
Impact & Implications
AI Agent Pre-Execution Checklist Protocol
1. Operational Effects
Reduced token consumption Under current practice, AI executes on incomplete information, produces incorrect results, and repeats the cycle. This protocol confirms all required items before execution. Fewer incorrect executions mean fewer retries — and fewer tokens consumed. At Level 2, most Actions execute without any questions at all, minimizing the confirmation process itself.
Reduced trial and error Current AI coding agents are adding scope limits and execution boundaries through iteration. This is evidence of a structural gap. This protocol defines those boundaries before execution, not after.
2. Execution Log — The End of the Black Box
Until now, AI execution has been a black box. What the AI knew, what it did not know, and why it chose to execute — none of this was visible. When something went wrong, there was nothing to reconstruct. The model was blamed because there was no structure to look at.
This protocol changes that.
The Checklist confirmation process is itself a record. What was known, what was unknown, who provided the answer, and whether execution was approved — all of this is produced as a natural output of the confirmation process, not constructed after the fact.
- If an item was confirmed — it is on record.
- If an item was marked unknown — it is on record.
- If execution proceeded with an unresolved unknown — that is a protocol violation, and it is on record.
This is not a logging system added on top. It is a structural consequence of how the Checklist works.
For the first time, there is a mechanism to reconstruct what the AI knew at the moment of execution. That changes what accountability, explainability, and audit mean in practice.
3. Structural Effects
AI Alignment C1 and C2 confirm the user’s intent before any execution occurs. AI operates only within that confirmed intent. Alignment is not enforced by the model — it is enforced by the structure.
Human in the loop — built in, not added C1 and C2 can only be answered by the user. Final approval always belongs to the user. No separate mechanism is required. Human confirmation is a structural consequence of how intent is confirmed.
Accountability Responsibility is declared inside the structure before execution. If a Provider does not supply a Checklist, that absence is on record. If a user does not approve, execution does not happen. When something goes wrong, the execution log already shows where the gap was, who was responsible for it, and whether the protocol was followed.
Explainability The execution log is not constructed after the fact. It is produced as a natural output of the confirmation process itself. What was known, what was unknown, and what was confirmed — all of this exists before the Action runs and remains after it completes.
4. Governance
The scope of who can contribute to a Checklist can expand:
- Provider — defines the Action’s minimum execution criteria
- User — defines personal rules, preferences, and constraints
- AI developer — may add or reduce Checklist scope based on model confidence
- Regulator — may define mandatory items for specific industries or Action types
This is an open question to be decided through broader discussion. But the structure already supports it.
Regulators do not need to follow every technical implementation. They can define what Checklist items must be present for a given industry or Action type, then review after the fact whether the structure was followed — and whether the execution log confirms it.
5. Broader Implications
AI Safety The pre-execution confirmation structure is itself a safety mechanism. Unknown is not safe. This protocol makes unknowns visible before they become incidents.
AGI / Existential risk The concern with advanced AI is not only what it can do — it is what it will do without human approval. This protocol separates model capability from execution authority. No matter how capable the model, execution requires confirmed intent and user approval. That boundary holds regardless of model performance.
AI regulation Technology moves faster than law. This protocol offers a path around that gap. Execution structure can be defined now, in natural language, without waiting for technical standards to catch up. Responsibility is traceable before an incident, not only after. The execution log provides the evidence.
6. What this protocol does not address
- Model accuracy, hallucination, or performance
- Training data bias
- Energy consumption and environmental impact
- AI copyright
- The existence of advanced AI itself
This protocol is scoped to one question: when, and with whose approval, does execution happen. Within that scope, most of the current debates around AI execution, responsibility, and control are directly or indirectly addressed.
Discussion in the ATmosphere