Operations
Agent Toolchain Health Check
Check the harness before blaming the model.
Use when
Agent setup depends on CLIs, MCP servers, browsers, tokens, local models, and cron jobs.
Cadence
Weekly or before a heavy agent run
Verification
Critical tools authenticate, return sane output, and have a known fallback or blocker owner.
Advanced specStructured loop spec
| Field | Value |
|---|---|
| Name | Agent Toolchain Health Check |
| Category | Operations |
| Trigger | Weekly or before a heavy agent run |
| Objective | Check the harness before blaming the model. |
| Allowed inputs | Relevant files, source notes, logs, tests, screenshots, metrics, or task state for this loop |
| Allowed actions | Define the exact scope, source of truth, and approval boundary.; Inspect current state and rank the highest-risk gap.; Make one small, reversible improvement.; Run the stated verification and record evidence.; Stop on success, budget, no progress, or approval required. |
| Verification | Critical tools authenticate, return sane output, and have a known fallback or blocker owner. |
| Stop condition | Stop when the verifier passes, the budget is exhausted, no progress is made, a blocker appears, or approval is required. |
| Budget | Set a time, turn, token, retry, file, or dollar cap before running the loop. |
| Approval boundary | Human approval required before publishing, sending, deleting, spending, changing accounts, touching production, or making reputational/legal/financial commitments. |
| Safe output | Draft, report, checklist, table, or approval-gated recommendation |
| Works with | Claude, ChatGPT, Gemini, any tool-using AI assistant |
RunbookSteps
- Define the exact scope, source of truth, and approval boundary.
- Inspect current state and rank the highest-risk gap.
- Make one small, reversible improvement.
- Run the stated verification and record evidence.
- Stop on success, budget, no progress, or approval required.
Copy promptPrompt
Run the Agent Toolchain Health Check loop. Use it when Agent setup depends on CLIs, MCP servers, browsers, tokens, local models, and cron jobs. Work in bounded iterations: inspect current state, choose the highest-risk gap, make one reversible improvement, verify it, and record evidence. Stop when Critical tools authenticate, return sane output, and have a known fallback or blocker owner. or when blocked, budget exhausted, or approval is required.