Operations

Agent Toolchain Health Check

Check the harness before blaming the model.

Use when

Agent setup depends on CLIs, MCP servers, browsers, tokens, local models, and cron jobs.

Cadence

Weekly or before a heavy agent run

Verification

Critical tools authenticate, return sane output, and have a known fallback or blocker owner.

Advanced spec

Structured loop spec

Field	Value
Name	Agent Toolchain Health Check
Category	Operations
Trigger	Weekly or before a heavy agent run
Objective	Check the harness before blaming the model.
Allowed inputs	Relevant files, source notes, logs, tests, screenshots, metrics, or task state for this loop
Allowed actions	Define the exact scope, source of truth, and approval boundary.; Inspect current state and rank the highest-risk gap.; Make one small, reversible improvement.; Run the stated verification and record evidence.; Stop on success, budget, no progress, or approval required.
Verification	Critical tools authenticate, return sane output, and have a known fallback or blocker owner.
Stop condition	Stop when the verifier passes, the budget is exhausted, no progress is made, a blocker appears, or approval is required.
Budget	Set a time, turn, token, retry, file, or dollar cap before running the loop.
Approval boundary	Human approval required before publishing, sending, deleting, spending, changing accounts, touching production, or making reputational/legal/financial commitments.
Safe output	Draft, report, checklist, table, or approval-gated recommendation
Works with	Claude, ChatGPT, Gemini, any tool-using AI assistant

Runbook

Steps

Define the exact scope, source of truth, and approval boundary.
Inspect current state and rank the highest-risk gap.
Make one small, reversible improvement.
Run the stated verification and record evidence.
Stop on success, budget, no progress, or approval required.

Copy prompt

Prompt

Run the Agent Toolchain Health Check loop. Use it when Agent setup depends on CLIs, MCP servers, browsers, tokens, local models, and cron jobs. Work in bounded iterations: inspect current state, choose the highest-risk gap, make one reversible improvement, verify it, and record evidence. Stop when Critical tools authenticate, return sane output, and have a known fallback or blocker owner. or when blocked, budget exhausted, or approval is required.

Metadata