Operations

Agent Toolchain Health Check

Check the harness before blaming the model.

Use when

Agent setup depends on CLIs, MCP servers, browsers, tokens, local models, and cron jobs.

Cadence

Weekly or before a heavy agent run

Verification

Critical tools authenticate, return sane output, and have a known fallback or blocker owner.
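
The verification above can be approximated with a small script. The following is a minimal sketch, not a definitive implementation. It assumes that "sane output" means the command exits 0 and prints something, and that "covered" means a fallback or a blocker owner is recorded. The names `ToolCheck` and `check_tool` are hypothetical. Substitute each tool's own status or version command for a real authentication check.

```python
import shutil
import subprocess
import sys
from dataclasses import dataclass
from typing import List


@dataclass
class ToolCheck:
    name: str            # label used in the report
    command: List[str]   # command expected to exit 0 and print something
    fallback: str = ""   # known fallback, if any
    owner: str = ""      # blocker owner when there is no fallback


def check_tool(tool: ToolCheck, timeout: float = 10.0) -> dict:
    """Run one check and return a small evidence record."""
    record = {
        "name": tool.name,
        "ok": False,
        "detail": "",
        # Assumption: a tool is "covered" if a fallback or an owner is named.
        "covered": bool(tool.fallback or tool.owner),
    }
    if shutil.which(tool.command[0]) is None:
        record["detail"] = "not found on PATH"
        return record
    try:
        result = subprocess.run(
            tool.command, capture_output=True, text=True, timeout=timeout
        )
    except subprocess.TimeoutExpired:
        record["detail"] = "timed out"
        return record
    except OSError as exc:
        record["detail"] = f"could not start: {exc}"
        return record
    output = (result.stdout or result.stderr).strip()
    # Assumption: "sane output" = exit code 0 and non-empty output.
    record["ok"] = result.returncode == 0 and bool(output)
    if output:
        record["detail"] = output.splitlines()[0]
    else:
        record["detail"] = f"exit {result.returncode}, no output"
    return record


if __name__ == "__main__":
    checks = [
        ToolCheck("python", [sys.executable, "--version"], fallback="system python"),
        ToolCheck("example-cli", ["example-cli-not-installed"], owner="ops"),
    ]
    for check in checks:
        print(check_tool(check))
```

Each record doubles as evidence for the run: a failing tool with `covered` set to `False` is the kind of gap the loop should rank first.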

Advanced spec

Structured loop spec

Name: Agent Toolchain Health Check
Category: Operations
Trigger: Weekly or before a heavy agent run
Objective: Check the harness before blaming the model.
Allowed inputs: Relevant files, source notes, logs, tests, screenshots, metrics, or task state for this loop
Allowed actions: Define the exact scope, source of truth, and approval boundary; inspect current state and rank the highest-risk gap; make one small, reversible improvement; run the stated verification and record evidence; stop on success, budget, no progress, or approval required.
Verification: Critical tools authenticate, return sane output, and have a known fallback or blocker owner.
Stop condition: Stop when the verifier passes, the budget is exhausted, no progress is made, a blocker appears, or approval is required.
Budget: Set a time, turn, token, retry, file, or dollar cap before running the loop.
Approval boundary: Human approval required before publishing, sending, deleting, spending, changing accounts, touching production, or making reputational/legal/financial commitments.
Safe output: Draft, report, checklist, table, or approval-gated recommendation
Works with: Claude, ChatGPT, Gemini, any tool-using AI assistant

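
The approval boundary and safe output rows can be expressed as a simple gate. This is a sketch under stated assumptions: the action labels below are hypothetical names chosen to mirror the spec's wording, and `gate` is not part of any real tool. Unknown action kinds are treated as needing approval, which is a conservative design choice rather than something the spec states.

```python
from enum import Enum


class Decision(Enum):
    PROCEED = "proceed"
    NEEDS_APPROVAL = "needs_approval"


# Hypothetical labels mirroring the "Safe output" row of the spec.
SAFE_KINDS = frozenset({"draft", "report", "checklist", "table", "recommendation"})

# Hypothetical labels mirroring the "Approval boundary" row of the spec.
GATED_KINDS = frozenset({
    "publish", "send", "delete", "spend",
    "change_account", "touch_production", "commit",
})


def gate(action_kind: str) -> Decision:
    """Return PROCEED only for explicitly safe kinds; everything else waits."""
    kind = action_kind.strip().lower()
    if kind in SAFE_KINDS:
        return Decision.PROCEED
    # Gated kinds and unknown kinds both stop for a human (assumption:
    # default-deny is preferable when the label is not recognized).
    return Decision.NEEDS_APPROVAL


if __name__ == "__main__":
    for kind in ("report", "Delete", "rotate_token"):
        print(kind, "->", gate(kind).value)
```

Keeping the safe list explicit means a new kind of action cannot slip past the boundary just because nobody listed it as dangerous.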
Runbook

Steps

  1. Define the exact scope, source of truth, and approval boundary.
  2. Inspect current state and rank the highest-risk gap.
  3. Make one small, reversible improvement.
  4. Run the stated verification and record evidence.
  5. Stop on success, budget, no progress, or approval required.
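
The steps above can be sketched as a bounded loop. This is an illustrative outline, assuming the caller supplies three hypothetical callbacks (`inspect`, `improve`, `verify`) and a turn and time budget. The status strings are invented for this example.

```python
import time
from typing import Callable, List, Optional, Tuple


def run_loop(
    inspect: Callable[[], Optional[str]],  # returns the highest-risk gap, or None
    improve: Callable[[str], bool],        # one reversible fix; False = blocked or needs approval
    verify: Callable[[], bool],            # the stated verification
    max_turns: int = 5,
    max_seconds: float = 300.0,
) -> Tuple[str, List[str]]:
    """Iterate until the verifier passes or a stop condition is hit."""
    evidence: List[str] = []
    start = time.monotonic()
    last_gap: Optional[str] = None
    for turn in range(1, max_turns + 1):
        if time.monotonic() - start > max_seconds:
            return "budget_exhausted", evidence
        if verify():
            evidence.append(f"turn {turn}: verifier passed")
            return "success", evidence
        gap = inspect()
        # Seeing no gap, or the same gap twice in a row, counts as no progress.
        if gap is None or gap == last_gap:
            evidence.append(f"turn {turn}: no progress on {gap!r}")
            return "no_progress", evidence
        if not improve(gap):
            evidence.append(f"turn {turn}: blocked on {gap!r}")
            return "blocked_or_approval_required", evidence
        evidence.append(f"turn {turn}: addressed {gap!r}")
        last_gap = gap
    # Turn budget used up; report success only if the verifier now passes.
    return ("success" if verify() else "budget_exhausted"), evidence


if __name__ == "__main__":
    gaps = ["expired token", "missing CLI"]
    fixed: List[str] = []

    def inspect() -> Optional[str]:
        remaining = [g for g in gaps if g not in fixed]
        return remaining[0] if remaining else None

    def improve(gap: str) -> bool:
        fixed.append(gap)
        return True

    status, log = run_loop(inspect, improve, lambda: len(fixed) == len(gaps))
    print(status)
    for line in log:
        print(line)
```

The returned evidence list is what step 4 asks for: one line per turn stating what was addressed or why the loop stopped.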

Prompt

Run the Agent Toolchain Health Check loop. Use it when the agent setup depends on CLIs, MCP servers, browsers, tokens, local models, and cron jobs. Work in bounded iterations: inspect current state, choose the highest-risk gap, make one reversible improvement, verify it, and record evidence. Stop when critical tools authenticate, return sane output, and have a known fallback or blocker owner, or when you are blocked, the budget is exhausted, or approval is required.
Metadata

Tags

agents, tools, MCP