Evaluation
Agent Merge Queue Review
Separate useful agent output from merge-shaped confetti.
Use when
Agent throughput has created more branches than a human can safely review from memory.
Cadence
After multiple agent-generated PRs or branches accumulate
Verification
Only branches with passing checks, clear intent, non-conflicting scope, and human-readable evidence are merged or promoted.
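
The four conditions in this rule can be read as a gate that either passes a candidate or explains why it does not. A minimal sketch follows, assuming a hypothetical `Candidate` record; the field names and the `gate` function are illustrative and not part of the loop spec.

```python
from dataclasses import dataclass, field


@dataclass
class Candidate:
    """Hypothetical record for one agent-generated branch or PR."""
    name: str
    checks_passed: bool
    intent: str = ""  # one-line statement of what the branch is for
    conflicts_with: list[str] = field(default_factory=list)  # names of overlapping branches
    evidence: str = ""  # human-readable proof, e.g. test output or a review note


def gate(candidate: Candidate) -> list[str]:
    """Return the reasons a candidate fails the gate; an empty list means it passes."""
    failures = []
    if not candidate.checks_passed:
        failures.append("checks are not passing")
    if not candidate.intent.strip():
        failures.append("intent is unclear")
    if candidate.conflicts_with:
        failures.append("scope conflicts with " + ", ".join(candidate.conflicts_with))
    if not candidate.evidence.strip():
        failures.append("no human-readable evidence")
    return failures
```

Returning the list of failures rather than a bare boolean keeps the "with a reason" requirement cheap to satisfy: the same strings can go straight into a request-changes or park decision.
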
Structured loop spec
| Field | Value |
|---|---|
| Name | Agent Merge Queue Review |
| Category | Evaluation |
| Trigger | After multiple agent-generated PRs or branches accumulate |
| Objective | Separate useful agent output from merge-shaped confetti. |
| Allowed inputs | Relevant files, source notes, logs, tests, screenshots, metrics, or task state for this loop |
| Allowed actions | List all candidate PRs/branches with goal, touched files, test evidence, risk, and age; group related branches and identify conflicts, duplicated work, and dependency ordering; run targeted checks for the highest-value candidates first; approve, request changes, merge, park, or close each candidate with a reason; update repo instructions or loop docs with any recurring failure pattern seen across branches. |
| Verification | Only branches with passing checks, clear intent, non-conflicting scope, and human-readable evidence are merged or promoted. |
| Stop condition | Stop when the verifier passes, the budget is exhausted, no progress is made, a blocker appears, or approval is required. |
| Budget | Set a time, turn, token, retry, file, or dollar cap before running the loop. |
| Approval boundary | Human approval required before publishing, sending, deleting, spending, changing accounts, touching production, or making reputational/legal/financial commitments. |
| Safe output | Draft, report, checklist, table, or approval-gated recommendation |
| Works with | Claude, ChatGPT, Gemini, any tool-using AI assistant |
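
The stop condition and budget rows can be combined into a single check that runs after every turn of the loop. The sketch below is one possible shape, assuming a hypothetical `LoopState` record and a turn-based cap; the default limits are placeholders, and the spec equally allows time, token, retry, file, or dollar caps.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LoopState:
    """Hypothetical bookkeeping for one run of the loop."""
    verifier_passed: bool = False
    turns_used: int = 0
    turns_without_progress: int = 0
    blocker: Optional[str] = None
    approval_needed: bool = False


def stop_reason(state: LoopState, max_turns: int = 10,
                max_stalled_turns: int = 2) -> Optional[str]:
    """Return why the loop should stop, or None if it may continue."""
    if state.verifier_passed:
        return "verifier passed"
    if state.turns_used >= max_turns:  # budget cap, set before the run
        return "budget exhausted"
    if state.turns_without_progress >= max_stalled_turns:
        return "no progress"
    if state.blocker:
        return "blocker: " + state.blocker
    if state.approval_needed:
        return "approval required"
    return None
```

The caps are function arguments so that the budget is fixed before the loop starts rather than adjusted while it runs.
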
Runbook steps
- List all candidate PRs/branches with goal, touched files, test evidence, risk, and age.
- Group related branches and identify conflicts, duplicated work, and dependency ordering.
- Run targeted checks for the highest-value candidates first.
- Approve, request changes, merge, park, or close each candidate with a reason.
- Update repo instructions or loop docs with any recurring failure pattern seen across branches.
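
The inventory and grouping steps above lend themselves to a small script. As a rough sketch, overlap in touched files can serve as a first-pass signal for related or conflicting branches; it is a proxy, not a real merge test. The branch names and the `overlapping_pairs` helper are hypothetical, and the input mapping is assumed to be collected beforehand, for example with `git diff --name-only main...<branch>`.

```python
from itertools import combinations


def overlapping_pairs(touched: dict[str, set[str]]) -> list[tuple[str, str, set[str]]]:
    """Return every pair of branches that touches at least one common file."""
    pairs = []
    for a, b in combinations(sorted(touched), 2):
        shared = touched[a] & touched[b]
        if shared:
            pairs.append((a, b, shared))
    return pairs


# Hypothetical inventory: branch name -> files it changes relative to the base branch.
inventory = {
    "agent/fix-login": {"auth.py", "tests/test_auth.py"},
    "agent/refactor-auth": {"auth.py"},
    "agent/docs": {"README.md"},
}

for first, second, shared in overlapping_pairs(inventory):
    print(f"{first} and {second} both touch: {', '.join(sorted(shared))}")
# prints "agent/fix-login and agent/refactor-auth both touch: auth.py"
```

Branches that appear in no pair are candidates for independent review; branches that share files need an ordering decision or a closer look for duplicated work.
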
Prompt
Run the Agent Merge Queue Review loop. Inventory candidate PRs or branches with goal, touched files, verification evidence, risk, and age. Group related work, find conflicts and duplicated abstractions, run targeted checks for the highest-value candidates, and decide approve/request-changes/merge/park/close with reasons. Capture recurring agent failure patterns in repo instructions or loop docs.