jedarden/trail-boss

Single-pane attention router for interactive AI coding agents: surfaces whichever session is blocked waiting on you, so you answer or skip from one place.

jedarden 2c9d0436be docs(trail-boss): update PROGRESS.md - phase 6 complete		2026-05-30 12:58:36 -04:00
.beads	feat(trail-boss): phase 6 complete - all 7 acceptance scenarios passing	2026-05-30 12:55:37 -04:00
.claude	feat(trail-boss): phase 6 complete - all 7 acceptance scenarios passing	2026-05-30 12:55:37 -04:00
.marathon	docs(marathon): update Current state — phases 1–5 complete, phase 6 next	2026-05-25 22:57:57 -04:00
bin	feat(trail-boss): phase 6 complete - all 7 acceptance scenarios passing	2026-05-30 12:55:37 -04:00
daemon	feat(trail-boss): phase 6 complete - all 7 acceptance scenarios passing	2026-05-30 12:55:37 -04:00
docs	feat(trail-boss): phase 1-2 - PermissionRequest probe and emitter	2026-05-25 22:04:03 -04:00
notes	docs(trail-boss): phase 6 complete - all 7 acceptance scenarios passing	2026-05-30 12:57:08 -04:00
.gitignore	chore(marathon): add plan-driven marathon coding infrastructure	2026-05-25 21:42:55 -04:00
.needle-predispatch-sha	feat(trail-boss): phase 6 complete - all 7 acceptance scenarios passing	2026-05-30 12:55:37 -04:00
package.json	feat(trail-boss): commit phase 3 daemon code (previously untracked)	2026-05-25 22:57:51 -04:00
PROGRESS.md	docs(trail-boss): update PROGRESS.md - phase 6 complete	2026-05-30 12:58:36 -04:00
README.md	docs(readme): frame Trail Boss as human-on-the-loop / dead-letter queue	2026-05-24 22:56:59 -04:00
test-daemon-phase3.sh	feat(trail-boss): commit phase 3 daemon code (previously untracked)	2026-05-25 22:57:51 -04:00
test-daemon.sh	feat(trail-boss): commit phase 3 daemon code (previously untracked)	2026-05-25 22:57:51 -04:00
test-navigation.sh	feat(trail-boss): phase 4 - Navigation command	2026-05-25 22:18:01 -04:00
test-presentation.sh	feat(trail-boss): phase 5 - Presentation layer	2026-05-25 22:26:17 -04:00
test-walking-skeleton.sh	feat(trail-boss): phase 6 complete - all 7 acceptance scenarios passing	2026-05-30 12:55:37 -04:00
tmux.conf	feat(trail-boss): phase 6 complete - all 7 acceptance scenarios passing	2026-05-30 12:55:37 -04:00

README.md

Trail Boss

You run a herd of AI coding agents like cattle, each grazing its own task. When one bogs down or strays — needs a decision, hits a permission gate, or finishes and waits for the next order — Trail Boss is the single pane where it reports in. You ride over, set it right (or wave it on), and your reply lands back in the exact session — so you stop hand-cycling terminal windows hunting for whoever's stuck.

Human on the loop, not in it

Trail Boss turns human-in-the-loop into human-on-the-loop. Classic agentic HITL wires you into the inner cycle — approving each step, answering each prompt — so you are the bottleneck on every iteration. Trail Boss flips it: agents run autonomously by default and you supervise from above, engaged only by exception. When an agent can't proceed on its own — needs a decision, hits a permission gate, or exhausts its turn — it falls through to you.

Put plainly, the human is the failure mode. Trail Boss is a dead-letter queue for a fleet of agents: the happy path never touches you; only stalled work routes to you, you process the exception (reply or skip), and it goes back on the wire. Instead of you polling many sessions to find the one that needs you, each stuck session raises its hand and Trail Boss presents them as one prioritized queue — most-stuck first. Read the context, give the order (reply), or wave it on (skip).

┌─ TRAIL BOSS ────────────────────────────────────────────── 3 stuck ───┐
│                                                                        │
│  ▶ api-gateway         PERMISSION   stuck 2m14s                        │
│      wants to run: terraform apply -target=module.lb                   │
│      [a]llow  [d]eny  [e]dit  [s]kip  [o]pen pane                      │
│  ──────────────────────────────────────────────────────────────────  │
│    search-index        PLAN         stuck 0m48s                        │
│      proposed plan: "Add incremental reindex on write…" (42 ln)        │
│  ──────────────────────────────────────────────────────────────────  │
│    docs-site           QUESTION     stuck 0m11s                        │
│      "Version the API reference per release, or keep one rolling page?"│
│                                                                        │
│  [tab] next   [enter] focus   reply ▸ ____________________________     │
└────────────────────────────────────────────────────────────────────────┘
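The ordering in the mockup above can be sketched in a few lines. This is a minimal illustration, not the project's implementation: it assumes "most-stuck" means "blocked for the longest time", and the `StuckSession` type, its fields, and the `rank_queue` / `format_stuck` / `render` helpers are hypothetical names introduced here.

```python
from dataclasses import dataclass


@dataclass
class StuckSession:
    name: str             # hypothetical session label, e.g. "api-gateway"
    kind: str             # PERMISSION, PLAN, or QUESTION, as in the mockup
    blocked_since: float  # epoch seconds at which the session stalled


def format_stuck(seconds: float) -> str:
    """Render a duration in the mockup's style (minutes, then zero-padded seconds)."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes}m{secs:02d}s"


def rank_queue(sessions, now):
    """Return (session, stuck_seconds) pairs, longest-stuck first (an assumed ordering)."""
    rows = [(s, now - s.blocked_since) for s in sessions]
    return sorted(rows, key=lambda row: row[1], reverse=True)


def render(sessions, now):
    """Produce plain-text rows: a header with the count, then one line per session."""
    lines = [f"TRAIL BOSS  {len(sessions)} stuck"]
    for session, stuck in rank_queue(sessions, now):
        lines.append(f"{session.name:<20}{session.kind:<13}stuck {format_stuck(stuck)}")
    return lines
```

Passing `now` explicitly rather than reading the clock inside `rank_queue` keeps the ordering deterministic and easy to test.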

Why

Long-form agentic coding runs many sessions in parallel, one per terminal window. Each periodically stalls waiting on a human:

a permission prompt (run this command? edit this file?)
a plan waiting for approval
a clarifying question
or it simply finished its turn and is idle, wanting the next instruction

Discovering those stalls by manually cycling windows is the bottleneck — with N sessions, most of your time goes to finding the one that needs you, not answering it, and a session can sit blocked for minutes while otherwise-parallel work waits. The human is the scarce resource; the system should route the human's attention, not the other way around — engaging you by exception, not on every step.

How it works

Each agent session emits a signal the moment it blocks, using two Claude Code hooks: Stop (the turn finished and the session is idle, awaiting the next prompt) and PermissionRequest (a hard approval gate). A small always-on collector tracks every session's state, tails the transcript to extract what is being asked, and serves a single-pane queue ranked most-stuck-first. You answer or skip, and your reply is delivered back into the exact session by one of two routes: tmux send-keys, which layers onto an existing terminal workflow without rewriting it, or the Agent SDK's canUseTool / streaming input, a cleaner programmatic substrate. The full design is in docs/plan/plan.md.
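The collector's state tracking might look roughly like the sketch below. It is an illustration under stated assumptions, not the design in docs/plan/plan.md: it assumes each hook forwards its JSON payload with `session_id`, `hook_event_name`, and `transcript_path` fields, and it assumes `UserPromptSubmit` and `PostToolUse` events can be treated as "the session resumed". The `Collector` class and the `IDLE` / `PERMISSION` labels are hypothetical.

```python
import json
import time

# Hook events that mark a session as blocked, mapped to a display label.
BLOCKING_EVENTS = {"Stop": "IDLE", "PermissionRequest": "PERMISSION"}

# Assumption: these events indicate the session is running again.
RESUME_EVENTS = {"UserPromptSubmit", "PostToolUse"}


class Collector:
    """Tracks which sessions are currently blocked, keyed by session id."""

    def __init__(self):
        self.blocked = {}  # session_id -> {"kind", "since", "transcript"}

    def ingest(self, raw_event, now=None):
        """Apply one hook payload (a JSON string) to the tracked state."""
        event = json.loads(raw_event)
        now = time.time() if now is None else now
        session_id = event["session_id"]
        name = event.get("hook_event_name")
        if name in BLOCKING_EVENTS:
            self.blocked[session_id] = {
                "kind": BLOCKING_EVENTS[name],
                "since": now,
                # Kept so the transcript can be tailed later for the actual ask.
                "transcript": event.get("transcript_path"),
            }
        elif name in RESUME_EVENTS:
            # The session moved on; drop it from the queue if it was there.
            self.blocked.pop(session_id, None)
```

Unknown event names are ignored rather than treated as errors, so a collector like this would tolerate hooks it does not care about.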

Repository layout

README.md — this file
docs/plan/plan.md — the complete design: problem, capabilities, architecture, phases, open questions
docs/research/claude-code-mechanics.md — the Claude Code primitives for detect / correlate / deliver
docs/research/related-work.md — public prior art and how Trail Boss differs
docs/notes/decisions.md — naming rationale and key design decisions

Status

Research / design. No implementation yet. The detection model is settled (Stop + PermissionRequest are the two load-bearing signals); the next step is the collector plus the session→pane registry.
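As a rough picture of the session→pane registry and the tmux delivery route, consider the sketch below. It assumes each session registers the tmux pane id it runs in (the kind of value tmux exposes as `$TMUX_PANE`, such as `%3`). `PaneRegistry`, `send_keys_commands`, and `deliver` are hypothetical names, and the real registry may be keyed or persisted differently.

```python
import subprocess


class PaneRegistry:
    """Maps an agent session id to the tmux pane it runs in."""

    def __init__(self):
        self._panes = {}

    def register(self, session_id, pane_id):
        # pane_id is a tmux pane identifier such as "%3".
        self._panes[session_id] = pane_id

    def pane_for(self, session_id):
        # Raises KeyError for an unknown session, so a reply is never misrouted.
        return self._panes[session_id]


def send_keys_commands(pane_id, reply):
    """Build the two tmux invocations that type a reply and then press Enter."""
    return [
        # -l sends the text literally; "--" stops a reply like "-y" being read as a flag.
        ["tmux", "send-keys", "-t", pane_id, "-l", "--", reply],
        ["tmux", "send-keys", "-t", pane_id, "Enter"],
    ]


def deliver(registry, session_id, reply, run=subprocess.run):
    """Send the reply into the pane registered for this session."""
    for cmd in send_keys_commands(registry.pane_for(session_id), reply):
        run(cmd, check=True)
```

Building the commands as argument lists avoids shell quoting problems in free-text replies, and the injectable `run` parameter lets the delivery path be exercised without a live tmux server.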