AgentHerder · Course · Week 4
Week 2 was the tools. Week 3 was the integrations. Week 4 is the meta-layer — the agents that supervise the other agents, the patterns that move context between streams without you being the human router, the rituals that hold a flying-stage fleet together across a working week.
The deliverable is concrete: by end of week, you have a Team Ops sweep skill installed and running on your own work, and you've demoed it.
At running stage, coordination is mostly you. You decide which tab needs attention. You move information between tabs. You notice which streams are stuck. The whole orchestration happens in your head.
At flying stage, that breaks. 10+ streams means too many tabs to track manually, too much cross-stream context to hold, too many small coordination decisions per hour for the human to keep up. The fleet runs ahead of your ability to supervise it.
What replaces the human coordination is also agents. A class of agents whose job isn't to do the engineering work — it's to watch the agents that do the engineering work, surface what needs attention, and handle the routine coordination so you don't have to. These meta-agents are the difference between flying-stage as sustainable practice and flying-stage as burnout.
Three patterns to install this week:
Plus the Friday demo + weekly planning ritual at flying-stage, which is mechanically different from the running-stage version because of the volume.
The single most-installed pattern in Fred's daily fleet. Worth understanding in detail.
A periodic agent that, on a schedule (every 30 min, every hour, or on-demand when you trigger it), does this sweep:
cctabs sessions + cctabs scrollback.You read the triage list. You approve / edit / send replies via cctabs send. The fleet keeps moving.
Without it: at 10 streams, you're context-switching to every stream every few minutes to check status. Brain-fry by lunch.
With it: you context-switch on a schedule (every 30–60 min), to the streams the meta-agent has surfaced, with the prep already done. Your attention goes to the decisions, not the scanning.
You don't need a complex framework — it's a Claude Code skill file in your top-level orchestrator session. Rough shape:
# Team Ops Sweep
When invoked, do this:
1. Run `cctabs sessions` to list all active tabs.
2. For each tab, run `cctabs scrollback <name>` to read the last
~100 lines of terminal output.
3. Classify each tab's state:
- **Active**: an agent is currently working, no input needed
- **Blocked**: agent is waiting on a user reply or approval
- **Quiet**: agent has been idle >30 min with no apparent work
- **Stuck**: agent appears to be looping or repeatedly retrying
4. For each Blocked tab:
- Read the agent's last message asking for input.
- Read the surrounding context (the work-stream's CLAUDE.md,
NEXT-STEPS.md, recent commits in the repo).
- Draft a 1-2 sentence reply the human would likely send.
5. For each Stuck tab: identify what's failing and propose a
re-prompt or a manual intervention.
6. Produce a triage report:
- Streams needing attention: <list>
- For each: state, draft reply (if Blocked), recommended
action (if Stuck).
- Streams cooking fine: <count, no detail needed>.
7. Surface the report. Don't take actions on other tabs — just
draft what the human should send.This is a starting template. Tighten it to your fleet's specifics — the repos you care about, the kinds of work that show up frequently, the conventions your team uses.
The discipline: the sweep is for your attention, not for the agent's authority. The meta-agent surfaces and drafts; you decide and send. At today's capability level, letting the meta-agent send replies autonomously across the fleet introduces too many silent failure modes.
The Team Ops sweep template above is one skill. Once you're at flying-stage, you're authoring skills constantly — every time a prompt you've typed three times deserves to be extracted, every time a coordination pattern shows up across streams, every time a recurring agent task could be invoked by name instead of re-explained.
A few practitioner-tested rules:
triage-stuck-prs, sync-staging-fixtures, summarize-week-for-friday-demo. Avoid generic names like helper or utils — those won't surface when an agent is scanning the available skill set.description field at the top is what makes the skill discoverable by a sibling Claude session. Treat it as the one-line pitch the agent reads to decide whether this skill is the right tool. Vague descriptions get skipped; specific ones get invoked..md file in .claude/skills/. If the prompt is one-off, leave it inline. The skill library compounds in value when it's curated, not when it's exhaustive.The skill library is one of the artefacts that compounds across months of practice. The team that maintains a shared skills directory in their orchestrator repo (and treats it as a first-class artefact in code review) gets more leverage per engineer over time.
The pattern: information lives in one tab, needs to be used in another, without you carrying it.
docs/decisions/), and the agents in B and C pick it up next time they scan their context.LEARNED.md or equivalent, referenced from CLAUDE.md.cctabs send and cctabs scrollback let one Claude session read another's terminal directly, and a sibling agent can hand context across at runtime.The repo is the bus. Most cross-stream context flows through committed markdown.
When a decision matters across streams, it goes into the repo. When it matters only to one stream, it stays in that stream's working memory. The repo-as-bus pattern is what makes 10+ parallel sessions sustainable — because you're not the routing layer.
Discipline: when an agent surfaces something cross-cutting, capture it to the repo before moving on. A 30-second commit now saves you (or a meta-agent) re-explaining the same thing across 8 tabs later.
The third coordination layer: keeping your shipped work moving through the team's process without you watching it.
A lot of PR-nudging automation overlaps with the Team Ops sweep skill from §1. You can build them as one combined skill, or split them by domain (sweep handles in-flight work; nudging handles shipped work). Either works. The shared underlying capability is: an agent that periodically polls some surface, identifies what needs movement, takes routine actions, surfaces non-routine ones.
The quiet shift that makes high-volume PR review workable: a one-line feature flip can carry the same weight as a huge refactor that doesn't visibly change anything on the surface. Trying to scale per-PR review time with per-PR line count breaks the moment you're shipping 5× the PRs at smaller average size.
The review question stops being "is this code OK" and becomes "is this change moving the work-stream in the right direction." Most of the time the answer is yes and the review takes 30 seconds. The few PRs that need real direction-discussion get it — the rest don't get pretend-attention they don't need.
The coordination primitive most fleets discover only after brain-fry has hit. A 10-minute daily Slack huddle (or equivalent) — no agenda, drop-in, context flows continuously while people work — becomes load-bearing the moment each person is running 10+ parallel streams.
Without it, context fragments across streams: tab A's agent figures something out that would help tab D's agent, but the human running both isn't paying attention to the cross-cutting opportunity. Multiply by N engineers, each running their own fleet, and the team is leaving compounding leverage on the floor.
What this looks like:
What it's not:
For solo operators, the equivalent is harder. A daily voice memo to yourself, or a running scratch file the agent reads at the start of each session, can substitute. Less rich than a real team's ambient sync; better than nothing.
At running stage, Friday demo is straightforward: each engineer demos the week's shipped work, the team updates next week's plan. Maybe an hour.
At flying stage, the volume is different. A 4-engineer team running 8–20 streams each can ship 50+ PRs in a week. A 1-hour demo at that volume becomes a 4-hour slog if you try to demo each PR individually.
The Friday ritual is the heartbeat that keeps the fleet practice from drifting. Without it, individual engineers go heads-down on their fleets for weeks and the team loses shared context about what's being learned. The compounding that makes the practice valuable comes from the team learning, not from individuals running parallel sessions in isolation.
If you're solo running a fleet without a team, run it as a Friday self-demo + planning session. Record yourself, review the next morning. Same discipline, different audience.
Do this — Assignment 1 of 1
Time estimate: 2–4 hours to build and tune the skill; integrated runtime into your normal week.
If this is your first time building a Claude Code skill: the rough shape is a markdown file with a description, the invocation pattern, and the steps the agent should follow. Claude Code's docs cover the format. The Team Ops sweep is a useful first skill to build because it pays for itself the day you finish it.
Week 5 — The emotional spine. The identity work specific to flying stage. Letting go. Brain-fry mitigation. Trusting the structure. The political layer — selling parallelism to a skeptical manager, defending throughput in code review, explaining your work to colleagues who think you're cheating. The week that determines whether the practice sticks past Week 6.
No spam. One email per meaningful update.