AgentHerder · Course · Week 3
Week 1 set the policy: every coordination interaction routes through the agent. Week 2 installed the mechanical fleet. Week 3 is what makes the policy real across the long tail — the deep practice of browser automation, the deep practice of comms automation, and an honest answer to the question every cohort asks: "why don't we just make these agents autonomous?"
Each new integration point removes a class of coordination friction. The fleet's effective throughput isn't the agent's raw speed — it's the surface area of what the agent can reach without you, multiplied by how much of your attention is left over for the work that compounds.
Give your agents more and more integration points so that all the small things across every parallel work-stream cancel out.
The implication: at flying-stage, your week's work is shaped by the gaps in integration, not the gaps in skill. The interesting question every week is: what's the next coordination surface I haven't yet routed through the agent? The answer is usually visible in the friction you're still feeling — the dashboard you still log into, the message you still draft from scratch, the partner portal where you still chase MFA codes by hand.
Week 3 is about closing those gaps, deeply.
In the Augmented Mind Course Week 2, browser automation is taught as a baseline — stop clicking through routine SaaS yourself. At flying-stage, the deep discipline starts where the baseline ends.
The first thing that goes wrong when you ask Claude to drive a browser at the fleet level: every workflow needs auth, every auth flow is slightly different, and password managers don't natively talk to Claude's browser tools.
Patterns that work:
op integration). The agent reads the credential at the moment of need, never logs it. Don't put credentials in CLAUDE.md or repo files.The hard part. The agent can drive the username/password step, then hits a TOTP prompt or a push notification. Several patterns:
oathtool or 1password-cli's op item get --otp give the current code via shell. The agent calls the shell tool, gets the code, types it. No human interruption needed if the agent has shell access and the credential is in the vault.The discipline: pick one MFA pattern per integration and standardize it. Mixing patterns mid-fleet (one tab needs you to type a code, another sends a push, a third just stalls) is brain-fry territory.
The pain class you discover only at scale: SaaS dashboards with JS that breaks the agent's selectors after a UI refresh, pages that lazy-load content the agent needs but doesn't wait for, single-page apps where the agent's "click" doesn't actually trigger the framework's handler because it was attached after the click selector was captured.
The deep-practice patterns:
text="Sign in"), ARIA labels (role="button", name="Submit"), or data-test-ids over fragile CSS paths. Class-based selectors break the next time the UI is restyled.The most underrated practice at the fleet level. Claude is conservative by default — when a flow stalls, the default is to report back "I tried, didn't work, here's what happened." Most of the time, the right response is not to accept that.
Most "Claude can't do X" is "Claude tried one approach, it failed, and now needs to try a different one." Your job is to push back.
Concrete escalation ladder when Claude says it can't:
The shift from running to flying: at running stage, "Claude says it can't" becomes "I'll do it myself." At flying stage, "Claude says it can't" becomes "let me find what's stopping it and unblock it." The fleet's coverage of coordination surface keeps growing because you treat agent stuckness as a debug target, not a hand-off.
Same logic, applied to the second-biggest coordination surface: messages. Baseline (Week 1 territory): the agent drafts the routine reply, you approve. Deep practice (this week): the full inbox-to-outbox AI surface.
The pattern: a meta-agent runs periodically (every few hours or on demand), reads new mail, categorises (action-needed, reply-needed, FYI, junk), drafts replies for the reply-needed bucket, queues them for you to skim-and-send. You review the queue once or twice a day instead of context-switching to email constantly.
What this needs:
Two patterns, depending on the org culture:
Specific case worth singling out: scheduling. The agent reads incoming meeting requests, cross-references your calendar, your stated availability rules, the request priority, and either accepts, declines, or proposes alternatives. Often the highest-leverage comms automation because scheduling back-and-forth is uniquely soul-draining and uniquely automatable.
Tools that work today: the Google Calendar MCP (or equivalent) plus a meeting-acceptance skill encoding your rules ("decline anything after 5 PM," "auto-accept 1:1s with my team," "propose alternatives for anything that conflicts with deep-work blocks").
The recurring weekly "what did the team ship" message. At fleet scale, you have so much shipped you can't remember it all by Friday. The agent does:
Your editing time per week: 5–10 minutes. Your time spent constructing the update from memory: previously 30–60 minutes. The compounding adds up.
Every cohort asks it. Usually around Week 2 or Week 3. The framing varies: "why are we still in the loop?" or "couldn't we just let the agents run?" or "what's stopping me from setting up an agent that fixes its own CI failures, deploys itself, and handles inbound emails without me?"
Fred's stance, in his own words:
You need supervision and human guidance — that's why parallelism is important. Parallelism is the human operator. We're not here to replace ourselves with agents that don't need us. The value-add is YOU — the domain expertise, the seeing where the agent should be, the guiding.
Translated into the AgentHerder posture:
This bootcamp is not training you to set up agents that run without you. It's training you to be the irreplaceable orchestrator of a large agent fleet. Those are different jobs.
The autonomous-agents path is a real path — there are products and companies pursuing it. It has different trade-offs: more brittle, harder to course-correct, lower-judgement output, more expensive incident-response when things go wrong (because nobody was watching). At the current state of model capability and tooling, autonomous-agent setups break in ways the human-in-the-loop fleet doesn't.
At fleet level (10+ parallel), the human operator is the source of:
The mental error to avoid: assuming "autonomous" means "more leveraged." At today's capability level, autonomous usually means less leveraged, because the orchestrator role gets shifted to incident-response after the fact instead of to active direction-setting upfront.
This may change as model capability improves. The premise of the AgentHerder practice — that the human operator's judgement is load-bearing — is bet against by some serious people. It's worth knowing this is a position, not a settled fact. But it's the position the course teaches, and it's the one Fred is staking the brand on.
If you find yourself drawn to the autonomous-agents path: that's a legitimate direction, just not this course's direction. The skills overlap, the philosophy doesn't. Don't try to install both at once.
Do this — Assignment 1 of 1
Time estimate: 2–6 hours depending on how custom the integration is and how cleanly the target system exposes the surface you need.
If you find that the most painful gap is something where the integration just doesn't exist yet — no MCP, no clean API, no scriptable surface — that's also valuable information. Write it up. The gaps are where the next generation of tools and skills come from.
Week 4 — Coordination patterns. The Team Ops sweep skill. Cross-stream context sharing. PR-nudging automation. Friday demo + weekly planning at flying-stage. The operational layer that turns 6+ parallel sessions from chaos into flow.
No spam. One email per meaningful update.