Human in or out of the loop: autonomy levels & guardrails
Yesterday you closed the loop. Today you decide who drives it — and build the rails that let you hand over the wheel without crashing.
Autonomy is a dial, not a switch: you automate the mechanical parts of the loop (production, tagging, allocation) and keep humans on judgment (taste, brand, ethics, crowning winners) — and you earn your way up the dial as guardrails and a track record mature. You never start fully autonomous.
For eighteen days you built a machine. The genome that labels every creative at birth (Day 4). The matrix that turns one idea into fifty tagged assets (Week 2). The AI production line that ships them (Week 3). A clean test structure and a fitness function that defines "winner" by the deepest money event (Day 16). The dissection that reads winners down to elements (Day 17). And yesterday, the recursion: insights flow into the Creative Memory and brief the next batch, so quality compounds batch over batch (Day 18).
Now the operational question that decides whether this is a job or a system: which steps does a human still touch, and which run themselves? Get this wrong in one direction and you never scale — you're a very organised manual labourer. Get it wrong in the other and you scale a mistake at the speed of compute. Today is about finding the right rung, and the rails that let you climb safely.
1The four autonomy levels
Think of the loop as a car and yourself as the driver. Some days you want both hands on the wheel; some days the system can hold the lane while you watch the road. The dial has four rungs. At every rung the loop runs — what changes is how much of it the human does versus the system.
◀ More control, less scale · The dial: how much of the loop the human runs vs. delegates · More scale, more earned trust ▶
Read the dial top to bottom and you'll notice the human never leaves — they just move up the value chain. At L0 the human is a builder. At L3 the human is a policy-setter and auditor. The mechanical work (generating variants, applying genome tags, shifting €5 from a laggard to a leader) drains downward into the system at every rung. The judgment work (is this on-brand? is this claim true? is this winner real or noise?) stays human all the way up. That is the whole design principle: delegate the mechanical, retain the judgment.
Crucially, the system's two loops don't change with autonomy. Meta's inner loop — Advantage+ Creative and the auction allocating budget among the creatives you gave one ad set (Day 16) — runs automatically regardless of your rung. What the dial governs is your outer loop: who decides what to make next. L0 means a human writes every brief; L3 means the Creative Memory writes it and a human audits. The recursion from Day 18 is identical at every level — only the hands on it differ.
2The guardrails that let you climb
You cannot climb the dial on optimism. You climb it on guardrails — the hard rails that make delegation safe. The logic is simple: the more of the loop you let run unattended, the more constraints you must pre-commit so the unattended parts can't wander somewhere expensive or off-brand. Guardrails are what convert "I hope the system behaves" into "the system cannot do the thing I'm afraid of." Here is the standing set.
Notice the pattern: most rails are system-held (the machine enforces them on itself), but the two that touch taste, ethics, and truth — the compliance gate and the veto — are human-held, and they stay human at every rung including L3. A guardrail isn't a suggestion the system can override when convinced; it's a wall. The whole reason you can sleep while an L3 loop runs is that the worst outcome is already bounded before you went to bed.
3How to choose your rung — and how to move up
Your level is not a personality; it's a function of three inputs: risk tolerance (how expensive and how reversible is a bad batch?), volume (how many batches per week — is manual even physically possible?), and maturity (how rich is your Creative Memory, and how good is the system's track record?). A new account with a thin Memory and a regulated product belongs at L0–L1. A six-month-old engine with a fat Proven Elements Library, a populated Graveyard, and three batches of clean compounding has earned L2, and is a candidate for bounded L3 on its lowest-risk cells.
Here's a concrete read of how you'd actually move up. Suppose your engine ships 30 tagged variants a week across 5 concepts. At L0/L1, a human reviews all 30 plus does the buy and the read — call it ~6 hours of human attention per batch. The Creative Memory is young: maybe 4 proven elements scored, win-rate noisy at ~12%.
Three months on, the Memory has 19 proven elements segmented by persona, a Graveyard of 30+ dead ideas you'll never re-test, and win-rate has climbed to ~24% with CPA trending down batch over batch (exactly the compounding you proved in Day 18). Now the system's brief-from-Memory is demonstrably as good as the human's, and the dissection it drafts matches the human's read 9 times in 10. You move to L2: the system launches and allocates inside a €150/day batch cap, auto-kills any variant 30% below the hook-rate average after 48 hours, and you only show up at two checkpoints — crowning winners and the brand gate. Human attention per batch drops to ~90 minutes, and throughput doubles because nothing waits on your inbox.
That is the move: you earn each rung with evidence. The promotion criterion is never "I'm busy" — it's "the system's autonomous decisions have matched my judgment, on this kind of cell, often enough that I trust the guardrails to catch the rest." You ratchet up one rung, on one low-risk cell type, watch it, then widen. You never wake up one morning and flip the whole engine to L3 because a dashboard told you your Opportunity Score would rise.
This is exactly the L0–L5 self-driving ladder. L0 is your hands on the wheel the whole way. L2 is adaptive cruise with lane-keep — the car holds the lane on the motorway, but you're watching and you can grab the yoke in a heartbeat. L3 is the car driving the route while you supervise within a defined domain. Nobody sane skips to "no driver" on day one — and nobody ships it without the brakes, the lane sensors, and the big red disengage button wired in first. The guardrails aren't bureaucracy; they're the brakes. You don't earn the autonomy and then add the safety systems. You build the safety systems and that is what earns you the autonomy.
Most of the dial is wired with two surfaces you already have. In Ads Manager, Automated Rules encode the system-held guardrails — spend caps, circuit breakers, fatigue triggers, the minimum-data floor — as if/then conditions that run on a schedule and either act or alert. The human-held rails (the compliance gate, the veto) live as checkpoints in your weekly cadence, plus the AI-disclosure checkbox at the ad level. A creative-intelligence tool (Motion, Atria, Rule1 from Day 17) feeds the dissection that the rules read. Here's an L2 ruleset for one test cell:
Two rows are deliberately not automated: crowning the winner waits for a human checkpoint even when the data floor is met, and any compliance flag halts and escalates. Everything mechanical runs; everything that's judgment stops for a person. That single screen is your autonomy level.
Both ends of the dial are traps. The first: going fully autonomous before the guardrails and data exist. Hand an L3 loop a thin Creative Memory and a noisy fitness function and you don't get scale — you get a flawed objective optimised at the speed of compute. The system confidently breeds more of whatever its half-formed map calls a "winner," and because no one's crowning by hand, it ships fast, confident garbage. The opposite trap is quieter and just as costly: staying fully manual forever. Day 1's entire premise was that creative is the last lever with real human leverage — but leverage means scale. An engine you personally hand-crank for every batch is a hobby, not a system; it will never out-produce fatigue (element F, the clock that climaxes in Day 20). Your edge is treating autonomy as something you earn one rung at a time, on evidence — automating the mechanical the moment it's safe, and never delegating the taste, the brand, or the truth.
Today's recap — 30 seconds
- Autonomy is a dial, not a switch. Four rungs: L0 Manual → L1 Assisted → L2 Supervised → L3 Bounded-autonomous. At every rung the human moves up the value chain, never off it.
- Delegate the mechanical, retain the judgment. Production, tagging and budget allocation drain into the system; taste, brand, ethics and crowning winners stay human at every level.
- Guardrails let you climb safely: spend caps, the human-held compliance gate, a minimum-data threshold (the media-buying course's noise trap, raised to the creative), explore/exploit bounds, fatigue triggers, circuit breakers, and the human veto.
- You earn your rung on evidence — risk, volume, and a mature Creative Memory + track record. The system's autonomous calls must match your judgment before you widen its leash. Never start at L3.
- Both failure modes cost you: autonomous-too-early = fast confident garbage; manual-forever = no scale, and fatigue wins.