The Agent Loop Conflict

The goal is to free up human attention. Removing the human from the agent loop is only one tactic, and it can create fragility if we remove judgment too.

A useful frame for this is the Theory of Constraints, especially the five focusing steps. First find the constraint. Then exploit it: make the most of the scarce resource before adding more complexity. In this case, the scarce resource is human attention.

The gap

Agent systems promise faster work, but human approval prompts often make the loop slow and noisy. The work stops moving while it waits for approval, which can increase wait time considerably.

That pulls the operator back to the screen. Instead of freeing attention, the agent creates a new monitoring task: keep watching, so you can approve the next step when the work gets stuck.

The impact is not only speed. If the human spends attention on mechanical prompts, there is less attention left for reading the result, noticing the missing assumption, or deciding that the task is pointed in the wrong direction.

The change we are pushed toward

The immediate change we are pushed toward is to remove the human from the loop. Let the agent run. Let the loop move faster. Stop spending scarce attention on every tiny yes or no.

But from a higher level, removing the human is not the real goal. The real goal is to free up human attention for more important activities: judgment, steering, understanding, and deciding what should happen next.

That matters because this change has a real negative. If we remove the human from the wrong part of the loop, we also remove judgment. Nobody may stop to ask whether the work is correct, useful, safe, or worth doing at all.

The pro/con cloud

The goal is to free up human attention. Removing the human from the loop is a tactic for achieving that goal, not the goal itself.

Goal

Free up human attention so it can be used for the judgment, steering, and understanding that matter.

+ Change

Human attention is freed from tiny approvals that the environment could have handled, and the operator is not forced to make decisions from a tiny slice of misleading local context.

+ No change

Human involvement makes the work steerable. People can understand the changes being made, notice shortcomings in the approach, and redirect the agent toward better solutions.

Change

Remove the human from the agent loop.

No change

Keep the human in the agent loop.

- Change

The agent can make the best local decision and still be wrong globally. It may change the wrong thing, use the wrong approach, follow bad assumptions, or try to fix something that should not be fixed at all.

- No change

Human-in-every-step can feel faster locally while not improving total lead time. The developer sees the agent doing work, but the whole change still takes the same amount of time, or longer, because the loop is full of interruptions and approvals.

Threat

Agents may take more valuable attention from the human without gaining anything. The system feels faster locally, but becomes more fragile.

Another change

Remove the human from mechanical approval, but keep the human in judgment.

Finding the injection

The conflict is not really between humans and no humans. The goal is to free up human attention. Removing the human from the agent loop is only a tactic. The conflict is between removing the human from the loop and keeping the human in the loop. There are four moves: improve the change side, improve the no-change side, choose when to apply each (called when/when-not), and make another change.

To improve the change side, look at what we already do with junior developers. We do not approve every keystroke. We let them make a solution, then review the result. If the solution is inadequate, we give more instructions, clarify the goal, and sometimes start over.

Applied to agents, that means letting the agent work without tiny approvals, but reviewing meaningful units of work: the plan, the assumptions, the diff, and the result.

To improve the no-change side, keep what we wanted from human involvement: steering, understanding, and correction. Remove the low-value interruptions. The human stays involved where understanding and correction matter, not where the system only wants another yes or no.

For when/when-not, there are still tasks where humans already need control: working with production systems, gathering information the agent cannot gather itself, or doing interactive work that requires human attention. In those cases, keep the human in the loop.

But for bounded, reversible tasks that do not require human input while they run, do not keep the human in every step.

A final move is another change: split the loop apart. Remove the human from mechanical approval. Keep the human in judgment.

That means designing the environment before the agent starts. Safe files are readable. Needed commands are executable. Secrets, private files, and dangerous paths are unreachable.

The human decides what kind of room the agent may work in, then spends attention on the shape and quality of the work.

Making the most of attention

This is not a strange idea. Companies already work this way. We hire people into specific roles so not every task and decision consumes the same person's attention. Once we treat human attention as the constraint, the next question is how to exploit it: how do we make the most of this scarce resource?

Agents should be understood in the same way. The goal is not to get rid of the human. The goal is to free human attention so it can be used where it matters more. That is what leads to these moves: improve the change side, improve the no-change side, use when/when-not, and finally make another change. Remove attention from mechanical approval, and spend it on judgment.