Agents cannot own ongoing responsibilities — they can only do one-shot tasks

devtools0 views
Every agent interaction is a single synchronous session: you give it a task, it runs, it finishes, it is gone. Agents cannot monitor a system over days, follow up on a PR review cycle, track a customer onboarding process, or watch for regressions after a deploy. So what? The most valuable work in any organization is ongoing ownership, not one-shot tasks. A senior engineer's value is not in writing one function — it is in owning a system: watching for issues, responding to incidents, iterating based on feedback, maintaining quality over time. Agents cannot do any of this. They are expensive temps who disappear after each task. Why does this matter in the first place? One-shot tasks are the easy part of work. Monitoring production after a deploy, iterating on a PR based on reviewer feedback, gradually improving a system over weeks — this is where 80% of engineering time goes and 90% of the value is created. Agents are optimized for the 20% of work that was already the easiest part. The structural reason: agents run on ephemeral compute (API calls, serverless functions) with no persistent process. There is no standard way to define a long-running agent responsibility, trigger re-engagement based on external events (webhook, schedule, state change), or maintain continuity of context across multiple sessions over days or weeks. Building this requires solving agent memory, event-driven orchestration, and cost management simultaneously.

Evidence

No major agent framework supports event-driven long-running agent processes. Claude Code, Cursor, GitHub Copilot all operate in single-session mode. Devin attempted multi-session workflows but reviews indicate poor continuity between sessions. The closest analog is GitHub Actions (event-driven automation) but it has no LLM reasoning or memory.

Comments