Mirza Iqbal

Posted on Jun 17

My self-improving agent setup works because it never edits itself

#ai #claudecode #automation #agents

Everyone wants an agent that rewrites itself.

That dream is seductive. you set it up once, walk away, and come back to a smarter system that improved while you slept. No maintenance, no babysitting, pure compounding.

I run something that improves on its own for weeks at a stretch. And the reason it works is the opposite of that dream. it is never allowed to edit itself.

Let me explain, because the distinction is the whole game.

Everyone wants the wrong thing.

"Self-improving agent" has come to mean autonomy. an agent that changes its own instructions, rewrites its own tools, modifies its own behavior with no human in the loop.

That sounds like progress. It is how you lose control of a system you depend on. An agent that can silently rewrite its own rules can silently break them, and you will not know until something downstream fails in a way you cannot trace back.

More capable self-editing means a larger blast radius the day it edits wrong.

What my setup actually does.

Mine improves through a loop with one hard limit. it can detect, it can propose, it cannot adopt.

Every day it notices things. a tool I depend on shipped a new version. a pattern I corrected twice this week that should become a standing rule. a piece of my own behavior that keeps repeating. It gathers all of that and surfaces it to me.

Then it stops. That change does not happen until I look at it and say yes.

So the system sharpens over time, yet every single improvement passed through a human gate. Nothing enters the setup that I did not see.

Why unsupervised self-editing is a trap.

Picture the alternative I deliberately did not build. an agent that, on its own, decides a rule should change and changes it.

One wrong inference and the rule that protected me yesterday is gone today, silently. The next wrong edit builds on the first. Errors compound in a self-modifying system instead of staying isolated, because each change becomes the foundation the next one reasons from.

By the time you notice, you cannot tell which of fifty unsupervised edits caused the drift. You have traded a system you understand for one that surprises you, and surprise is the last thing you want from the thing you handed your trust to.

Detection is the feature, not autonomy.

Here is what took me a while to see. the valuable part was never the editing. it was the noticing.

Catching that a dependency changed, that a habit had hardened into a pattern, that something I keep fixing should be fixed once and for all. that is the hard part, and that is what I automated. Approving the change is cheap and takes me seconds.

So I automated the watching and kept the deciding. My machine does the part that is tedious and easy to forget. I do the part that needs judgment and carries risk. Each side does what it is actually good at.

My honest opinion.

Here it is. autonomy is the wrong goal for a personal agent system. auditability is the right one.

A setup nobody can audit only looks advanced. it is fragile underneath. My best self-improving system is the one where improvement is real and continuous, and still nothing ever changes without me seeing it first. Those two things fit together. they are the entire design.

Anyone chasing a fully autonomous self-rewriting agent is optimizing for the demo, not for the Tuesday six months out when it quietly edits the wrong thing.

What three weeks of this looks like.

I have not edited the setup by hand in weeks. it is sharper now than when I last touched it.

Not from rewriting itself, but from handing me small, specific, already-reasoned improvements that I approved in seconds. Compounding is real here. control never left my hands.

If you are building anything that improves over time, ask which half you are automating. Automate the noticing. keep the deciding. A system that watches relentlessly and changes only on your yes beats a clever one that surprises you, every single week.

Your turn

Would you let an agent change its own rules without asking you first.

If this was useful

I work through this in public, the wins and the freezes both, mostly on LinkedIn and YouTube. If the real version of building in the open is useful to you, that is where it lives. LinkedIn, YouTube and X under Mirza Iqbal, and the work at next8n.com.

DEV Community

My self-improving agent setup works because it never edits itself

Your turn

If this was useful

Top comments (0)