The agent-native SDLC

Agents write the code. Intent and evidence are your job.

Waterfall, Agile, and Scrum were scaffolding for human limitations. Agents don't have those limitations. Ideas 2 Execution (I2E) is the loop that's left: declare the intent, let agents build, force the evidence, adapt, and never stop.

Private early access · No CI, no daemon, no ceremony

i2e · capability: shorten-url

code-generated · Agent · pytest

no-open-redirect · Agent · pytest

redirect-latency-p95 · Agent+Provider · datadog

brand-feel · Agent+Provider · human

Agent checks green → ships · Agent+Provider verdicts feed Adapt

The loop

One loop. Four phases. It never ends.

Ideas 2 Execution runs the IDEA loop (Intent, Develop, Evidence, Adapt): the whole development lifecycle, minus the ceremony.

Targets that come back unmet feed the next loop. A fixed bug becomes a Case that gates every future build. The loop never ends — that's the point.

The shift

Your methodology was a workaround for being human.

Every process the industry adopted in the last forty years exists to compensate for a human limitation. Agents don't share those limitations — so the process became overhead.

The human limit

Humans can't hold an entire system in their head.

The patch we shipped

Waterfall froze the spec up front so no one had to.

In an agent world

Agents hold the whole codebase in context. A frozen spec is just stale.

The human limit

Humans misjudge scope and miscommunicate intent.

The patch we shipped

Agile shipped in small increments to catch the drift early.

In an agent world

Agents land a change in minutes. The increment is now a single loop.

The human limit

Humans lose context and forget to coordinate.

The patch we shipped

Scrum added standups, planning, and retros to re-sync the team.

In an agent world

Agents don't lose context and don't attend standups. The ceremony is pure cost.

The human limit

Humans make mistakes line by line.

The patch we shipped

Code review put a second human in front of every diff.

In an agent world

No human reads most of the diff anymore. Line-by-line review can't scale to agent output.

Strip away the scaffolding and two things remain — the two things every process was ever a proxy for: did we build what we meant to (intent), and is there proof it works (evidence)?

The trap

An AI that writes its own tests will pass its own tests.

That isn't a failure. It's exactly what the model is built to do.

A model optimizes for the signal you hand it. When it also gets to define that signal, a green suite proves nothing — it's the agent grading its own homework. Velocity measured against self-authored tests is theater: you can ship a thousand passing checks and move no metric that matters. I2E forces the evidence apart from the agent by splitting it in two.

Agent

What the agent can prove.

A Case is a check the agent fully controls and runs on demand: a unit test, an API probe, a type-check. Deterministic, repeatable, and yes, often agent-authored. Cases gate the ship. But passing every Case you wrote is the floor, not the finish line.

Agent+Provider

What the agent cannot fake.

A Target is a verdict that lives outside the agent's reach: an external metric, a measurement that needs time to elapse, or a human's judgment. The agent can't write a Target green. Targets are where you learn whether the work actually moved the needle.

Cases keep the build honest. Targets keep the business honest.
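One way to picture the split is as a ship gate that only looks at Cases. The sketch below is illustrative Python, not I2E's actual API: the `Claim` and `Kind` types and the `can_ship` function are hypothetical names, and the claims are copied from the shorten-url example on this page. It assumes that Targets never block a ship and are only reported.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List, Optional


class Kind(Enum):
    CASE = "Agent"             # the agent can run and prove this itself
    TARGET = "Agent+Provider"  # the verdict comes from outside the agent


@dataclass(frozen=True)
class Claim:
    name: str
    kind: Kind
    provider: str  # who collects the evidence, e.g. "pytest" or "human"


# The four claims shown for the shorten-url capability.
SHORTEN_URL: List[Claim] = [
    Claim("code-generated", Kind.CASE, "pytest"),
    Claim("no-open-redirect", Kind.CASE, "pytest"),
    Claim("redirect-latency-p95", Kind.TARGET, "datadog"),
    Claim("brand-feel", Kind.TARGET, "human"),
]


def can_ship(claims: List[Claim], verdicts: Dict[str, Optional[bool]]) -> bool:
    """Ship only when every Case is green.

    A missing or pending verdict (None) counts as not green.
    Targets are ignored here (assumption: they inform Adapt, not the gate).
    """
    return all(
        verdicts.get(claim.name) is True
        for claim in claims
        if claim.kind is Kind.CASE
    )
```

With both Cases green and the latency Target red, `can_ship` still returns True. A red or missing Case blocks the ship no matter how the Targets look.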

Command center

A live, interactive view of the loop.

Ideas 2 Execution comes with a web console to monitor every agent, drill into the evidence, and force the verdicts that only you can give.

I2E CLI / Web Console · v0.4.2-alpha

[Image: I2E Web Console showing agent progress and forced evidence]

$ i2e force

The framework

A harness for code no human will read.

When agents write in parallel and humans review almost none of it, you need somewhere to stand. I2E is that vantage point.

Forced evidence

Every Capability names a provider for each claim of success. You cannot declare a result you have no way to collect. No aspirational metrics — ever.
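The rule above amounts to a validation step at declaration time. Here is a minimal sketch, assuming a Capability is a mapping from claim name to provider name. The `KNOWN_PROVIDERS` registry, `MissingProviderError`, and `validate_capability` are hypothetical names for illustration, and the provider list is just the three that appear in the shorten-url example.

```python
from typing import Dict, Optional

# Hypothetical registry; these three come from the shorten-url example.
KNOWN_PROVIDERS = {"pytest", "datadog", "human"}


class MissingProviderError(ValueError):
    """Raised when a claim has no way to collect its evidence."""


def validate_capability(name: str, claims: Dict[str, Optional[str]]) -> bool:
    """Reject a Capability if any claim lacks a usable provider."""
    problems = []
    for claim, provider in claims.items():
        if not provider:
            problems.append(f"{claim}: no provider named")
        elif provider not in KNOWN_PROVIDERS:
            problems.append(f"{claim}: unknown provider {provider!r}")
    if problems:
        raise MissingProviderError(
            f"capability {name!r} rejected: " + "; ".join(problems)
        )
    return True
```

A claim such as "users love it" with no provider fails here, before any agent spends a token on it.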

Cases vs. Targets

The framework separates what an agent can verify itself from what only an external system, the passage of time, or a person can confirm. Self-grading can't hide here.

Parallel agent loops

An orchestrator plans a batch of non-conflicting Capabilities each tick and runs them in isolated worktrees. Agents just write code — the framework keeps them from colliding.
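How the orchestrator decides that two Capabilities conflict is not spelled out here. The sketch below assumes the simplest definition: two Capabilities conflict when they expect to touch the same file. `plan_batch`, the path sets, and the `expand-url` and `link-stats` capability names are all hypothetical.

```python
from typing import List, Set, Tuple


def plan_batch(
    pending: List[Tuple[str, Set[str]]], max_agents: int = 3
) -> List[str]:
    """Greedily pick Capabilities whose file sets do not overlap.

    `pending` is in priority order; each entry is (name, paths it may touch).
    A Capability that overlaps an already selected one waits for a later tick.
    """
    batch: List[str] = []
    claimed: Set[str] = set()
    for name, paths in pending:
        if len(batch) >= max_agents:
            break
        if claimed.isdisjoint(paths):
            batch.append(name)
            claimed |= paths
    return batch


pending = [
    ("shorten-url", {"app/links.py"}),
    ("expand-url", {"app/links.py", "app/redirect.py"}),
    ("link-stats", {"app/stats.py"}),
]
print(plan_batch(pending))  # ['shorten-url', 'link-stats']
```

Each selected Capability could then get its own checkout, for example with `git worktree add`, so that agents never share a working directory.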

Full agent visibility

See every in-flight agent, its current step and progress, and the evidence behind each verdict — drill from a green light down to the exact failing query.

A human evidence harness

Some proof only a person can give. A pending queue and a live dashboard are where humans mark, judge, and demonstrate the evidence an agent cannot produce.

The loop never ends

Bugs become permanent Cases. Unmet Targets reopen the loop. Code is an output that regenerates; your intents and evidence are what persist.
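The two rules in this paragraph can be sketched as a small Adapt step. This is a hypothetical data model, not I2E's: `Capability`, `record_fixed_bug`, `adapt`, and the `regression-` naming scheme are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Capability:
    name: str
    cases: List[str] = field(default_factory=list)
    reopened: bool = False


def record_fixed_bug(cap: Capability, bug_id: str) -> str:
    """Turn a fixed bug into a Case that stays on the Capability for good."""
    case = f"regression-{bug_id}"
    if case not in cap.cases:  # recording the same bug twice is a no-op
        cap.cases.append(case)
    return case


def adapt(cap: Capability, target_verdicts: Dict[str, Optional[bool]]) -> List[str]:
    """Reopen the Capability if any Target came back unmet.

    True means met, False means unmet, None means still pending.
    Only an explicit False reopens the loop.
    """
    unmet = [name for name, met in target_verdicts.items() if met is False]
    cap.reopened = bool(unmet)
    return unmet
```

Nothing here ever removes a Case. In this sketch the list only grows, which matches the idea that the code can be regenerated while the intents and evidence persist.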

The /context folder

Expert knowledge, picked up automatically.

DESIGN.md, ARCHITECTURE.md, SECURITY.md, and anything else your domain experts care to drop in, live in /context. The framework reads them when it's planning, building, and grading evidence, so the rules in those documents shape every Capability without anyone having to repeat them. Write it once, and every agent in the loop follows it.
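As a rough picture of "picked up automatically", the sketch below reads every Markdown file in a `context` directory and joins the files into one preamble that could be handed to a planning, building, or grading step. The function names, the `.md` filter, and the preamble format are assumptions. How I2E actually loads and applies these files is not documented here.

```python
from pathlib import Path
from typing import Dict


def load_context(repo_root: str) -> Dict[str, str]:
    """Return {filename: text} for every Markdown file in <repo_root>/context."""
    context_dir = Path(repo_root) / "context"
    if not context_dir.is_dir():
        return {}
    return {
        path.name: path.read_text(encoding="utf-8")
        for path in sorted(context_dir.glob("*.md"))
    }


def build_preamble(docs: Dict[str, str]) -> str:
    """Join the documents under their filenames, in a stable order."""
    return "\n\n".join(
        f"## {name}\n{text.strip()}" for name, text in docs.items()
    )
```

Because the files are read fresh on each call, a new SECURITY.md dropped into the folder would apply from the next loop without any other change.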

Velocity, honestly

Velocity is not how much code you shipped. It's whether the code moved a metric you meant to move.

An agent can spend a million tokens producing a thousand green tests against a goal nobody set. That isn't speed — it's expensive motion. A real loop closes on intent and evidence: every cycle either moves the needle or tells you plainly that it didn't. When it didn't, you stop paying for it.

From the field

Less reviewing diffs. More steering intent.

“We had agents shipping code all day and no idea if any of it mattered. I2E made the question unavoidable — which Target did this move? Half our backlog quietly evaporated.”

Priya Natarajan

VP Engineering, Ledgerwise

“Our agents' test suites were always green. Always. The first time we forced Targets, we found three "done" features that moved nothing. That's the day the number started meaning something.”

Marcus Feld

Head of Platform, Nimbus

“Four agents, four parallel loops, one dashboard telling me exactly what each is doing and what needs me. I stopped reviewing diffs and started steering intent.”

Dana Okoro

Director of Engineering, Arcline Systems

Questions

Frequently asked

Still curious? Reach us at hello@i2e.io.

Workshops

We can teach your team to run the IDEA loop.

Half-day and multi-day workshops, in person or remote. We walk your engineers through declaring intent, wiring providers, and standing up the evidence harness in your own repos — so when we leave, the loop is running and your team owns it.

Let's discuss workshops

Scoped engagements from a single team kickoff to multi-week rollouts.

Intent · shorten-url

Develop · 3 agents · isolated worktrees

Evidence · shorten-url · evidence

Adapt · Target came back unmet

Frequently asked

Isn't this just another methodology?

If agents write the tests, why trust the tests?

Do I have to change where my code lives?

What do I actually do as a human?

Does this need CI, a server, or a daemon?

How does it handle many agents at once?

Do we still need developers?

How many agents does this support?
