MartinLoop — Your AI agent doesn't know when to stop.
Same task. $2.30 with MartinLoop. $5.20 without.
AI agents burn cash retrying blind. MartinLoop sets the budget, verifies
the fix, and exits clean — before the bill arrives. Reproducible via
pnpm --filter @martin/benchmarks eval on the flaky-CI-gate task.
Open source · MIT licensed · martin-loop on npm · Keesan12/Martin-Loop on GitHub
npm install -g martin-loop
martin run "your task" --budget 3 --verify "pnpm test"
martin inspect
Your AI budget is leaking. Engineering knows. Finance doesn't.
- Wasted tokens. By the 5th retry, 80% of what the model reads is its own failed attempts. You're paying for it to re-read garbage.
- Every error gets the same fix: retry. Syntax error? Retry. Hallucination? Retry. Logic bug? Retry. No diagnosis, no targeted fix — just another $2 burned.
- No spending limit. There's no cap inside the loop. Finance sees a $40K bill they can't explain. AI budgets get frozen.
Four things happen on every MartinLoop run.
- Smarter retries. Failed attempts get compressed into structured signals. Token usage stays flat instead of growing with every retry.
- Targeted failure handling. 11 failure types, each with a different response. Syntax → constraint fix. Hallucination → grounding check. Logic → chain-of-thought scaffold.
- Hard budget cap. Set a dollar limit before the run starts. MartinLoop tracks spend in real time and stops before the cap — not after.
- Smart exits. MartinLoop can exit with a partial result when diminishing returns kick in or the budget ceiling approaches. Every run ships something useful.
Pricing
- Open Source — $0. Full governed runtime. MIT licensed. Unlimited loops. No account required.
- Pro — $49/mo. Adds the MartinLoop Dashboard, loop heatmap, and cost tracking history.
- Growth — $149/mo. Failure taxonomy reports, smart model routing, custom budget governors, finance-ready cost reports.
- Enterprise — $499/mo. Hosted control plane, custom classifiers, compliance-ready audit trails, on-prem deployment, SLA.
FAQ
- What is MartinLoop?
- The open-source control plane for AI coding agents. Hard budget stops, JSONL run records, an inspectable audit trail, an 11-class failure taxonomy, and test-verified completion.
- How much can a governed run save vs an ungoverned one?
- On the flaky-CI-gate benchmark, a MartinLoop-governed run completes for $2.30 vs $5.20 ungoverned — a 2.3× reduction. Reproduce with
pnpm --filter @martin/benchmarks eval. - Is it open source?
- Yes. The core is MIT licensed. The hosted dashboard and managed control plane are commercial.
- Who is it for?
- Engineering teams, platform teams, and CTOs running AI coding agents in production who need budget control and auditability.