every line, accountable
Git stops at "who committed it". PaellaDoc keeps the chain: spec → agent conversation → code → e2e proof. Open any line months later, refactor without fear, onboard without ceremony. The thread doesn't break.
Five branches. A Claude conversation about architecture from three days ago. Prompts in three providers. Decisions in Slack. Specs in Notion. Sensitive context that doesn't want to live on anyone's cloud. PaellaDoc orchestrates everything that happens before — and after — the code itself. From the AI conversation that produced the line to the Playwright e2e video that proved it works. On your machine.
Categories don't die when products disappear. They die when they answer the wrong question.
The IDE optimizes one metric: friction between intent and code in the repo. Cursor is the culmination of that story, not its rupture. If your question is "how do I write code faster?" — Cursor is the best answer.
That stopped being the question.
You don't open your machine to write code. You open it to a Claude conversation from three days ago about architecture. A ChatGPT session with prompts now worth gold and lost. Five active branches. Notion docs. Linear tasks. Slack decisions that change today's code. Three LLM providers with different policies. Context too sensitive for anyone's cloud.
The bottleneck moved. It's not typing. It's coherence — between intent, context, decisions and models.
Three things IDE + AI structurally cannot solve
These aren't missing features. They're architecturally external to the IDE concept. No IDE will solve them, no matter how much AI gets bolted on.
PaellaDoc is what comes after.
CI runs the tests the agent wrote. PaellaDoc runs Playwright e2e against your live product in Docker, with the agent's code applied, and writes screenshots + video + trace to disk. No evidence, no green light. The agent doesn't decide. The gate does.
Everything on your disk, in plain SQLite. No telemetry, no cloud, no vendor cage. Bring any model — Claude today, Codex tomorrow, a local Llama next week. The brain changes; your context doesn't.
A year goes by. You open a story you barely remember. You see exactly who wrote which line, why, what spec asked for it, what test proved it works, and what depends on it. No archaeology. No "who shipped this?" No abandoned PRs from agents nobody can debrief.
# your fleet
YOU (1 human)
├── orchestrator · paelladoc-ado · local
│   ├── project: acme/portfolio-v3
│   │   ├── worktree-01 → claude · story-014 · ✓
│   │   ├── worktree-02 → codex · story-015 · ↻
│   │   └── …16 more
│   ├── project: paelladoc/ado-core
│   │   ├── worktree-01 → codex · refactor · ✓
│   │   └── …23 more
│   └── project: cabin/local-llm-bench
│       └── worktree-01 → llama-local · ✓ (offline)
└── context_store · sqlite · ~/.paelladoc/
    ├── chats.db      2.4 GB
    ├── decisions.db  180 MB
    └── artifacts/    11 GB
Every agent runs in its own git worktree. Its own branch, its own filesystem view, its own conversation. Cleanup is automatic. Merge wars stop being a thing.
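In plain git terms, "one worktree per agent" amounts to something like the sketch below. This is illustrative only: the branch naming, directory layout, and cleanup policy are assumptions, not PaellaDoc's actual scheme.

```python
import os
import subprocess
import tempfile

def spawn_agent_worktree(repo: str, story: str) -> str:
    """Create an isolated git worktree + branch for one agent task.
    Branch/dir naming here is hypothetical, not PaellaDoc's scheme."""
    branch = f"agent/{story}"
    path = os.path.join(tempfile.mkdtemp(), story)
    subprocess.run(
        ["git", "-C", repo, "worktree", "add", "-b", branch, path],
        check=True,
    )
    return path  # the agent gets its own filesystem view and branch

def cleanup_worktree(repo: str, path: str) -> None:
    """Remove the worktree once the story merges or is abandoned."""
    subprocess.run(
        ["git", "-C", repo, "worktree", "remove", "--force", path],
        check=True,
    )
```

Because each agent edits files under its own worktree path on its own branch, two agents can touch the same file without ever seeing each other's half-finished state; conflicts surface only at merge time, per story.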
You are not the bottleneck. Queue 50 user stories before bed; wake up to 50 worktrees finished, replayed, and ready to review at your speed.
The orchestrator is on your machine. The context is on your machine. Each AI developer calls out to whichever brain you choose — or to a local model you self-host.
The real value of working with AI for months isn't the code it generates. It's everything you build along the way: your decisions, your edge cases, the way your domain actually works.
Most agentic tools quietly turn that into someone else's training data. Or they lock it inside a single frontier model — switch and you lose everything.
PaellaDoc ADO inverts it. Your context lives in plain SQLite, on your disk, in formats you can read. Point any model at it — Claude today, Codex tomorrow, a local Llama from a cabin next week. The brain changes; your knowledge doesn't.
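"Plain SQLite, in formats you can read" means any stock client can open the store. A minimal sketch, assuming only that the file is ordinary SQLite; the actual table names and schema inside `chats.db` are not specified here.

```python
import sqlite3

def inspect_context(db_path: str) -> list[str]:
    """List the tables in a PaellaDoc-style context store.
    Works on any ordinary SQLite file; no vendor SDK required."""
    con = sqlite3.connect(db_path)
    try:
        rows = con.execute(
            "SELECT name FROM sqlite_master WHERE type='table' ORDER BY name"
        ).fetchall()
        return [name for (name,) in rows]
    finally:
        con.close()
```

The point is portability: whatever model you point at the store next week reads the same rows the last one wrote.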
Most agent stacks let the agent declare its own work done. The PR lands. Three weeks later, your test suite catches the lie — if you're lucky. PaellaDoc inverts it: every acceptance criterion runs as a real Playwright e2e against your actual product, in Docker, with the agent's code applied. Screenshots, video, trace — saved to disk before status flips to done. No evidence, no green light.
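The "no evidence, no green light" rule reduces to a file check: status flips only when the e2e artifacts actually exist on disk. A minimal sketch; the artifact filenames below are hypothetical, since the source only says screenshots, video, and trace are saved.

```python
from pathlib import Path

# Hypothetical artifact names; the source only says
# "screenshots, video, trace" are written to disk.
REQUIRED_EVIDENCE = ("screenshot.png", "video.webm", "trace.zip")

def golden_gate(evidence_dir: str) -> str:
    """Flip a story to 'done' only if e2e evidence exists on disk.
    The agent's own claim of success is irrelevant; files decide."""
    root = Path(evidence_dir)
    missing = [f for f in REQUIRED_EVIDENCE if not (root / f).is_file()]
    if missing:
        return f"blocked: missing {missing}"
    return "done"
```

The inversion is structural: the agent can write "done" into its transcript all it likes, but the gate reads the filesystem, not the transcript.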
done when evidence exists and passes

Air-gapped clients. On-prem mandates. Regulated industries that forbid frontier APIs. Or just a cabin off-grid. PaellaDoc ADO keeps the product methodology intact (PRD, epics, stories, acceptance criteria, golden-gate validation) and routes each story to whatever brain your context allows. Same hierarchy. Same evidence trail. Same shipping bar.
Day-0 frontier MoE, running locally. The orchestrator drives it like any other backend — same PRD, same golden gate, same audit trail.
Most teams default to frontier-everything because picking the right model is friction. PaellaDoc inverts it: every acceptance criterion declares its bar — and a per-AC router auto-picks the cheapest model that still passes the golden gate. You stop being the bottleneck. You stop being the wallet.
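The routing logic described above can be sketched in a few lines: pick the cheapest model that still clears the acceptance criterion's bar. The numbers and the single "capability" scalar are assumptions for illustration; the real router presumably scores against golden-gate pass history, not a hand-set integer.

```python
from dataclasses import dataclass

@dataclass
class Model:
    name: str
    cost_per_story: float  # illustrative units
    capability: int        # higher = stronger (assumed scalar)

def route(ac_bar: int, fleet: list[Model]) -> Model:
    """Cheapest model whose capability still clears this AC's bar."""
    eligible = [m for m in fleet if m.capability >= ac_bar]
    if not eligible:
        raise ValueError("no model clears this bar; escalate to frontier")
    return min(eligible, key=lambda m: m.cost_per_story)
```

Routing per acceptance criterion, rather than per project, is what makes "frontier-everything" the exception instead of the default: a boilerplate AC goes to a cheap local model, a gnarly one to the expensive brain.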
~/.paelladoc/chats.db · $ paelladoc cost --report
PaellaDoc is open-core. The desktop orchestrator, context graph, memory ranking, router and execution engine are the product core.
The extension surface is open: plugin SDK, manifest schema, CLI adapters, MCP packs, validators and examples.
Build adapters for your favorite agents. Keep your context local. Don't wait for us to support your stack.
Wire any agent CLI — Claude Code, Codex, Gemini, Cursor, your own — into the orchestrator. The router calls them. The fleet runs them.
Bundle Model Context Protocol servers as plugins. Vector DBs, custom tools, internal APIs — exposed to the fleet, scoped per project.
Custom golden-gate checks. Type purity, license audit, perf budget — your bar, not ours.
Opinionated bootstraps. Next.js + Postgres, Rails + Sidekiq, your house template — not ours.
Reusable skill recipes. PR review, migration scaffolding, test-pyramid generation — packaged, versioned, shareable.
Boundary by design. Plugins cannot access raw chats, KG internals, embeddings, router scoring or secrets unless PaellaDoc grants an explicit local permission.
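At its simplest, a CLI adapter is a thin wrapper that feeds a story prompt to any agent binary and hands the output back to the orchestrator. A sketch of that general shape only; the actual plugin SDK, manifest schema, and adapter interface are not specified in this text.

```python
import subprocess

def run_agent_cli(argv: list[str], prompt: str, timeout: int = 600) -> str:
    """Minimal adapter shape: pipe a prompt to an agent CLI on stdin
    and capture stdout for the orchestrator. Illustrative only; the
    real SDK's interface may differ."""
    result = subprocess.run(
        argv, input=prompt, capture_output=True, text=True, timeout=timeout
    )
    if result.returncode != 0:
        raise RuntimeError(f"{argv[0]} failed: {result.stderr.strip()}")
    return result.stdout
```

Because the contract is just "prompt in, text out, nonzero exit means failure", swapping Claude Code for Codex, or for your own in-house binary, is an `argv` change, not a rewrite.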
Six side-products. One human. The orchestrator runs each as its own fleet, and the context never bleeds between them.
100 stories across a legacy migration. Queue them, assign brains, watch the tree turn green. The PM sees what every agent decided, not just what it shipped.
Owns the model weights. Owns the context. Codes from a cabin. PaellaDoc just routes the work.
Client code never leaves the laptop. Frontier APIs are opt-in per project. Audit trail in plain SQLite.
Same task, three brains, side-by-side diff. Keep the best output. Stop guessing which model is good at what.
“I'm one person. I run several products. I don't want to be a manager of AI developers — I want to be the architect, and let a fleet run.
I also don't want my entire mental model — every conversation, every decision — to live inside a vendor's cloud where it disappears the day I switch tools.
PaellaDoc was built for the new paradigm: context- and spec-driven, above the code itself. The architect writes the spec; the system ships the code. The thesis: 5× productivity. 277★ in April 2025, when nobody was talking about this yet.
Now I'm back. PaellaDoc ADO — chasing 100× for one human.”
free for personal use, forever · 100% local · no account · macOS (Apple Silicon)
team or enterprise? — DM @jlcases on X