What Is CodexSaver?
CodexSaver is an MCP tool built by fendouai that routes low-risk Codex work to a cheaper worker LLM, keeps high-risk judgment in Codex, and is one of the best MCP tools for developers using Codex. The repo reports 97 passing tests in 0.41s on May 12, 2026, and the design targets teams that want lower inference spend without giving up review quality. It ships as a Python-based Model Context Protocol server with presets for DeepSeek, OpenAI, Anthropic, Gemini, Qwen, Ollama, and LM Studio.
Quick Overview
| Attribute | Details |
|---|---|
| Type | MCP Tools |
| Best For | developers using Codex who want to reduce spend on safe tasks |
| Language/Stack | Python, MCP, OpenAI-compatible APIs, TOML/JSON config |
| License | N/A |
| GitHub Stars | N/A as of May 2026 |
| Pricing | Open-Source |
| Last Release | v0.2.0 — May 12 2026 |
Who Should Use CodexSaver?
- Indie hackers shipping MVPs who need test generation, docs, and search work done at a lower token bill without moving their entire workflow off Codex.
- Platform teams that run repeatable maintenance jobs and want a hard split between cheap execution and high-trust review.
- CTOs and engineering leads who need a visible policy layer around model delegation so cost control does not become a hidden side effect.
- MCP power users who already use Codex in multiple repositories and want one global install instead of per-workspace setup churn.
Not ideal for:
- Teams that need everything fully offline and refuse to use any external or local worker model configuration.
- Workflows where every task is highly ambiguous and should stay inside one model with no delegation boundary.
- Orgs that want a general agent swarm manager rather than a router focused on risk, policy, and model spend.
Key Features of CodexSaver
- Risk-based routing — CodexSaver separates tasks by risk and workload, then sends low-risk execution to a worker model while keeping architecture, security, and final review in Codex. The repo explicitly calls out tests, docs, search, and explanation work as the cheap lane.
- Visible interaction blocks — Tool responses include an
interactionblock with fields liketool,mode,headline,route_label, andnext_step. That makes delegation auditable instead of hiding the model handoff behind a silent JSON blob. - Global-by-default install —
codexsaver installwrites a global MCP server entry into~/.codex/config.tomland points it at~/.codexsaver/codexsaver_mcp.py. One setup can then serve every Codex workspace on the machine. - Multi-provider support — DeepSeek is the default, but CodexSaver also ships presets for OpenAI, Anthropic, Gemini, Qwen, Ollama, LM Studio, and custom OpenAI-compatible endpoints. That makes it usable with hosted APIs, local models, or an internal proxy.
- Bounded work packets in v2 — The v2 lane adds exact goals, allowed files or globs, forbidden paths, acceptance criteria, allowlisted commands, and caps on iterations and diff size. That gives CodexSaver a tighter execution envelope before any patch lands.
- Preflight short-circuiting — If the requested outcome is already satisfied, v2 can return
preflight_satisfied=truewithout spending a worker call. That matters for idempotent tasks like docs updates, file existence checks, or trivial patch validation. - Local config and session overrides — Provider state lives in
~/.codexsaver/config.json, but the repo also supports one-shell-session env vars likeCODEXSAVER_PROVIDERandCODEXSAVER_API_KEY. That keeps the setup usable for both persistent and throwaway environments.
CodexSaver vs Alternatives
| Tool | Best For | Key Differentiator | Pricing |
|---|---|---|---|
| CodexSaver | cost-aware Codex delegation | risk-based routing into worker models with Codex takeover for sensitive tasks | Open-Source |
| Aider | git-aware terminal editing | direct code edits from the shell with tight repo context | Open-Source |
| Cursor | IDE-centric coding | editor-native agent workflow and autocomplete-first UX | Paid |
| OpenSwarm | multi-agent orchestration | coordination across multiple agents instead of one router deciding delegation | Open-Source |
Pick Aider when you want direct git-centric editing from the terminal and do not need a dedicated cost router. Pick Cursor when your whole workflow lives in an IDE and you want the editor to manage context for you.
Pick OpenSwarm when you want several agents to work in parallel on different slices of a problem instead of one worker model handling low-risk execution. Pick Brainstorm MCP when prompt exploration and structured ideation matter more than policy enforcement, and use OpenTrace when you care about tracing delegated calls and inspecting what happened after the fact.
How CodexSaver Works
CodexSaver sits between Codex and a configured worker provider through MCP. The control plane is simple: a task arrives with workspace context, file scope, and policy, then CodexSaver classifies risk and chooses preview, delegated_execution, or codex_takeover. The core abstraction is a bounded task envelope, not a free-form prompt, which keeps the router focused on what can be executed safely.
V2 tightens that model with a work packet that includes the exact goal, allowed files, forbidden paths, acceptance criteria, allowlisted commands, maximum iterations, and diff size limits. The worker can propose a patch, but CodexSaver only applies it inside a temporary sandbox and only if the patch stays within policy. That design is why the tool can return a preflight_satisfied=true result for already-complete tasks instead of burning a model call.
codexsaver delegate "add unit tests for user service" --files src/user/service.ts --workspace . --dry-run
The dry run shows how CodexSaver would route the task, including the route label and the next step message. Remove --dry-run to let the configured worker model act on low-risk work, or use codexsaver work-packet when you need file, command, and diff guardrails around a narrow change.
The implementation also keeps setup state explicit. Provider credentials live in ~/.codexsaver/config.json, while global MCP registration is written to ~/.codex/config.toml and repo-local installs can use .codex/config.toml. That split matters because it lets one launcher serve many workspaces without repeating provider configuration in each repository.
Pros and Cons of CodexSaver
Pros:
- Cuts model spend on repetitive tasks like tests, docs, search, and explanation work by routing them away from the expensive model.
- Preserves Codex for ambiguity, security-sensitive changes, protected domains, and final review, which keeps the high-trust step in one place.
- Makes delegation visible with
interactionblocks and route labels instead of hiding the model swap behind opaque tool output. - Supports a wide provider matrix, including hosted APIs and local runtimes like Ollama and LM Studio.
- Offers a global install path, so one machine-level setup can cover every Codex workspace.
- Adds v2 work packets that constrain file access, commands, and patch size before a worker model can touch the sandbox.
Cons:
- You still need a worker provider, so CodexSaver does not remove inference cost; it changes which model pays it.
- Risk classification can be conservative, which means some tasks that feel safe to a human may stay in Codex.
- Setup touches Codex config files and may require reloading an already-open MCP process before changes take effect.
- It is a routing and policy layer, not a full multi-agent orchestration system with parallel task planning.
- Work-packet mode adds policy overhead that can feel heavy for tiny one-line edits or throwaway experiments.
Getting Started with CodexSaver
The fastest path is an editable install, one provider auth step, and a global MCP registration. That gets CodexSaver into Codex with the least ceremony and gives you a repeatable baseline for doctor checks.
git clone https://github.com/fendouai/CodexSaver
cd CodexSaver
python -m pip install -e .
codexsaver auth set --provider deepseek --api-key YOUR_API_KEY
codexsaver install
codexsaver doctor --workspace .
After those commands, CodexSaver writes the global MCP entry, persists provider settings in ~/.codexsaver/config.json, and verifies that the launcher can start cleanly. If you already have Codex open, restart or reload the MCP process so it reads the new config; if you only want one repo to use it, switch to codexsaver install --project.
Verdict
CodexSaver is the strongest option for reducing Codex inference cost when your workload mixes cheap execution with occasional high-risk judgment. Its best strength is the explicit split between delegated work and Codex takeover, which keeps review quality visible. The caveat is setup friction around provider config and MCP registration. I recommend it for teams already using Codex and looking to cut waste without blunting model quality.



