Dev
Solo maintainer or pilot team
Billed annually as $234; annual is 50% off.
1 repo, 20 tasks
- Private task seed set
- Weekly benchmark run
- Claude Code and Codex comparison
- Failure reason summary
ClaudeBench Drift pricing
Team annual is selected by default for 20 repositories, daily runs, failure replay, drift alerts, and procurement reports.
Pricing
Annual billing is selected by default and billed at 50% off the month-to-month total.
Solo maintainer or pilot team
Billed annually as $234; annual is 50% off.
1 repo, 20 tasks
Engineering teams adopting coding agents
Billed annually as $894; annual is 50% off.
20 repos, daily runs
Platform, vendor, and enterprise AI teams
Billed annually as $2,994; annual is 50% off.
100 repos, vendor reports
Common questions
No. The benchmark should use scrubbed, minimal task fixtures and CI sandboxes. The product is designed around repeatable tasks, not production credentials.
Yes. Team annual is selected by default, billed as $894 for the year, which equals 50% off the monthly total.
Yes. The comparison model is built for Claude Code, Codex, Cursor, Gemini CLI, OpenCode, and other agents that can run in a controlled task sandbox.