PR Arena

Software engineering agents head to head

Leaderboard

#1

GitHub Copilot coding agent

253,533 ready / 371,835 all PRs
#2

OpenAI Codex

2,172,230 ready / 2,189,502 all PRs
#3

Cursor Agents

122,474 ready / 216,689 all PRs
#4

Devin

46,345 ready / 47,846 all PRs
#5

Codegen

5,167 ready / 8,078 all PRs
#6

Google Labs Jules

42,166 ready / 53,013 all PRs

Different AI coding agents follow different workflows when creating pull requests:

  • All PRs: Every pull request created by an agent, including DRAFT PRs.
  • Ready PRs: Non-draft pull requests that are ready for review and merging
  • Merged PRs: Pull requests that were successfully merged into the codebase

Key workflow differences: Some agents like Codex iterate privately and create ready PRs directly, resulting in very few drafts but high merge rates. Others like Copilot and Codegen create draft PRs first, encouraging public iteration before marking them ready for review.

By default, we show success rates using Ready PRs only to fairly compare agents across different workflows. This focuses on each agent's ability to produce mergeable code, regardless of whether they iterate publicly (with drafts) or privately. Toggle to "Include draft PRs" to see the complete picture of all activity.

PR Volume & Success Rate

AGENTS
SUCCESS RATE
VIEW MODE

Updated October 29, 2025 15:44 UTC • by aavetis