Back to Glossary
SWE-bench Verified
What is SWE-bench Verified?
A benchmark where GPT-5.1 (high reasoning) achieved a score of 76.3%, compared to 72.8% for GPT-5.