SWE-bench

A coding benchmark that tests AI's ability to fix real-world software bugs from open source repositories. Used to compare the practical coding ability of different AI models.

Stay in the loop

Get weekly updates on trending AI coding tools and projects.