You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ErikBjare
added
duplicate
This issue or pull request already exists
and removed
triage
Interesting but stale issue. Will be close if inactive for 3 more days after label added.
labels
Mar 13, 2024
Feature description
Devin, the "First AI software engineer" is using their SWE-Bench performance as primary evidence of their capability.
Motivation/Application
Provide a benchmark to evaluate Devin's claims, and solidify gpt-engineer's reputation as a legitimate autonomous coding agent.
The text was updated successfully, but these errors were encountered: