AutoGen meets SWE_bench #2933
✅ There are no secrets present in this pull request anymore. If these secrets were true positives and are still valid, we highly recommend that you revoke them.
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## main #2933 +/- ##
===========================================
- Coverage 32.90% 19.97% -12.94%
===========================================
Files 94 97 +3
Lines 10235 10953 +718
Branches 2193 2509 +316
===========================================
- Hits 3368 2188 -1180
- Misses 6580 8614 +2034
+ Partials 287 151 -136
Flags with carried forward coverage won't be shown.
@skzhang1 would you like to continue working on this draft PR?
Thanks. Do you think we can close this and revisit the idea once the new version is merged? It can be part of
Sure @ekzhu, we can revisit after the new version.
Why are these changes needed?
This PR provides a basic implementation of the SWE-bench benchmark (https://www.swebench.com). It is still a work in progress and not ready for review.
🌹 Acknowledgement: The code is largely based on swe-agent.
@Hk669, please add more if you have other suggestions. @Hk669 will take main responsibility for this work, building on this draft PR.
Related issue number
N/A
Checks