Pull requests: openai/evals
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[eval] Add IMO problems with exact answers
#1528
opened May 15, 2024 by
justinlinw
Loading…
13 tasks done
Dependabot configuration to update actions in workflows
#1526
opened May 1, 2024 by
ScottBrenner
Loading…
3 tasks done
Added Quran Eval & Simple Fact Model-Graded Definition
#1511
opened Apr 1, 2024 by
sakher
Loading…
13 tasks done
Add Classification Rule Articulation Eval
#1510
opened Mar 30, 2024 by
danesherbs
Loading…
13 tasks done
Fix specifying API arguments from the CLI
#1505
opened Mar 27, 2024 by
LoryPack
Loading…
6 tasks done
[Evals] Add eval for Dhivehi diacritical marks
#1495
opened Mar 16, 2024 by
aanaseer
Loading…
11 of 12 tasks
Adding Indian Women Menstrual Health Chatbot Eval
#1430
opened Dec 11, 2023 by
cranberrydeveloper
Loading…
13 tasks done
Choose completion function for evaluation of modelgraded evals
#1418
opened Nov 17, 2023 by
LoryPack
Loading…
6 tasks done
Valid Hanabi clues eval & update Includes to optionally take Exclusions
#1385
opened Oct 17, 2023 by
sjadler2004
Loading…
13 tasks done
Add a new eval : chinese_literary_grace
#1375
opened Oct 7, 2023 by
Conghui-Niu
Loading…
12 of 13 tasks
Chess eval: Changed typo 'beset' to 'best' in all 101 examples.
#1374
opened Oct 3, 2023 by
Zirunis
Loading…
Add Eval: Interpreting balance sheet absolute changes
#1336
opened Aug 16, 2023 by
TensorTemplar
Loading…
12 of 13 tasks
[Resolves Issue #1228] Improve ModelGraded Evals Formatting for Increased GPT Compliance
#1258
opened Jun 28, 2023 by
douglasmonsky
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.