openai / evals Public

Fork 2.5k
Star 14.1k

Issues 85
Pull requests 35

Pull requests: openai/evals

Labels 10 Milestones 0

35 Open 1,227 Closed

Pull requests list

[eval] Add IMO problems with exact answers

#1528 opened May 15, 2024 by justinlinw

13 tasks done

Dependabot configuration to update actions in workflows

#1526 opened May 1, 2024 by ScottBrenner

3 tasks done

show evals in wandb weave

#1522 opened Apr 19, 2024 by yogeshg • Draft

13 tasks

3

Added Quran Eval & Simple Fact Model-Graded Definition

#1511 opened Apr 1, 2024 by sakher

13 tasks done

2

Add Classification Rule Articulation Eval

#1510 opened Mar 30, 2024 by danesherbs

13 tasks done

eval pattern-concat-logic

#1508 opened Mar 28, 2024 by natanaelwf

13 tasks done

1

Fix specifying API arguments from the CLI

#1505 opened Mar 27, 2024 by LoryPack

6 tasks done

1

[Evals] Add eval for Dhivehi diacritical marks

#1495 opened Mar 16, 2024 by aanaseer

11 of 12 tasks

Add **kwargs to OpenAIChatCompletionFn

#1494 opened Mar 15, 2024 by ezraporter

1

add a new eval:needle_in_a_matrix

#1475 opened Mar 11, 2024 by gordbegli

13 tasks done

Extending to Azure OpenAI implementation

#1470 opened Feb 23, 2024 by pkt1583

1

Adding Indian Women Menstrual Health Chatbot Eval

#1430 opened Dec 11, 2023 by cranberrydeveloper

13 tasks done

7

Choose completion function for evaluation of modelgraded evals

#1418 opened Nov 17, 2023 by LoryPack

6 tasks done

Add Eval: name well known security weaknesses

#1392 opened Oct 28, 2023 by ourmony

1 task

1

Valid Hanabi clues eval & update Includes to optionally take Exclusions

#1385 opened Oct 17, 2023 by sjadler2004

13 tasks done

5

Deepcopy in recorder

#1376 opened Oct 12, 2023 by johny-b

1

Add a new eval : chinese_literary_grace

#1375 opened Oct 7, 2023 by Conghui-Niu

12 of 13 tasks

3

Chess eval: Changed typo 'beset' to 'best' in all 101 examples.

#1374 opened Oct 3, 2023 by Zirunis

1

Add gpt4facts Eval

#1363 opened Sep 25, 2023 by mmtmn

13 tasks done

3

Add Eval: Interpreting balance sheet absolute changes

#1336 opened Aug 16, 2023 by TensorTemplar

12 of 13 tasks

3

#1324 opened Jul 29, 2023 by Livegan

13 tasks

#1308 opened Jul 7, 2023 by mrzu • Draft

5 of 6 tasks

add eval against machiavellianistic attitudes

#1270 opened Jul 1, 2023 by Huge

2

[Resolves Issue #1228] Improve ModelGraded Evals Formatting for Increased GPT Compliance

#1258 opened Jun 28, 2023 by douglasmonsky

1

4

Now I have the change in place, it seems wrong.

#1209 opened Jun 21, 2023 by CholoTook
