anwielts/ai-benchmark-guide

ai-benchmark-guide

Detailed guide on what AI benchmark metrics mean and how to use these to find the best foundation model for the use case at hand.

Overview table

| Metric/challenge name | TLDR explanation | Link to detailed explanation | Link to paper |
| --- | --- | --- | --- |
| GSM8K | Solve "grade school math word problems" | Detailed explanation | arXiv |

Usage guide by use case

Math

Solving textual math problems
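
Benchmarks of this kind, such as GSM8K, are usually reported as accuracy: the share of problems where the model's final numeric answer matches the reference. GSM8K reference solutions mark the final answer with `#### <number>`. The sketch below shows one way such a score could be computed. The helper names `extract_final_answer` and `accuracy`, the sample strings, and the "take the last number in the model output" fallback are all assumptions made for illustration; published evaluation harnesses may extract answers differently.

```python
import re

# Integer or decimal, optionally with thousands separators (e.g. "1,234.5").
NUMBER = r"-?\d+(?:,\d{3})*(?:\.\d+)?"


def extract_final_answer(text):
    """Return the final numeric answer found in `text`, or None if there is none."""
    # GSM8K reference solutions end with a line of the form "#### <number>".
    marked = re.search(r"####\s*(" + NUMBER + ")", text)
    if marked:
        raw = marked.group(1)
    else:
        # Assumption: for free-form model output, use the last number mentioned.
        numbers = re.findall(NUMBER, text)
        if not numbers:
            return None
        raw = numbers[-1]
    return float(raw.replace(",", ""))


def accuracy(predictions, references):
    """Fraction of predictions whose final answer equals the reference answer."""
    correct = 0
    for prediction, reference in zip(predictions, references):
        predicted = extract_final_answer(prediction)
        expected = extract_final_answer(reference)
        if predicted is not None and predicted == expected:
            correct += 1
    return correct / len(references)


# Hypothetical sample data, not taken from the benchmark itself.
references = ["48 + 24 = 72\n#### 72", "9 * 2 = 18\n#### 18"]
predictions = ["She sold 72 clips in total.", "The answer is 20."]
print(accuracy(predictions, references))  # prints 0.5
```

A score computed this way only checks the final number, so a model can reach the right answer through faulty reasoning and still count as correct. Keep that in mind when comparing models for a math-heavy use case.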

Computer science

TBD

Disclaimer

The information above may contain errors.
