Pull requests: openai/evals
Pull requests list
Update README: Add Langtrace as an Eval vendor
#1531
opened May 21, 2024 by karthikscale3
5 of 13 tasks
[eval] Add IMO problems with exact answers
#1528
opened May 15, 2024 by justinlinw
13 tasks done
Dependabot configuration to update actions in workflows
#1526
opened May 1, 2024 by ScottBrenner
3 tasks done
Support GPT-4o, Added Quran Eval & Simple Fact Model-Graded Definition
#1511
opened Apr 1, 2024 by sakher
13 tasks done
Add Classification Rule Articulation Eval
#1510
opened Mar 30, 2024 by danesherbs
13 tasks done
Fix specifying API arguments from the CLI
#1505
opened Mar 27, 2024 by LoryPack
6 tasks done
[Evals] Add eval for Dhivehi diacritical marks
#1495
opened Mar 16, 2024 by aanaseer
11 of 12 tasks
Adding Indian Women Menstrual Health Chatbot Eval
#1430
opened Dec 11, 2023 by cranberrydeveloper
13 tasks done
Choose completion function for evaluation of modelgraded evals
#1418
opened Nov 17, 2023 by LoryPack
6 tasks done
Valid Hanabi clues eval & update Includes to optionally take Exclusions
#1385
opened Oct 17, 2023 by sjadler2004
13 tasks done
Add a new eval : chinese_literary_grace
#1375
opened Oct 7, 2023 by Conghui-Niu
12 of 13 tasks
Chess eval: Changed typo 'beset' to 'best' in all 101 examples.
#1374
opened Oct 3, 2023 by Zirunis
Add Eval: Interpreting balance sheet absolute changes
#1336
opened Aug 16, 2023 by TensorTemplar
12 of 13 tasks