Code for the paper "Prediction-Powered Ranking of Large Language Models", arXiv 2024.
ranking-algorithm
llm-eval
llm-evaluation
llm-evaluation-framework
prediction-powered-inference
rank-sets
Updated May 27, 2024 - Python