Transformers model as Judge #174

anilaltuner · 2024-04-26T08:31:02Z

Transformers library added to llm as judge.

Only need is when the JudgeLLM class called, change judge model with transformers.

Ex.

JudgeLLM(
            judge_model_name="microsoft/Phi-3-mini-128k-instruct",
            template_path="src/lighteval/tasks/extended/mt_bench/judge_prompts.jsonl",
            multi_turn=True,
        )

clefourrier · 2024-04-30T11:49:15Z

Hi! Thanks for this PR! Can you fix your PR so that tests are passing?

anilaltuner · 2024-04-30T13:41:58Z

Yes, I'll fix on a short time but Run tests gives error for

ERROR tests/test_main.py - huggingface_hub.utils._errors.HfHubHTTPError: 500 Server Error: Internal Server Error for url: https://huggingface.co/api/datasets/gsm8k/paths-info/e53f048856ff4f594e959d75785d2c2d37b678ee (Request ID: Root=1-6630cc27-222f10cf5f5b028e1ffcebcc;677b1d40-c6e5-4fc5-9718-6441f20a365c)

Is it about huggingface hub?

clefourrier · 2024-04-30T13:53:03Z

Hm, let me re-run your tests, maybe you committed when the hub was down

anilaltuner · 2024-04-30T13:59:37Z

Thanks, I fixed code quality and pushed. We can re-run whenever you want

clefourrier · 2024-05-02T15:43:48Z

cc @NathanHB if you have the time to do a more in depth review

NathanHB · 2024-05-12T11:18:00Z

src/lighteval/metrics/llm_as_judge.py

+ self.generation_args = {
+ "max_new_tokens": 500,
+ "return_full_text": False,
+ "temperature": temperature,


temperature is not needed if do_sample is set to False

NathanHB · 2024-05-12T11:20:11Z

src/lighteval/metrics/metrics_sample.py

@@ -625,22 +625,25 @@ class JudgeLLM:

 def __init__(self, judge_model_name: str, template_path: str, multi_turn: bool = False):
 if judge_model_name not in self.available_models:
- raise ValueError(f"{judge_model_name} not in available models for llm as a judge metric")
+ judge_type = "openai"


this wouldn't work, if we pass in gpt-12 for example, it will set the judge type to openai and continue, only to fail later because gpt-12 does not exist in the openai api

NathanHB · 2024-05-31T16:03:36Z

hey ! thanks for the fix, I will have the bandwitdh to test next week and will merge asap :)

anilaltuner added 2 commits April 26, 2024 11:24

Transformers as Judge added

fdb22c7

Transformers as Judge added

6906b04

Formatting fix

8a33c12

NathanHB reviewed May 12, 2024

View reviewed changes

Check model from HfApi

aef34a7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Transformers model as Judge #174

Transformers model as Judge #174

anilaltuner commented Apr 26, 2024

clefourrier commented Apr 30, 2024

anilaltuner commented Apr 30, 2024

clefourrier commented Apr 30, 2024

anilaltuner commented Apr 30, 2024

clefourrier commented May 2, 2024

NathanHB May 12, 2024

NathanHB May 12, 2024

NathanHB commented May 31, 2024

Transformers model as Judge #174

Are you sure you want to change the base?

Transformers model as Judge #174

Conversation

anilaltuner commented Apr 26, 2024

clefourrier commented Apr 30, 2024

anilaltuner commented Apr 30, 2024

clefourrier commented Apr 30, 2024

anilaltuner commented Apr 30, 2024

clefourrier commented May 2, 2024

NathanHB May 12, 2024

Choose a reason for hiding this comment

NathanHB May 12, 2024

Choose a reason for hiding this comment

NathanHB commented May 31, 2024