The `RESPONSE` consumed by the `ROUGEScoreMetric` function contains much longer text than expected given the `output_features` and `max_sequence_length`. Even when `max_sequence_length` is 8 or 16 tokens, `RESPONSE` contains text as long as the text in `prompt_template`.

Based on my investigation, this happens because the condition in `get_decoded_targets_and_predictions` is wrong: instead of `targets != IGNORE_INDEX_TOKEN_ID`, it is set to `predictions[PREDICTIONS] != IGNORE_INDEX_TOKEN_ID`. From what I understand, we should be using the targets mask to truncate the predictions correctly.

When I apply this change I get the correct metric value, matching the expectations given the results seen during fine-tuning.
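To illustrate the issue, here is a simplified, hypothetical sketch of the masking logic (pure Python, not Ludwig's actual implementation; the real function operates on token-id tensors, and `IGNORE_INDEX_TOKEN_ID = -100` is assumed here, following the common convention for ignored label positions):

```python
# Assumed ignore value for padded/prompt positions in the targets tensor.
IGNORE_INDEX_TOKEN_ID = -100

def get_decoded_targets_and_predictions(targets, predictions):
    """Truncate each prediction row to the non-ignored span of its target row.

    Masking on `targets != IGNORE_INDEX_TOKEN_ID` (the fix) keeps only the
    positions that belong to the expected output. Masking on the predictions
    themselves (the bug) keeps nearly everything, because generated token ids
    essentially never equal IGNORE_INDEX_TOKEN_ID, so the prompt-length
    prefix leaks into the text scored by ROUGE.
    """
    decoded_targets, decoded_predictions = [], []
    for t_row, p_row in zip(targets, predictions):
        # Buggy condition: mask = [p != IGNORE_INDEX_TOKEN_ID for p in p_row]
        mask = [t != IGNORE_INDEX_TOKEN_ID for t in t_row]  # fixed condition
        decoded_targets.append([t for t, m in zip(t_row, mask) if m])
        decoded_predictions.append([p for p, m in zip(p_row, mask) if m])
    return decoded_targets, decoded_predictions

# Target has only 3 real tokens; the prompt positions are masked with -100.
targets = [[-100, -100, -100, 11, 12, 13]]
predictions = [[5, 6, 7, 11, 12, 99]]
dec_t, dec_p = get_decoded_targets_and_predictions(targets, predictions)
print(dec_t)  # [[11, 12, 13]]
print(dec_p)  # [[11, 12, 99]] -- prompt-length prefix is dropped
```

With the buggy condition, the mask over `predictions` is all-true, so the full prompt-length sequence survives truncation, which matches the overly long `RESPONSE` observed above.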
Python version: 3.11
Ludwig version: 0.10.1