
Fix _init_max_length in base_model.py #185

Open
wants to merge 8 commits into base: main

Conversation


@gucci-j gucci-j commented May 4, 2024

What does this PR do?

This PR fixes an error raised in self._init_max_length(config.max_length). I added a try-except block to avoid the error.

Error

An AttributeError occurred while running load_model() for bigscience/bloom-7b1 because self._tokenizer had not yet been defined at that point.

WARNING:lighteval.logging.hierarchical_logger:  Test all gather {
WARNING:lighteval.logging.hierarchical_logger:    Test gather tensor
WARNING:lighteval.logging.hierarchical_logger:    gathered_tensor tensor([0], device='cuda:0'), should be [0]
WARNING:lighteval.logging.hierarchical_logger:  } [0:00:00.000649]
WARNING:lighteval.logging.hierarchical_logger:  Creating model configuration {
WARNING:lighteval.logging.hierarchical_logger:  } [0:00:00.000012]
WARNING:lighteval.logging.hierarchical_logger:  Model loading {
loading configuration file config.json from cache at /mnt/parscratch/users/acp23ay/private/hub/models--bigscience--bloom-7b1/snapshots/6232703e399354503377bf59dfbb8397fd569e4a/config.json
Model config BloomConfig {
  "_name_or_path": "bigscience/bloom-7b1",
  "apply_residual_connection_post_layernorm": false,
  "architectures": [
    "BloomForCausalLM"
  ],
  "attention_dropout": 0.0,
  "attention_softmax_in_fp32": true,
  "bias_dropout_fusion": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "hidden_dropout": 0.0,
  "hidden_size": 4096,
  "initializer_range": 0.02,
  "layer_norm_epsilon": 1e-05,
  "masked_softmax_fusion": true,
  "model_type": "bloom",
  "n_head": 32,
  "n_inner": null,
  "n_layer": 30,
  "offset_alibi": 100,
  "pad_token_id": 3,
  "pretraining_tp": 1,
  "skip_bias_add": true,
  "skip_bias_add_qkv": false,
  "slow_but_exact": false,
  "torch_dtype": "float16",
  "transformers_version": "4.39.0.dev0",
  "unk_token_id": 0,
  "use_cache": true,
  "vocab_size": 250880
}

WARNING:lighteval.logging.hierarchical_logger:  } [0:00:00.123938]
WARNING:lighteval.logging.hierarchical_logger:} [0:00:00.435923]
Traceback (most recent call last):
  File "/users/acp23ay/src/lighteval/run_evals_accelerate.py", line 82, in <module>
    main(args)
  File "/users/acp23ay/src/lighteval/src/lighteval/logging/hierarchical_logger.py", line 166, in wrapper
    return fn(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^
  File "/users/acp23ay/src/lighteval/src/lighteval/main_accelerate.py", line 77, in main
    model, model_info = load_model(config=model_config, env_config=env_config)
                        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/users/acp23ay/src/lighteval/src/lighteval/models/model_loader.py", line 83, in load_model
    return load_model_with_accelerate_or_default(config=config, env_config=env_config)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/users/acp23ay/src/lighteval/src/lighteval/models/model_loader.py", line 125, in load_model_with_accelerate_or_default
    model = BaseModel(config=config, env_config=env_config)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/users/acp23ay/src/lighteval/src/lighteval/models/base_model.py", line 76, in __init__
    self._max_length = self._init_max_length(config.max_length)
                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/users/acp23ay/src/lighteval/src/lighteval/models/base_model.py", line 269, in _init_max_length
    if hasattr(self.tokenizer, "model_max_length"):
               ^^^^^^^^^^^^^^
  File "/users/acp23ay/src/lighteval/src/lighteval/models/base_model.py", line 103, in tokenizer
    return self._tokenizer
           ^^^^^^^^^^^^^^^
AttributeError: 'BaseModel' object has no attribute '_tokenizer'. Did you mean: 'tokenizer'?
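
To illustrate the ordering problem, here is a minimal sketch reconstructed from the traceback (only the names that appear in the traceback are real; the rest are placeholders, and the actual lighteval code differs in detail):

class BaseModel:
    def __init__(self, config, env_config):
        # max_length is resolved before the tokenizer is created, so any
        # access to self.tokenizer inside _init_max_length raises AttributeError.
        self._max_length = self._init_max_length(config.max_length)
        self._tokenizer = self._create_tokenizer(config, env_config)  # placeholder name

    @property
    def tokenizer(self):
        return self._tokenizer  # fails: '_tokenizer' is not set yet

    def _init_max_length(self, max_length):
        if max_length:
            return int(max_length)
        # Evaluating self.tokenizer here is what triggers the crash;
        # hasattr() never gets a chance to suppress it.
        if hasattr(self.tokenizer, "model_max_length"):
            return self.tokenizer.model_max_length
        return 2048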

@gucci-j gucci-j closed this May 4, 2024
@gucci-j gucci-j reopened this May 4, 2024
@gucci-j gucci-j changed the title from "Fix tokenizer loading order in base_model.py" to "Fix _init_max_length in base_model.py" May 4, 2024
try:
    if hasattr(self.tokenizer, "model_max_length"):
        return self.tokenizer.model_max_length
except AttributeError:
    hlog("No max length config setting is found in the model or tokenizer. max_length set to 2048.")
NathanHB (Member)
Interesting. This function needs the tokenizer to be defined, but the tokenizer setup function needs max_length to be defined as well. This will fail whenever we cannot find the sequence length in the model config, which means we can simply remove this whole try/except (it will always fail) and just return 2048.
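
As a rough sketch of that simplification (the config attribute names below are illustrative assumptions, not necessarily the exact ones lighteval checks):

def _init_max_length(self, max_length) -> int:
    # Prefer an explicitly configured max_length.
    if max_length is not None:
        return int(max_length)
    # Otherwise look for a sequence-length attribute on the model config.
    for attr in ("n_positions", "max_position_embeddings", "n_ctx"):
        if hasattr(self._config, attr):
            return getattr(self._config, attr)
    # The tokenizer is not available at this point in __init__, so fall back
    # to a default instead of trying tokenizer.model_max_length.
    hlog("No max length config setting found in the model. max_length set to 2048.")
    return 2048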

gucci-j (Author), May 14, 2024

@NathanHB Hi, thanks for the review. Should I just delete the whole try-except and update this branch accordingly?

NathanHB (Member)
Hi! Yes, since this will always fail.

gucci-j (Author), May 16, 2024

@NathanHB Hi, I've just deleted the relevant lines and updated the branch :)

self.use_chat_template = config.use_chat_template
self._max_length = self._init_max_length(config.max_length)
NathanHB (Member)
Not sure this change is needed.

NathanHB (Member), May 31, 2024

Could you revert this as well? Otherwise LGTM, thanks for the fix! @gucci-j

@gucci-j gucci-j requested a review from NathanHB May 17, 2024 11:56
NathanHB (Member)
Looks good, thanks for the fix. Could you just address the last comment? I will merge afterward.
