[MAINTENANCE] Refactor and clean up. #4008

Conversation

alexsherstinsky (Collaborator)

Scope

  • Make the ViTEncoder fix, which ensures that transformers.ViTModel returns the output_attentions, more elegant (and cut down on the amount of code).

Code Pull Requests

Please provide the following:

  • a clear explanation of what your code does
  • if applicable, a reference to an issue
  • a reproducible test for your PR (code, config and data sample)

Documentation Pull Requests

Note that the documentation HTML files are in docs/ while the Markdown sources are in mkdocs/docs.

If you are proposing a modification to the documentation you should change only the Markdown files.

api.md is automatically generated from the docstrings in the code, so if you want to change something in that file, first modify the docstring in ludwig/api.py, then run mkdocs/code_docs_autogen.py, which will create mkdocs/docs/api.md.

Excerpt from the diff under review:

```python
    gradient_checkpointing=gradient_checkpointing,
)
if output_attentions:
    config = ViTConfig(
```
arnavgarg1 (Contributor)
Can we just create a dictionary mapping, then optionally add "attn_implementation" if output_attentions, and pass the dictionary as **kwargs into the Config? Will reduce boilerplate :)
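A minimal sketch of the suggested pattern, assuming a hypothetical helper and an abbreviated parameter list (the real ViTEncoder passes more fields into the dictionary):

```python
from transformers import ViTConfig

def build_vit_config(gradient_checkpointing: bool, output_attentions: bool) -> ViTConfig:
    # Collect the keyword arguments once instead of duplicating two
    # near-identical ViTConfig(...) calls.
    config_dict = {
        "gradient_checkpointing": gradient_checkpointing,
    }
    if output_attentions:
        # Per the discussion below, the SDPA default does not materialize
        # attention weights, so "eager" is requested only when needed.
        config_dict["attn_implementation"] = "eager"
    return ViTConfig(**config_dict)
```

ViTConfig, like other PretrainedConfig subclasses, accepts attn_implementation as a keyword argument, so the dictionary can be splatted in directly as **config_dict.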

alexsherstinsky (Collaborator, Author)

@arnavgarg1 Done -- sorry about missing the obvious! Thanks!

"gradient_checkpointing": gradient_checkpointing,
}
config_dict["attn_implementation"] = "eager"
config = ViTConfig(**config_dict)
arnavgarg1 (Contributor)

Do we no longer need the if/else, so that we can pass these custom args in both cases?

alexsherstinsky (Collaborator, Author)

@arnavgarg1 We do -- I updated the code. If we do not, the tests still pass, because locally nothing changes (it just runs more slowly). This is due to the breaking change on the HF side: they now default to SDPA for speed, but its lazy execution leaves the attention tensors as None. With "eager" set in both cases everything still works, only less efficiently. This is fixed now -- good catch -- thanks!
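For illustration, a tiny standalone check of the behavior discussed above (a sketch, not Ludwig's test suite; the scaled-down config values are arbitrary). With attn_implementation="eager", ViTModel materializes and returns the per-layer attention tensors, whereas the SDPA default would leave them as None, per the comment above:

```python
import torch
from transformers import ViTConfig, ViTModel

config = ViTConfig(
    hidden_size=32,           # scaled-down, illustrative values
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=64,
    image_size=32,
    patch_size=8,
    attn_implementation="eager",  # force attention weights to be materialized
)
model = ViTModel(config)
pixel_values = torch.randn(1, 3, 32, 32)
outputs = model(pixel_values, output_attentions=True)

# One attention tensor per layer, shape (batch, heads, seq, seq),
# where seq = 1 CLS token + (32 // 8) ** 2 = 17 positions.
assert outputs.attentions is not None and len(outputs.attentions) == 2
print(outputs.attentions[0].shape)  # torch.Size([1, 2, 17, 17])
```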

github-actions bot commented May 23, 2024

Unit Test Results

  6 files  ±0    6 suites  ±0   14m 23s ⏱️ +2s
12 tests ±0    9 ✔️ ±0    3 💤 ±0  0 ±0 
60 runs  ±0  42 ✔️ ±0  18 💤 ±0  0 ±0 

Results for commit 9b91d6b. ± Comparison against base commit 7053966.

♻️ This comment has been updated with latest results.

alexsherstinsky merged commit 3b9192b into master May 23, 2024
18 checks passed
alexsherstinsky deleted the maintenance/alexsherstinsky/requirements/cleanup_update_vision_transformer_encoder_for_transformers_version_4_41_1_compatibility-2024_05_23-37 branch May 23, 2024 22:13
skanjila pushed a commit to skanjila/ludwig that referenced this pull request Jun 7, 2024