
[WIP] Gradual Unfreezing to mitigate catastrophic forgetting #3967

Open · wants to merge 5 commits into master

Conversation

@ethanreidel (Contributor) commented on Mar 15, 2024:

Adds the ability to gradually unfreeze ("thaw") specific layers within a pre-trained model's architecture, with the aim of mitigating catastrophic forgetting and improving transfer learning. Currently works for the ECD architecture.

The user passes in two things: `thaw_epochs` (a list of integers) and `layers_to_thaw` (a 2D array of layer-name strings). For example:

```yaml
thaw_epochs:
  - 1
  - 2
layers_to_thaw:
  - ["features.0", "features.1"]  # thawed (weights + biases) at epoch 1
  - ["features.2", "features.3"]  # thawed at epoch 2
```

Keep in mind that `"features.0"` will thaw every layer with the prefix `"features.0"`, e.g. `"features.0.1"`, `"features.0.2"`, `"features.0.3"`. A sketch of this prefix-matching behavior is shown below.
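(For context, a minimal sketch of what the thaw step could look like under these semantics, assuming a standard PyTorch model whose `named_parameters()` names carry these prefixes; `thaw_layers` is illustrative, not the PR's exact implementation:)

```python
from torch import nn


def thaw_layers(model: nn.Module, prefixes: list[str]) -> None:
    """Enable gradients for every parameter whose name starts with one of the prefixes."""
    for name, param in model.named_parameters():
        if any(name.startswith(prefix) for prefix in prefixes):
            param.requires_grad_(True)


# e.g. at epoch 1, per the config above:
# thaw_layers(model, ["features.0", "features.1"])
```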

TODO / potential issues:

- potentially change the config syntax
- users currently need to know the exact layer-name strings in the architecture for thawing, which is inconvenient
- the unit test is still iffy

Test: `tests/ludwig/modules/test_gradual_unfreezing.py`

Any and all feedback is greatly appreciated. 👍


Unit Test Results

6 files ±0 · 6 suites ±0 · ⏱️ 52m 7s (+22m 2s)
2 990 tests −3: 2 966 ✔️ (−15) · 23 💤 (+11) · 1 failed (+1)
8 970 runs +5 941: 8 898 ✔️ (+5 893) · 69 💤 (+45) · 3 failed (+3)

For more details on these failures, see this check.

Results for commit d2ba5cb. ± Comparison against base commit 606c732.

@ethanreidel (Contributor, Author):

@skanjila @saad-palapa

@skanjila self-requested a review on March 15, 2024 at 19:55.

```python
if len(self.thaw_epochs) != len(self.layers_to_thaw):
    raise ValueError("The length of thaw_epochs and layers_to_thaw must be equal.")
self.layers = dict(zip(self.thaw_epochs, self.layers_to_thaw))
```
Contributor:

Can you call this `epoch_to_layers`?

```diff
@@ -1029,7 +1036,12 @@ def train(
     if profiler:
         profiler.start()

     current_epoch = 0
```
Contributor:

Why can't we use `progress_tracker.epoch` here?

```python
self.config = config
self.model = model
self.thaw_epochs = self.config.thaw_epochs
self.layers_to_thaw = self.config.layers_to_thaw
```
Contributor:

If a network has hundreds of layers, won't the config get unwieldy?

```python
from ludwig.schema.gradual_unfreezer import GradualUnfreezerConfig


class GradualUnfreezer:
```
Contributor:

I thought we were planning on doing something much simpler than this at first? Like a single regex to declare which layers to unfreeze.

In the transfer learning tutorials it trains the classification head until convergence (step 1) and then it unfreezes some of the encoder layers for a few more epochs of low learning rate training (step 2). Is this new functionality supposed to be part of step 2?
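(For illustration only: a minimal sketch of the single-regex alternative described above, assuming a PyTorch model; `unfreeze_matching` and the example pattern are hypothetical, not part of this PR.)

```python
import re

from torch import nn


def unfreeze_matching(model: nn.Module, pattern: str) -> None:
    """Unfreeze every parameter whose fully qualified name matches the regex."""
    regex = re.compile(pattern)
    for name, param in model.named_parameters():
        if regex.search(name):
            param.requires_grad_(True)


# Step 2 of the tutorial recipe: after the classification head converges,
# unfreeze the last encoder blocks for a few low-learning-rate epochs, e.g.:
# unfreeze_matching(model, r"^encoder\.layers\.(10|11)\.")
```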

```python
# Initialize gradual unfreezer
if self.config.gradual_unfreezer.thaw_epochs:
    self.gradual_unfreezer = GradualUnfreezer(self.config.gradual_unfreezer, self.model)
    logger.info(f"Gradual unfreezing for {len(self.gradual_unfreezer.thaw_epochs)} epoch(s)")
```
Contributor:

Can we make this more descriptive? Maybe something like:

```
Gradual unfreezing:

Epoch 10: unfreezing x1, x2, x3
Epoch 15: unfreezing x4, x5
Epoch 20: unfreezing x6
...
```
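(A rough sketch of how that log could be produced from the `self.layers` dict this PR builds in `__init__`; illustrative, not a review requirement:)

```python
logger.info("Gradual unfreezing:")
for epoch, layer_names in sorted(self.layers.items()):
    logger.info(f"Epoch {epoch}: unfreezing {', '.join(layer_names)}")
```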


```python
def thaw(self, current_epoch: int) -> None:
    if current_epoch in self.layers:
        current_layers = self.layers[current_epoch]
```
Contributor:

Better to call this `layers_to_thaw`.

```python
if layer in str(name):
    p.requires_grad_(True)
else:
    raise ValueError("Layer type doesn't exist within model architecture")
```
Contributor:

Can you add the layer name to the error message?
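(For example, something along these lines; the exact wording is just a suggestion:)

```python
raise ValueError(f"Layer '{layer}' doesn't exist within the model architecture")
```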


```python
# thaw individual layer
def thawParameter(self, layer):
    # is there a better way to do this instead of iterating through all parameters?
```
Contributor:

Perhaps in `__init__` make a map of layer name => parameters, and only include the ones that will be thawed.
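(A rough sketch of that suggestion, assuming layer strings are matched by prefix as described in the PR; `params_by_layer` is a hypothetical name:)

```python
# In __init__: precompute which parameters each configured layer string matches,
# so thaw() doesn't have to rescan the whole model every epoch.
self.params_by_layer = {
    layer: [
        param
        for name, param in self.model.named_parameters()
        if name.startswith(layer)
    ]
    for layer_group in self.layers_to_thaw
    for layer in layer_group
}
```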

```python
self.thawParameter(layer)

# thaw individual layer
def thawParameter(self, layer):
```
Contributor:

Make this private


```python
class GradualUnfreezer:
    def __init__(self, config: GradualUnfreezerConfig, model):
        self.config = config
```
Contributor:

Is this variable referenced outside of `__init__`?
