Option to unfreeze encoder layers when training a computer vision model #3733

Open
saad-palapa opened this issue Oct 16, 2023 · 0 comments · May be fixed by #3981

Is your feature request related to a problem? Please describe.
Our computer vision solution is missing an important feature for improving accuracy: there is currently no option to unfreeze encoder layers for a fine-tuning round.

Describe the use case
In the past, I've followed this TensorFlow tutorial to train an image classifier. It's a transfer-learning technique with two training rounds:

  1. Training: add a classification head to a pretrained base model. The base model is frozen and only the classification layers are trainable. This is what we currently have.
  2. Fine-tuning: unfreeze the later layers of the base model and run about 10 epochs of low-learning-rate training (see the sketch after this list).
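
A minimal sketch of the two-round flow in the style of that Keras transfer-learning recipe. The MobileNetV2 base, the image size, the layer cutoff of 100, and the learning rates are illustrative assumptions, not proposed Ludwig defaults:

```python
import tensorflow as tf

IMG_SHAPE = (160, 160, 3)

# Round 1: frozen pretrained encoder + trainable classification head.
base_model = tf.keras.applications.MobileNetV2(
    input_shape=IMG_SHAPE, include_top=False, weights="imagenet")
base_model.trainable = False  # freeze the whole encoder

inputs = tf.keras.Input(shape=IMG_SHAPE)
x = base_model(inputs, training=False)
x = tf.keras.layers.GlobalAveragePooling2D()(x)
outputs = tf.keras.layers.Dense(1)(x)  # binary classification head (illustrative)
model = tf.keras.Model(inputs, outputs)

model.compile(optimizer=tf.keras.optimizers.Adam(1e-3),
              loss=tf.keras.losses.BinaryCrossentropy(from_logits=True),
              metrics=["accuracy"])
# model.fit(train_ds, validation_data=val_ds, epochs=10)

# Round 2: unfreeze the later encoder layers and fine-tune at a low learning rate.
base_model.trainable = True
for layer in base_model.layers[:100]:  # keep the early layers frozen
    layer.trainable = False

model.compile(optimizer=tf.keras.optimizers.Adam(1e-5),  # much lower LR
              loss=tf.keras.losses.BinaryCrossentropy(from_logits=True),
              metrics=["accuracy"])
# model.fit(train_ds, validation_data=val_ds, epochs=10)
```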

Describe the solution you'd like
There are two ways to implement this:

  1. Update the configuration to let users choose to automatically run the second fine-tuning round. This could even be the default.
  2. Rely on users to code their own fine-tuning round.

Both options require a way to choose which layers to freeze. There are two approaches (a sketch of both follows this list):

  1. By percentage or layer count of the base model, starting from the first hidden layer
  2. Using a regex to select which layers to freeze. For example:
     var_freeze_expr: '(efficientnet|fpn_cells|resample_p6)'
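
A rough sketch of what both selection strategies could look like over a torch encoder. The helper names freeze_by_regex and freeze_first_fraction are hypothetical, not existing Ludwig functions:

```python
import re
import torch.nn as nn

def freeze_by_regex(module: nn.Module, freeze_expr: str) -> None:
    """Freeze every parameter whose name matches the regex (approach 2)."""
    pattern = re.compile(freeze_expr)
    for name, param in module.named_parameters():
        if pattern.search(name):
            param.requires_grad = False

def freeze_first_fraction(module: nn.Module, fraction: float) -> None:
    """Freeze the first `fraction` of parameters in registration order,
    i.e. the layers closest to the input (approach 1)."""
    params = list(module.named_parameters())
    cutoff = int(len(params) * fraction)
    for _, param in params[:cutoff]:
        param.requires_grad = False

# Usage, assuming `encoder` is the pretrained base model:
# freeze_by_regex(encoder, r'(efficientnet|fpn_cells|resample_p6)')
# freeze_first_fraction(encoder, 0.7)  # keep the last 30% trainable
```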

The later layers of the encoder capture high-level features of the image, while the early ones capture low-level features (corners, edges, colors, gradients, etc.). Fine-tuning the early layers is usually counterproductive: those features are generic, and retraining them risks overfitting.

When fine-tuning, we unfreeze only the later layers of the pretrained base model. How many layers to unfreeze depends on how similar the new task is to the pretraining task and how large the dataset is relative to the pretraining data.

Describe alternatives you've considered
The alternative is to keep things as they are and rely on users to implement their own 2-round transfer-learning job.
