Exploring the Benefits of Learning to Mask for Task Transfer

This is the code base that I used for my independent study with Emma Strubell. The goal of the project was to explore how far we can push the limits of parameter-efficient tuning. The literature on parameter-efficient tuning suggests that only a small perturbation from the pretrained model is needed to solve a downstream task (e.g., constraining the update matrix to be of extremely low rank yields comparable results). In this work, we take this one step further: can we approximate the finetuning procedure without changing the parameters at all?

The idea is to learn a binary mask such that the masked pretrained model approximates the finetuned model. The benefit of a binary mask over altered weights is that the learned model can be stored efficiently and run at lower inference latency (with hardware support); a rough sketch of this masking mechanism is given after the list below. Our findings include:

  • The mask-learning approach performs comparably to finetuning in the single-task setup.
  • The masked model suffers less from negative transfer in the context of intermediate-task training.
  • Using the (diagonal) Fisher Information Matrix to initialize the binary mask does not result in faster convergence.
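As an illustration of the approach described above, here is a minimal sketch of learning a binary mask over a frozen pretrained layer with a straight-through estimator. This is an assumption-based example rather than the repository's actual implementation; the MaskedLinear class, the sigmoid parameterization of the mask, and the 0.5 threshold are illustrative choices.

```python
# Minimal sketch (not the repository's code) of a learnable binary mask over
# frozen pretrained weights, trained with a straight-through estimator.
import torch
import torch.nn as nn


class MaskedLinear(nn.Module):
    def __init__(self, pretrained: nn.Linear):
        super().__init__()
        # Freeze the pretrained weights; only the mask scores are trained.
        self.weight = nn.Parameter(pretrained.weight.detach().clone(), requires_grad=False)
        self.bias = nn.Parameter(pretrained.bias.detach().clone(), requires_grad=False)
        # Real-valued scores from which the binary mask is derived.
        self.scores = nn.Parameter(torch.zeros_like(self.weight))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        probs = torch.sigmoid(self.scores)
        hard = (probs > 0.5).float()
        # Straight-through estimator: the forward pass uses the hard 0/1 mask,
        # while gradients flow back through the sigmoid probabilities.
        mask = hard + probs - probs.detach()
        return nn.functional.linear(x, self.weight * mask, self.bias)
```

Since only the scores receive gradients, the per-task artifact can in principle be just the binary mask, which is what makes storage and (with hardware support) inference cheaper than storing a fully finetuned copy of the model.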

The detailed report can be found at this link.
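For reference, the third finding concerns initializing the mask scores from the diagonal of the (empirical) Fisher Information Matrix. The sketch below is again a hedged assumption, not the repository's code: it shows one common way to estimate the diagonal Fisher by accumulating squared gradients over a few batches; the function name and the num_batches parameter are hypothetical.

```python
# Minimal sketch of estimating the diagonal of the empirical Fisher Information
# Matrix: average squared gradients of the loss over a sample of training batches.
import torch


def diagonal_fisher(model, data_loader, loss_fn, num_batches: int = 32):
    fisher = {n: torch.zeros_like(p) for n, p in model.named_parameters() if p.requires_grad}
    model.eval()
    for i, (inputs, labels) in enumerate(data_loader):
        if i >= num_batches:
            break
        model.zero_grad()
        loss = loss_fn(model(inputs), labels)
        loss.backward()
        for n, p in model.named_parameters():
            if p.grad is not None:
                fisher[n] += p.grad.detach() ** 2  # accumulate squared gradients
    return {n: f / num_batches for n, f in fisher.items()}
```

Parameters with large diagonal Fisher values could then be given positive initial mask scores (kept active) and the rest negative ones; the finding above is that this initialization did not lead to faster convergence.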
