[Feature] Enable parameter reset in loss module #2017

Draft

BY571 wants to merge 1 commit into main

Conversation

@BY571 (Contributor) commented Mar 18, 2024

Description

Allows resetting the parameters in the loss module.
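
A minimal usage sketch of the proposed API, assuming the `reset_parameters(module_names, init_func)` signature shown in the review diff below; the exact semantics are still under discussion in this PR and the call itself is hypothetical:

from torch import nn
from torchrl.modules import QValueActor
from torchrl.objectives import DQNLoss

module = nn.Sequential(nn.Linear(1, 64), nn.ReLU(), nn.Linear(64, 64))
value_net = QValueActor(module=module, action_space="categorical")
loss = DQNLoss(value_network=value_net, action_space="categorical")

# Hypothetical call into the API added by this PR: reset the "value_network_params"
# tensordict, re-initializing each parameter tensor with init_func.
loss.reset_parameters(
    module_names=["value_network"],             # only the value-network params
    init_func=lambda t: t.uniform_(-1.0, 1.0),  # matches the documented default init
)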

@pytorch-bot (bot) commented Mar 18, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2017

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures, 1 Unrelated Failure

As of commit 4b29473 with merge base 87f3437:

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following job failed but was already failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label on Mar 18, 2024
@BY571 changed the title [Feature] enable parameter reset in loss module → [Feature] Enable parameter reset in loss module on Mar 18, 2024
@vmoens added the enhancement label on Mar 18, 2024
@vmoens (Contributor) left a comment


Thanks for this!

We'll need tests for the feature.

How do we handle the target parameters?

Wouldn't something like this be a bit more robust?

from torchrl.objectives import DQNLoss
from torchrl.modules import QValueActor
from torch import nn

module = nn.Sequential(nn.Linear(1, 64), nn.ReLU(), nn.Linear(64, 64))

value_net = QValueActor(module=module, action_space="categorical")
loss = DQNLoss(value_network=value_net, action_space="categorical")

with loss.value_network_params.to_module(loss.value_network):
    loss.apply(lambda module: module.reset_parameters() if hasattr(module, "reset_parameters") else None)

module_names (Optional[List[Parameter]]): A list of module names to reset the parameters for.
If None, all modules with names ending in "_params" will be reset.
init_func (Optional[Callable]): A function to initialize the parameters.
If None, the parameters will be initialized with uniform random values between -1 and 1.
@vmoens (Contributor):

Seems very unlikely that anyone would want to use that init IMO. Shouldn't we use the init method from the corresponding nn.Module if there is one?

def reset_parameters(
self,
module_names: Optional[List[Parameter]] = None,
init_func: Optional[Callable] = None,
@vmoens (Contributor):

Suggested change
init_func: Optional[Callable] = None,
init_func: Callable[[torch.Tensor], None] | None = None,

@@ -363,6 +364,35 @@ def reset(self) -> None:
# mainly used for PPO with KL target
pass

def reset_parameters(
self,
module_names: Optional[List[Parameter]] = None,
@vmoens (Contributor):

Suggested change
module_names: Optional[List[Parameter]] = None,
module_names: List[Parameter] | None = None,

"""Reset the parameters of the specified modules.

Args:
module_names (Optional[List[Parameter]]): A list of module names to reset the parameters for.
@vmoens (Contributor):

Suggested change
module_names (Optional[List[Parameter]]): A list of module names to reset the parameters for.
module_names (list of nn.Parameter, optional): A list of module names to reset the parameters for.

Args:
module_names (Optional[List[Parameter]]): A list of module names to reset the parameters for.
If None, all modules with names ending in "_params" will be reset.
init_func (Optional[Callable]): A function to initialize the parameters.
@vmoens (Contributor):

Suggested change
init_func (Optional[Callable]): A function to initialize the parameters.
init_func (Callable[[torch.Tensor], None]): A function to initialize the parameters.

else:
params_2_reset = [name + "_params" for name in module_names]

def _reset_params(param):
@vmoens (Contributor):

Having one single reset function will be hard to handle, we need a way to tie the reset function and the module.
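
A rough sketch (not part of this PR) of what tying the reset function to the module could look like: a hypothetical mapping from module type to a dedicated init function, falling back to the module's own reset_parameters when it has one:

from torch import nn

# Hypothetical per-type registry: each layer kind gets its own reset rule.
RESET_FNS = {
    nn.Linear: lambda m: (nn.init.xavier_uniform_(m.weight), nn.init.zeros_(m.bias)),
    nn.Conv2d: lambda m: nn.init.kaiming_normal_(m.weight, nonlinearity="relu"),
}

def reset_by_type(module: nn.Module) -> None:
    fn = RESET_FNS.get(type(module))
    if fn is not None:
        fn(module)
    elif hasattr(module, "reset_parameters"):
        # Fall back to the module's own initializer when one exists.
        module.reset_parameters()

net = nn.Sequential(nn.Linear(1, 64), nn.ReLU(), nn.Linear(64, 64))
net.apply(reset_by_type)  # nn.Module.apply visits every submodule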


@BY571 (Contributor, Author) commented Mar 20, 2024

Thanks for this!

We'll need tests for the feature.

How do we handle the target parameters?

Wouldn't something like this be a bit more robust?

from torchrl.objectives import DQNLoss
from torchrl.modules import QValueActor
from torch import nn

module = nn.Sequential(nn.Linear(1, 64), nn.ReLU(), nn.Linear(64, 64))

value_net = QValueActor(module=module, action_space="categorical")
loss = DQNLoss(value_network=value_net, action_space="categorical")

with loss.value_network_params.to_module(loss.value_network):
    loss.apply(lambda module: module.reset_parameters() if hasattr(module, "reset_parameters") else None)

I like the solution! But since we are accessing the parameters directly in your example, we would need to define a reset function manually, which I think is perfectly fine because then the user gets to decide how to reset weights and biases:

def reset_parameters(params):
    """ User specified resetting function depending on their needs for initialization """
    if len(params.shape) > 1:
        # weights
        nn.init.xavier_uniform_(params)
    elif len(params.shape) == 1:
        # biases
        nn.init.zeros_(params)
    else:
        raise ValueError("Unknown parameter shape: {}".format(params.shape))
  
with loss.value_network_params.to_module(loss.value_network):
    loss.apply(lambda x: reset_parameters(x.data) if hasattr(x, "data") else None)

And for handling the target_network_params I think we could simply do something like:

loss.target_value_network_params.update(loss.value_network_params)

What do you think? I think we can close the draft, but we might want to document this way of resetting parameters somewhere in the docs.

@vmoens (Contributor) commented Mar 20, 2024

loss.target_value_network_params.update(loss.value_network_params)

This won't work because the target params are locked (you can't update them). They're locked because we want to avoid this kind of operation :)
You should update the data in place:

loss.target_value_network_params.apply(lambda dest, src: dest.data.copy_(src), loss.value_network_params)

@vmoens (Contributor) commented Mar 20, 2024

def reset_parameters(params):
    """ User specified resetting function depending on their needs for initialization """
    if len(params.shape) > 1:
        # weights
        nn.init.xavier_uniform_(params)
    elif len(params.shape) == 1:
        # biases
        nn.init.zeros_(params)
    else:
        raise ValueError("Unknown parameter shape: {}".format(params.shape))
  
with loss.value_network_params.to_module(loss.value_network):
    loss.apply(lambda x: reset_parameters(x.data) if hasattr(x, "data") else None)

Unfortunately this isn't very generic:
(1) All tensors have a data attribute, even buffers. Doing this, you would also apply Xavier init to batch-norm buffers if they happen to be 2d.
(2) If the model has a mixture of linear, conv and other layers, it's going to be hard to have fine-grained control over which params get updated.

Not all parameters are "weights" and "biases", and "biases" can be 2d (my point is: dimensionality is a very indirect indicator of a tensor's role in a model).

The way I usually see this done is to use the module's reset_parameters if there is one, which gives better control over differences in initialization methods.

Maybe we could allow the user to pass a reset function, but in that case we don't even need to re-populate the module (we can just do tensordict.apply(reset)). Note that you could also do

def reset(name, tensor):
    if name == "bias":
        tensor.data.zero_()
    if name == "weight":
        nn.init.xavier_uniform_(tensor)
tensordict.apply(reset, named=True)

which is more straightforward IMO
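
Putting the two suggestions from this thread together, a sketch only (not merged code) that resets the live params through each submodule's own `reset_parameters()` and then refreshes the locked target params in place; `delay_value=True` is an assumption here so that the loss holds a separate copy of the target parameters:

from torch import nn
from torchrl.modules import QValueActor
from torchrl.objectives import DQNLoss

module = nn.Sequential(nn.Linear(1, 64), nn.ReLU(), nn.Linear(64, 64))
value_net = QValueActor(module=module, action_space="categorical")
# delay_value=True assumed so that target params are a distinct copy
loss = DQNLoss(value_network=value_net, action_space="categorical", delay_value=True)

# 1) Re-populate the functional module with its params and call each submodule's
#    own reset_parameters(), as suggested in the review above.
with loss.value_network_params.to_module(loss.value_network):
    loss.apply(
        lambda m: m.reset_parameters() if hasattr(m, "reset_parameters") else None
    )

# 2) The target params are locked, so copy the freshly reset values into them
#    in place, as suggested above.
loss.target_value_network_params.apply(
    lambda dest, src: dest.data.copy_(src), loss.value_network_params
)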

Labels: CLA Signed, enhancement
Projects: None yet
Linked issues: None yet
3 participants