model state manipulation #1448

YumcoderCom · 2024-02-04T03:34:13Z

YumcoderCom
Feb 4, 2024

Suppose we have two model

#[derive(Module, Debug)]
pub struct RegressionModel<B: Backend> {
input_layer: Linear,
output_layer: Linear,
activation: ReLU,
}

let model1: crate::model::RegressionModel =
RegressionModelConfig::new(config.input_feature_len).init(&device);

let model2: crate::model::RegressionModel =
RegressionModelConfig::new(config.input_feature_len).init(&device);

How can we compute the average of the parameters of two models and then load these averaged parameters into a new model?

Answered by nathanielsimard

Feb 5, 2024

I think the question is a bit different. You can create functions that work for each parameter of the module. There is the map and visit functions that exist on Module. You could use burn::tensor::container::TensorContainer to aggregate each parameter and then update the module in question. This is the strategy used in burn-train for gradients accumulation, but this can be used to merge modules as well.

View full answer

Nikaidou-Shinku · 2024-02-04T04:45:43Z

Nikaidou-Shinku
Feb 4, 2024

I think #1245 will fix this, once it's merged you can use model1.weight to get the model's parameters and then manipulate them as tensors.

0 replies

nathanielsimard · 2024-02-05T16:08:04Z

nathanielsimard
Feb 5, 2024
Maintainer

I think the question is a bit different. You can create functions that work for each parameter of the module. There is the map and visit functions that exist on Module. You could use burn::tensor::container::TensorContainer to aggregate each parameter and then update the module in question. This is the strategy used in burn-train for gradients accumulation, but this can be used to merge modules as well.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

model state manipulation #1448

{{title}}

Replies: 2 comments

{{title}}

{{title}}

Select a reply

model state manipulation #1448

YumcoderCom Feb 4, 2024

Replies: 2 comments

Nikaidou-Shinku Feb 4, 2024

nathanielsimard Feb 5, 2024 Maintainer

YumcoderCom
Feb 4, 2024

Nikaidou-Shinku
Feb 4, 2024

nathanielsimard
Feb 5, 2024
Maintainer