About code for pretraining #4

volgachen opened this issue Jun 4, 2022 · 8 comments

@volgachen

Excuse me, do you have any plans to release the code or instructions for pre-training?

@encounter1997
Owner

encounter1997 commented Jun 5, 2022

Sorry, we do not plan to release the code for pre-training, but it can easily be implemented by replacing the model construction function in the DeiT code with ours.

Hope this helps, and feel free to ask if you run into any difficulties implementing the pre-training code.
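
In case it is useful to others, here is a minimal sketch of that swap, assuming DeiT's usual timm create_model path. The build_model import, the query_shape argument, and the embed_dim attribute are placeholders for whatever this repo's model construction function actually exposes:

```python
# Rough sketch (not the authors' code) of pointing DeiT's training script at
# this repo's model via timm's model registry. build_model() is a placeholder
# for the repo's actual model construction function.
import torch.nn as nn
from timm.models.registry import register_model

from models import build_model  # placeholder import: the repo's model builder


class FPDETRForClassification(nn.Module):
    """Wrap the FP-DETR encoder so DeiT can train it as an image classifier."""

    def __init__(self, num_classes=1000):
        super().__init__()
        # Configure for pre-training, e.g. query_shape=(1, 1) as discussed below.
        self.backbone = build_model(query_shape=(1, 1))
        self.head = nn.Linear(self.backbone.embed_dim, num_classes)

    def forward(self, x):
        tokens = self.backbone(x)   # assumed output: (B, N, C) token sequence
        cls_token = tokens[:, 0]    # class token used for the classification loss
        return self.head(cls_token)


@register_model
def fp_detr_base_in1k(pretrained=False, **kwargs):
    # DeiT's main.py builds models with timm's create_model(args.model, ...),
    # so after registration it can be selected with --model fp_detr_base_in1k.
    return FPDETRForClassification(num_classes=kwargs.get('num_classes', 1000))
```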

@volgachen
Author

Thank you for your response.
I guess I should change query_shape to (1, 1). Are there any other configs I should pay attention to?

@encounter1997
Owner

Taking fp-detr-base-in1k.py as an example, several parts of the config should be modified, as follows (see the sketch after the list):

  1. Only the model definition is needed.
  2. The self-attn and the corresponding norm in encoder2 should be removed, and the operation order should be updated.
  3. return_intermediate should be updated since deep supervision is not used during pre-training. The code in the DeiT project may also need to be changed slightly, to obtain the class token from the output sequence for loss computation.
  4. num_classes should be 1000 for ImageNet classification.
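
For readers following along, here is a rough sketch of these deltas, assuming an mmcv-style config like the detection configs in this repo. The exact keys and nesting should be taken from the real fp-detr-base-in1k.py; the operation_order shown is only illustrative:

```python
# Illustrative config fragment only -- not the actual fp-detr-base-in1k.py.
model = dict(
    num_classes=1000,            # 4. ImageNet-1k classification
    transformer=dict(
        query_shape=(1, 1),      # single query prompt for pre-training
        encoder2=dict(
            return_intermediate=False,   # 3. no deep supervision in pre-training
            transformerlayers=dict(
                # 2. prompt_self_attn and its norm removed; only the remaining
                #    attention and FFN stay in the operation order.
                attn_cfgs=[dict(type='MultiScaleDeformableAttention')],
                operation_order=('cross_attn', 'norm', 'ffn', 'norm'),
            ),
        ),
    ),
)
# 1. Everything outside the model definition (detection heads, datasets,
#    detection-specific schedules, etc.) is dropped; DeiT's own training
#    pipeline handles the data and optimization during pre-training.
```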

@volgachen
Author

volgachen commented Jun 6, 2022

  2. The self-attn and the corresponding norm in encoder2 should be removed, and the operation order should be updated.

I suppose the self-attn you mentioned in point 2 is actually prompt_self_attn?

@encounter1997
Owner

Yes, that's right.

@volgachen
Author

Thank you! It seems to work correctly now.

@volgachen
Author

I noticed that sampling_offsets uses a reduced learning rate in the training configuration for detection.
How do you handle sampling_offsets in the pre-training process?

@encounter1997
Owner

We did not carefully tune the learning rate for sampling_offsets and reference_points during pre-training; we simply set their learning rate to the same value as the other parameters in the transformer encoder. Tuning it might lead to better pre-training results, but we did not try.
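
For contrast, here is a small sketch of the two optimizer set-ups. The parameter names follow Deformable DETR conventions, and the learning-rate values are placeholders:

```python
# Sketch only: how the detection fine-tuning and pre-training optimizers differ
# in their treatment of sampling_offsets / reference_points.
import torch


def detection_param_groups(model, base_lr=2e-4, lr_mult=0.1):
    """Detection: sampling_offsets / reference_points get a reduced learning rate."""
    slow, regular = [], []
    for name, p in model.named_parameters():
        if 'sampling_offsets' in name or 'reference_points' in name:
            slow.append(p)
        else:
            regular.append(p)
    return [
        {'params': regular, 'lr': base_lr},
        {'params': slow, 'lr': base_lr * lr_mult},
    ]


def pretraining_param_groups(model, base_lr=5e-4):
    """Pre-training (as described above): every parameter uses the same learning rate."""
    return [{'params': model.parameters(), 'lr': base_lr}]


# Example usage, once `model` exists:
# optimizer = torch.optim.AdamW(pretraining_param_groups(model), weight_decay=0.05)
```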
