[Feature] Support RT-DETR #11395

flytocc · 2024-01-17T13:08:07Z

Motivation

Support RT-DETR as discussed in this issue

Referred to the following repositories for implementation details:

Modification

Added support for RT-DETR with variants (r18vd, r34vd, r50vd, r101vd).
Added support for random sizes and interpolations in BatchSyncRandomResize.
Modified ResNetV1d for depth 18 and 34.
Added a specialized varifocal loss, RTDETRVarifocalLoss.

BC-breaking

When the depth is set to 18 or 34 in ResNetV1d, a downsample with conv_bn is now added to layer1.

Checklist

Pre-commit or other linting tools are used to fix the potential lint issues.
The modification is covered by complete unit tests. If not, please add more unit test to ensure the correctness.
If the modification has potential influence on downstream projects, this PR should be tested with downstream projects, like MMDet or MMPreTrain.
The documentation has been modified accordingly, like docstring or example tutorials.

flytocc · 2024-01-17T13:14:06Z

reproduction

all results trained on 1 gpu (V100) with total batch size 16

r18vd with amp

 Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.465
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=1000 ] = 0.639
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=1000 ] = 0.503
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=1000 ] = 0.286
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=1000 ] = 0.501
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=1000 ] = 0.625
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.689
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=300 ] = 0.692
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=1000 ] = 0.692
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=1000 ] = 0.506
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=1000 ] = 0.733
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=1000 ] = 0.872

r18vd without amp

 Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.466
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=1000 ] = 0.640
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=1000 ] = 0.505
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=1000 ] = 0.289
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=1000 ] = 0.498
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=1000 ] = 0.629
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.689
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=300 ] = 0.692
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=1000 ] = 0.692
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=1000 ] = 0.496
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=1000 ] = 0.733
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=1000 ] = 0.864

r50vd with amp

 Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.531
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=1000 ] = 0.714
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=1000 ] = 0.575
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=1000 ] = 0.351
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=1000 ] = 0.578
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=1000 ] = 0.700
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.722
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=300 ] = 0.724
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=1000 ] = 0.724
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=1000 ] = 0.549
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=1000 ] = 0.766
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=1000 ] = 0.883

hhaAndroid · 2024-01-18T03:12:59Z

@flytocc Thank you very much. I would like to confirm why a previous pull request (PR) could not align the precision. Was something incorrect there?

flytocc · 2024-01-18T05:59:07Z

@flytocc Thank you very much. I would like to confirm why a previous pull request (PR) could not align the precision. Was something incorrect there?

There are many differences in detail, and here are the ones I think are more important (r50vd arch for example):

flytocc/rtdetr	nijkah/rtdetr
norm_decay_mult=`0`	default `1`
BatchSyncRandomResize	RandomChoiceResize
MinIoURandomCrop	RandomCrop
init eccoder with pytorch-like `uniform`	Init `HybridEncoder` with mmcv-like `normal`

flytocc · 2024-02-05T02:10:23Z

The training (w. amp) AP of r50vd arch fluctuates between 52.9 and 53.1.
Random interpolations has almost no effect on AP.

ychensu · 2024-02-20T03:38:31Z

The training AP of r50vd arch fluctuates between 52.9 and 53.1.

Random interpolations has almost no effect on AP.

您好，我试用了下您写的rt-detr，发现在我的数据集中存在不收敛的现象，就是训练到20个epoch左右，map突然变成0，但我在rt-detr官方代码中并没有这个问题，都是用的4卡4batch

flytocc · 2024-02-20T04:19:03Z

The training AP of r50vd arch fluctuates between 52.9 and 53.1.

Random interpolations has almost no effect on AP.

您好，我试用了下您写的rt-detr，发现在我的数据集中存在不收敛的现象，就是训练到20个epoch左右，map突然变成0，但我在rt-detr官方代码中并没有这个问题，都是用的4卡4batch

目前只测试过COCO数据集。你试着可以检查一下数据增强

ychensu · 2024-02-20T07:38:56Z

The training AP of r50vd arch fluctuates between 52.9 and 53.1.

Random interpolations has almost no effect on AP.

您好，我试用了下您写的rt-detr，发现在我的数据集中存在不收敛的现象，就是训练到20个epoch左右，map突然变成0，但我在rt-detr官方代码中并没有这个问题，都是用的4卡4batch

目前只测试过COCO数据集。你试着可以检查一下数据增强

我仅保留了resize至640尺寸的数据增强，结果还是不行，map在21个epoch左右就会降，即使我增大了学习率也不行，我用的是文本检测totaltext数据集，仅在您的代码中出现过这个问题

ychensu · 2024-02-20T07:39:47Z

The training AP of r50vd arch fluctuates between 52.9 and 53.1.

Random interpolations has almost no effect on AP.

您好，我试用了下您写的rt-detr，发现在我的数据集中存在不收敛的现象，就是训练到20个epoch左右，map突然变成0，但我在rt-detr官方代码中并没有这个问题，都是用的4卡4batch

目前只测试过COCO数据集。你试着可以检查一下数据增强

我仅保留了resize至640尺寸的数据增强，结果还是不行，map在21个epoch左右就会降，即使我增大了学习率也不行，我用的是文本检测totaltext数据集，仅在您的代码中出现过这个问题

打错了，降低学习率或者增大batch还是会存在这个问题

flytocc · 2024-02-20T08:41:52Z

@ychensu 要不你到 flytocc/mmdetection 提一个issue

mmeendez8 · 2024-05-03T13:14:31Z

Is this currently blocked?

mm-assistant bot assigned jbwang1997 Jan 17, 2024

flytocc mentioned this pull request Jan 17, 2024

[Training is in progress] [Feature] Support RT-DETR #10498

Open

4 tasks

flytocc added 18 commits May 20, 2024 21:13

init

6244ece

reuse code of DINO

cd67b67

fix amp

fa57d0d

update configs

7c0142b

small fix

c0290a8

revert changes in Resize

60337e1

update configs

028cc3e

fix typo

2ab0a43

fix typo

b7967b2

update

c395ab3

fix amp

7fef092

update configs

aa56e70

fix amp training

10b8637

fix encoder init

2598f82

remove the dependency on mmyolo and fix code style

b6e3ae3

update docs and small fixes

962f52d

fix BatchSyncRandomResiz. __inti__ bug in mmyolo and fix typo

0647715

refactor RTDETRHybridEncoder

4eac4ed

change init method of DetrTransformerEncoder in RTDETRHybridEncoder

12f4ca7

flytocc force-pushed the rtdetr branch from 4ecaebc to 12f4ca7 Compare May 20, 2024 13:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Support RT-DETR #11395

[Feature] Support RT-DETR #11395

flytocc commented Jan 17, 2024 •

edited

flytocc commented Jan 17, 2024

hhaAndroid commented Jan 18, 2024

flytocc commented Jan 18, 2024 •

edited

flytocc commented Feb 5, 2024 •

edited

ychensu commented Feb 20, 2024

flytocc commented Feb 20, 2024 •

edited

ychensu commented Feb 20, 2024

ychensu commented Feb 20, 2024

flytocc commented Feb 20, 2024

mmeendez8 commented May 3, 2024

[Feature] Support RT-DETR #11395

Are you sure you want to change the base?

[Feature] Support RT-DETR #11395

Conversation

flytocc commented Jan 17, 2024 • edited

Motivation

Modification

BC-breaking

Checklist

flytocc commented Jan 17, 2024

reproduction

hhaAndroid commented Jan 18, 2024

flytocc commented Jan 18, 2024 • edited

flytocc commented Feb 5, 2024 • edited

ychensu commented Feb 20, 2024

flytocc commented Feb 20, 2024 • edited

ychensu commented Feb 20, 2024

ychensu commented Feb 20, 2024

flytocc commented Feb 20, 2024

mmeendez8 commented May 3, 2024

flytocc commented Jan 17, 2024 •

edited

flytocc commented Jan 18, 2024 •

edited

flytocc commented Feb 5, 2024 •

edited

flytocc commented Feb 20, 2024 •

edited