What is the relationship between samples_per_gpu and workers_per_gpu and sample_ratio？ #19

joeyslv · 2023-05-10T06:51:59Z

I can only use the default

sample_ ratio=[1, 4]
samples_ per_ gpu=4
workers_ per_ gpu=4

But to increase the batchsize a bit,
samples_ per_ gpu=8,
when the program cannot run and a len() error will occur. Can you tell me the relationship between these three and how labeled and unlabeled data is sampled in the project? Thank you very much

The text was updated successfully, but these errors were encountered:

Adamdad · 2023-05-10T07:02:42Z

There are three distinct concepts to understand:

sample_ratio=[1, 4] indicates the ratio of labeled to unlabeled samples within a single GPU. For instance, sample_ratio=[1, 4] means that there is 1 labeled and 4 unlabeled samples on each GPU.
samples_per_gpu=5 refers to the total number of samples per GPU, regardless of whether they are labeled or unlabeled. In fact, sum(sample_ratio) == samples_per_gpu.
workers_per_gpu=5 determines the number of threads that will be used to load the data. The optimal number of workers per GPU depends on your server setup. By default, we set workers_per_gpu equal to samples_per_gpu, but you can reduce this value if your server has limited CPU resources. If necessary, you can set it to 1 or 0.

ConsistentTeacher/configs/consistent-teacher/consistent_teacher_r50_fpn_coco_180k_10p.py

Lines 256 to 296 in 1fa6477

 data = dict( 

 samples_per_gpu=5, 

 workers_per_gpu=5, 

 train=dict( 

 _delete_=True, 

 type="SemiDataset", 

 sup=dict( 

 type="CocoDataset", 

 ann_file="data/coco_semi/semi_supervised/instances_train2017.${fold}@${percent}.json", 

 img_prefix="data/coco/train2017/", 

 pipeline=train_pipeline, 

 ), 

 unsup=dict( 

 type="CocoDataset", 

 ann_file="data/coco_semi/semi_supervised/instances_train2017.${fold}@${percent}-unlabeled.json", 

 img_prefix="data/coco/train2017/", 

 pipeline=unsup_pipeline, 

 filter_empty_gt=False, 

 ), 

 ), 

 val=dict( 

 img_prefix="data/coco/val2017/", 

 ann_file='data/coco/annotations/instances_val2017.json', 

 pipeline=test_pipeline 

 ), 

 test=dict( 

 pipeline=test_pipeline, 

 img_prefix="data/coco/val2017/", 

 ann_file='data/coco/annotations/instances_val2017.json' 

 ), 

 sampler=dict( 

 train=dict( 

 type="SemiBalanceSampler", 

 sample_ratio=[1, 4], 

 by_prob=False, 

 # at_least_one=True, 

 epoch_length=7330, 

 ) 

 ), 

 )

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What is the relationship between samples_per_gpu and workers_per_gpu and sample_ratio？ #19

What is the relationship between samples_per_gpu and workers_per_gpu and sample_ratio？ #19

joeyslv commented May 10, 2023

Adamdad commented May 10, 2023

What is the relationship between samples_per_gpu and workers_per_gpu and sample_ratio？ #19

What is the relationship between samples_per_gpu and workers_per_gpu and sample_ratio？ #19

Comments

joeyslv commented May 10, 2023

Adamdad commented May 10, 2023