TypeError: 'DataNode' object does not support item assignment #5467

jackdaw213 · 2024-05-14T07:33:07Z

Describe the question.

@pipeline_def(device_id=0)
    def dali_pipeline(image_dir):
        images, _ = fn.readers.file(file_root=image_dir, 
                                    files=utils.list_images(image_dir),
                                    random_shuffle=True, 
                                    name="Reader")
        H, W = 256, 256
        
        images = fn.decoders.image(images, device="mixed", output_type=types.RGB)
        images = images / 255
        images = fn.resize(images, size=512)
        images = fn.crop_mirror_normalize(images, 
                                        dtype=types.FLOAT,
                                        output_layout="HWC",
                                        crop=(H, W),
                                        crop_pos_x=fn.random.uniform(range=(0, 1)),
                                        crop_pos_y=fn.random.uniform(range=(0, 1)))

        images = fn.python_function(images, function=rgb2lab)

        images = fn.transpose(images, perm=[2, 0, 1])
        color = images[1:, :, :] / 110
        black = (fn.expand_dims(images[0, :, :], axes=0) - 50) / 100

        #How to loop through each image ?
        for per image:
            g = Geometric(1/8)
            P = np.random.choice([1, 2, 3, 4, 5, 6, 7, 8, 9])
            mask = types.Constant(shape=(2, H, W), value=0, dtype=types.FLOAT, device="gpu")
            
            for i in range(int(g.sample().item())):
                h = int(torch.clip(torch.normal(mean=torch.tensor((H-P+1)/2.), std=torch.tensor((H-P+1)/4.)), 0, W-P))
                w = int(torch.clip(torch.normal(mean=torch.tensor((W-P+1)/2.), std=torch.tensor((W-P+1)/4.)), 0, W-P))
                
                # Error here
                mask[:,h:h+P,w:w+P] = fn.reductions.mean(fn.reductions.mean(color[:,h:h+P,w:w+P],axes=2,keep_dims=True),axes=1,keep_dims=True)
        
        black = fn.cat(black, mask, axis=0)

        return black, color

My dali pipeline is the above and I have 2 questions:

How do I loop and calculate a mask for each image? Is there any efficient way of doing it outside of loop ?
Mask is a DataNode and does not support item assignment. How can I resolve this issue ? I tried to create a mask as a Pytorch tensor and used torch.mean() but color is a DataNode which does not work with PyTorch

Check for duplicates

I have searched the open bugs/issues and have found no duplicates for this bug report

The text was updated successfully, but these errors were encountered:

JanuszL · 2024-05-14T16:28:53Z

Hi @jackdaw213,

Thank you for reaching out. A couple of observations from our side:

you shouldn't loop over the images in a loop - in DALI batch is implicit and each operation is applied to all samples in it. If you want different processing for some samples please check the conditional execution
DALI doesn't support loops inside the pipeline that are evaluated in the runtime. So for i in range(3) will work, but for i in range(fn.random....) won't
DataNode and does not support item assignment - you can create multiple pieces of the mask and then cat them together like here

jackdaw213 · 2024-05-15T02:16:50Z

Hello @JanuszL, thank you for the answers

you shouldn't loop over the images in a loop - in DALI batch is implicit and each operation is applied to all samples in it. If you want different processing for some samples please check the conditional execution

The operation is to create g.sample() number of P sized square then calculate the average color inside those squares and cat it with black, this operation is done for each image. I think I can figure a workaround for g.sample() but I do not know how to approach the average color mask for each image step. Any tips ?

DALI doesn't support loops inside the pipeline that are evaluated in the runtime. So for i in range(3) will work, but for i in range(fn.random....) won't

Ah that's unfortunate, thank you for the info

DataNode and does not support item assignment - you can create multiple pieces of the mask and then cat them together like here

I don't know if cat can replace the item assignments in this situation or maybe I do not understand the example correctly. Color is the 2 channels AB in LAB image and I want to sample multiples P size squares and average the color inside those squares. Then cat the mask with those squares to L/black but I don't really know how to cat multiple squares individually to the mask from the examples you gave me

mzient · 2024-05-15T09:31:21Z

Hello @jackdaw213,
If your loop isn't too long, then it should be possible to unroll it to the maximum length. With that you could use conditional execution to emulate shorter loops. Inside the loop you could generate the mean values and your square coordinates which you could then stack and pass to fn.erase.

jackdaw213 · 2024-05-15T14:21:21Z

Hello @mzient, thank you for your response

If your loop isn't too long, then it should be possible to unroll it to the maximum length. With that you could use conditional execution to emulate shorter loops

That's a good idea but sometimes the loop would get quite lengthy so that's not suitable for this situation

Inside the loop you could generate the mean values and your square coordinates which you could then stack and pass to fn.erase.

This will create the same mask with the same square size/location for all images in the batch right ? But I want a different mask for each image in the batch, is it possible ?

mzient · 2024-05-16T13:41:19Z

Hello @mzient, thank you for your response

If your loop isn't too long, then it should be possible to unroll it to the maximum length. With that you could use conditional execution to emulate shorter loops

That's a good idea but sometimes the loop would get quite lengthy so that's not suitable for this situation

How long could it get?

Inside the loop you could generate the mean values and your square coordinates which you could then stack and pass to fn.erase.

This will create the same mask with the same square size/location for all images in the batch right ? But I want a different mask for each image in the batch, is it possible ?

No; the batch in DALI is implicit. When you do: slice = img[t:b, l:r], you're in fact slicing all images - and if l, t, r, b are DataNodes, not constants, they too are batches. With explicit batch this would be:

slice = [
  img[i][t[i]:b[i], l[i]:r[i]] for i in range(batch_size)
]

jackdaw213 · 2024-05-17T13:52:46Z

How long could it get?

Oops, I somehow think that your idea was to check for every possible value of g.sample() instead of just checking if i < g.sample() (guess I was too sleepy that night). And yes the loop is not that long, this would work just fine, thank you

No; the batch in DALI is implicit. When you do: slice = img[t:b, l:r], you're in fact slicing all images - and if l, t, r, b are DataNodes, not constants, they too are batches. With explicit batch this would be:

Ah, that cleared up some of my misunderstandings. However, I still do not understand your intention of generating a mask for each image. Now I know that fn.reductions.mean(color[:,h:h+P,w:w+P]) calculates the mean values of that square for every image in the batch. Wouldn't the loop generate coordinates/means for a single mask ?

jackdaw213 · 2024-05-21T13:32:09Z

Hello @mzient, I trying to make the loop work, but there is a bug that I can't seem to fix: TypeError: float() argument must be a string or a real number, not 'DataNodeDebug'. The variables x, y, and P are all DataNode, so fn.erase should just work like the example you gave me, right?

g = Geometric(1/8)
sample = int(g.sample().item())
P = fn.random.uniform(range=[1, 9], shape=(), dtype=types.UINT8)
masks = []
mean = None

for i in range(100):
    if i > sample:
        break

    x = fn.cast(fn.random.normal(mean=(H-P+1)/2., 
                                            stddev=(H-P+1)/4.), 
                                            dtype=types.UINT8)
    y = fn.cast(fn.random.normal(mean=(W-P+1)/2., 
                                            stddev=(W-P+1)/4.), 
                                            dtype=types.UINT8)

    mean = fn.reductions.mean(fn.reductions.mean(color[:, x:x+P, y:y+P], axes=2, keep_dims=True), axes=1, keep_dims=True)

    mask = types.Constant(shape=(2, H, W), value=0, dtype=types.FLOAT, device="gpu")
    mask = fn.erase(mask, fill_value=mean, anchor=(x, y), shape=(P, P), axes=(1, 2)) <--- Error here
    masks.append(mask)
    
black = fn.cat(black, fn.stack(*masks), axis=0)

JanuszL · 2024-05-21T15:34:32Z

Hi @jackdaw213,

anchor and shape should be a data node/tensor with the right dimensionality, not a tuple of them. Please stack/cat them together and into one value and try again.

jackdaw213 · 2024-05-22T12:21:18Z

Hello @JanuszL, modified my code a bit and added fn.stack to my code but there are some issues

g = Geometric(1/8)
sample = int(g.sample().item())
P = fn.random.uniform(range=[1, 9], shape=(), dtype=types.UINT8)
mask = types.Constant(shape=(2, H, W), value=0, dtype=types.FLOAT, device="gpu")
mean = None

for i in range(100):
    if i > sample:
        break

    x = fn.cast(fn.random.normal(mean=(H-P+1)/2., 
                                            stddev=(H-P+1)/4.), 
                                            dtype=types.UINT8)
    y = fn.cast(fn.random.normal(mean=(W-P+1)/2., 
                                            stddev=(W-P+1)/4.), 
                                            dtype=types.UINT8)

    x = math.clamp(x, 0, H-P) <-- Error if I try to stack both x, y after clamping them
    y = math.clamp(y, 0, W-P)

    mean = fn.reductions.mean(fn.reductions.mean(color[:, x:x+P, y:y+P], axes=2, keep_dims=True), axes=1, keep_dims=True)

    mask = types.Constant(shape=(2, H, W), value=0, dtype=types.FLOAT, device="gpu")
    mask = fn.erase(mask, fill_value=mean, anchor=fn.stack(x, y), shape=fn.stack(P, P), axes=(1, 2)) <-- First error here
    
black = fn.cat(black, mask, axis=0)

When I run the code above it gives me this error: TypeError: RunOperatorGPU(): incompatible function arguments. The following argument types are supported: 1. (self: nvidia.dali.backend_impl.PipelineDebug, arg0: int, arg1: List[nvidia.dali.tensors.TensorListGPU], arg2: Dict[str, nvidia.dali.tensors.TensorListCPU], arg3: int) -> List[nvidia.dali.tensors.TensorListGPU]. From what I understand, it seems like fill_value needs to be an integer. Casting mean to uint8 doesn't work, but setting fill_value to 1 does. Is there any way to make mean the fill_value? mean has the shape of (2,1,1)
I also tried to clip/clamp x and y to the range [0, H/W - P] but then fn.stack(x, y) will throw this error TypeError: object of type 'DataNode' has no len()

mzient · 2024-05-22T12:56:15Z

@jackdaw213 I'm quite sure there are some constructs that work in "regular" mode but not in "debug". Can you try to run your code without debugging?

jackdaw213 · 2024-05-22T23:07:35Z

@mzient I turn off debug mode and those 2 issues seem to be gone, however, a new issue pops up. color is a GPU data note so mean is also a GPU data note. But fn.erase needs mean to be a CPU datanote Error while specifying argument 'fill_value'. Named argument inputs to operators must be CPU data nodes. However, a GPU data node was provided. I tried adding device="cpu" to fn.mean but it does not work because An operator with device='cpu' cannot accept GPU inputs. Also from #1176 it seems that DALI does not support GPU to CPU transfer ? Not related to current issue but why is this the case ?

jackdaw213 · 2024-05-28T01:45:54Z

Any ideas @mzient ? I did some more research, but I couldn't find any method to transfer data from the GPU to the CPU.

mzient · 2024-05-28T08:19:31Z

@jackdaw213 Thank you for checking non-debug pipeline. Currently there's no way to go from GPU to CPU within a single pipeline. We're actively working on relaxing the execution model to allow arbitrary transfers, however, a usable version is still a release or two away.

jackdaw213 · 2024-05-29T13:20:50Z

Hello @mzient, it's great to hear that GPU2CPU transfer is coming in the future. However, I'm unsure how to approach my problem in the meantime. The only solutions I can think of are to either make the DALI pipeline run on the CPU or find a way to make torch.mean() to accept DataNote as input, but, I'm not aware of any method to do so. Could you provide some guidance?

jackdaw213 added the question Further information is requested label May 14, 2024

dali-automaton assigned szkarpinski May 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TypeError: 'DataNode' object does not support item assignment #5467

TypeError: 'DataNode' object does not support item assignment #5467

jackdaw213 commented May 14, 2024

JanuszL commented May 14, 2024

jackdaw213 commented May 15, 2024

mzient commented May 15, 2024

jackdaw213 commented May 15, 2024

mzient commented May 16, 2024 •

edited

jackdaw213 commented May 17, 2024

jackdaw213 commented May 21, 2024

JanuszL commented May 21, 2024

jackdaw213 commented May 22, 2024

mzient commented May 22, 2024 •

edited

jackdaw213 commented May 22, 2024

jackdaw213 commented May 28, 2024

mzient commented May 28, 2024

jackdaw213 commented May 29, 2024

TypeError: 'DataNode' object does not support item assignment #5467

TypeError: 'DataNode' object does not support item assignment #5467

Comments

jackdaw213 commented May 14, 2024

Describe the question.

Check for duplicates

JanuszL commented May 14, 2024

jackdaw213 commented May 15, 2024

mzient commented May 15, 2024

jackdaw213 commented May 15, 2024

mzient commented May 16, 2024 • edited

jackdaw213 commented May 17, 2024

jackdaw213 commented May 21, 2024

JanuszL commented May 21, 2024

jackdaw213 commented May 22, 2024

mzient commented May 22, 2024 • edited

jackdaw213 commented May 22, 2024

jackdaw213 commented May 28, 2024

mzient commented May 28, 2024

jackdaw213 commented May 29, 2024

mzient commented May 16, 2024 •

edited

mzient commented May 22, 2024 •

edited