
Hello, is this package able to train a multi-label dataset with one-hot encoding? #669

Open
Mahiro2211 opened this issue Oct 14, 2023 · 2 comments
Labels
enhancement (New feature or request)

Comments

@Mahiro2211

This is a great package and it has improved my efficiency. When I test on CIFAR-10 with one-hot labels, I can use

label = torch.argmax(label, dim=1)

to transform the one-hot labels into class indices. But when I test on a multi-label dataset, I can't find a nice way to deal with the multi-hot labels.
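
For example, with some made-up labels:

import torch

one_hot = torch.tensor([[0, 0, 1, 0],
                        [0, 1, 0, 0]])
print(torch.argmax(one_hot, dim=1))  # tensor([2, 1]) -- fine when each sample has one label

multi_hot = torch.tensor([[1, 0, 1, 0],
                          [0, 1, 1, 0]])
# argmax would keep only one of the labels here, so it doesn't apply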

At first, I saw this issue, which describes a way to pass in multi-labels, but I want to customize it further because I need to construct a similarity matrix:

label = torch.matmul(label, label.t())
# For a multi-label dataset: if two samples share at least one label, I mark them as similar
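
For example, with made-up multi-hot labels for three samples and four classes:

import torch

label = torch.tensor([[1, 0, 1, 0],
                      [0, 1, 1, 0],
                      [0, 0, 0, 1]]).float()

sim = torch.matmul(label, label.t())
# sim[i, j] counts the labels shared by samples i and j:
# tensor([[2., 1., 0.],
#         [1., 2., 0.],
#         [0., 0., 1.]])
# so (sim > 0) marks every pair that shares at least one label as similar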

I hope to receive a response from you soon. Thank you.

KevinMusgrave added the enhancement label on Oct 15, 2023
KevinMusgrave (Owner) commented Oct 15, 2023

Unfortunately, there isn't a way to pass a custom label-comparison function into miners or loss functions. It would be a good feature to add, though, so I will keep this issue open.

Edit:

Actually, I think you can write a miner to accomplish what you're talking about:

import torch
from pytorch_metric_learning.miners import BaseMiner

class CustomMiner(BaseMiner):
    def mine(self, embeddings, labels, ref_emb, ref_labels):
        # compare labels and ref_labels however you want,
        # then return a tuple (a1, p, a2, n),
        # where (a1, p) are the positive pair indices
        # and (a2, n) are the negative pair indices.
        # For example, to treat pairs that share at least one label as positive:
        sim = torch.matmul(labels.float(), ref_labels.t().float())
        matches = sim > 0
        if labels is ref_labels:
            matches.fill_diagonal_(False)  # a sample is not its own positive
        a1, p = torch.where(matches)
        a2, n = torch.where(sim == 0)
        return a1, p, a2, n


miner = CustomMiner()
pairs = miner(embeddings, labels)
loss = loss_fn(embeddings, indices_tuple=pairs)

It's not ideal, but it's the only workaround I can think of.
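
For instance, a self-contained run could look something like this (random tensors just for illustration, and ContrastiveLoss standing in for whatever loss you're using):

import torch
from pytorch_metric_learning import losses

embeddings = torch.randn(8, 128)            # made-up batch of embeddings
labels = (torch.rand(8, 10) > 0.7).float()  # made-up multi-hot labels

loss_fn = losses.ContrastiveLoss()
miner = CustomMiner()
# if your version rejects 2D labels in forward, call
# miner.mine(embeddings, labels, embeddings, labels) directly instead
pairs = miner(embeddings, labels)
loss = loss_fn(embeddings, indices_tuple=pairs)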

@Mahiro2211 (Author)

Thank you for your response. I will try it.
