
Multi-label Classification task #2

Open · abdullahshafin opened this issue Mar 21, 2019 · 17 comments
Labels: enhancement (New feature or request), question (Further information is requested)

Comments

@abdullahshafin

I just wanted to know if this can be applied to a multi-label classification problem with a sigmoid output activation. That is, multiple labels can be 1 at the same time, so the sum of the probabilities is not necessarily equal to 1 (as it is with softmax).

I came to your repo from this issue. Please let me know which loss function I can use in this scenario. I looked at the code but wasn't entirely sure that the binary_focal_loss function is suitable for this problem; it looked as if it's only for binary classification, not for a multi-label classification task.

abdullahshafin changed the title from "Multi-label Classification task" to "Multi-label Classification task label:question" on Mar 21, 2019
umbertogriffo added the question (Further information is requested) label on Mar 22, 2019
@umbertogriffo (Owner)

Hi @abdullahshafin, you can't apply this to a multi-label classification problem with a sigmoid output activation. Only the softmax case is supported.
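
For reference, a minimal usage sketch of the supported multi-class setup (the model shape here is hypothetical; the categorical_focal_loss call form is the one used later in this thread, and the import path may differ from the repo's actual layout):

```python
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense

# Assumed import; adjust to wherever this repo's losses module lives.
from losses import categorical_focal_loss

# Hypothetical multi-class model: softmax output, exactly one class per example.
model = Sequential([
    Dense(64, activation="relu", input_shape=(20,)),
    Dense(3, activation="softmax"),
])
model.compile(
    optimizer="adam",
    loss=[categorical_focal_loss(alpha=.25, gamma=2)],
    metrics=["accuracy"],
)
```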

abdullahshafin changed the title from "Multi-label Classification task label:question" back to "Multi-label Classification task" on Mar 22, 2019
@abdullahshafin (Author)

Hi @umbertogriffo

Thanks for the reply!

Do you mean using softmax for multi-label classification (like the Facebook paper)? It's still a bit unclear; normally, softmax is not used for multi-label classification. Can you explain what inputs you expect for your two functions, binary_focal_loss and categorical_focal_loss? Do you expect only 2 classes (binary), or do they work for more than 2 classes?

From my understanding, when talking about multiple target classes, Keras uses binary_crossentropy (keras.losses.binary_crossentropy) for multi-label classification tasks, where the output activation should then be sigmoid, and categorical_crossentropy (keras.losses.categorical_crossentropy) for multi-class classification tasks with a softmax output activation.

Just to be sure we are both on the same page, I will explain below what I mean by the multi-label and multi-class classification terminologies.

In multi-label classification with 3 classes and 5 examples, the target matrix (one row per example) would look like:

```
[0 0 1
 1 0 1
 0 0 0
 0 1 1
 0 1 0]
```

The target matrix for multi-class classification with a similar configuration would look like:

```
[0 0 1
 1 0 0
 0 1 0
 0 1 0
 1 0 0]
```
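
To make the distinction concrete, here is a minimal sketch of the two setups (layer sizes and input shapes are made up for illustration):

```python
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense

# Multi-label: sigmoid output + binary_crossentropy. Each of the 3 outputs
# is an independent 0/1 decision, so several can be 1 at once.
multi_label = Sequential([
    Dense(32, activation="relu", input_shape=(10,)),
    Dense(3, activation="sigmoid"),
])
multi_label.compile(optimizer="adam", loss="binary_crossentropy")

# Multi-class: softmax output + categorical_crossentropy. The 3 outputs
# sum to 1 and exactly one class is correct per example.
multi_class = Sequential([
    Dense(32, activation="relu", input_shape=(10,)),
    Dense(3, activation="softmax"),
])
multi_class.compile(optimizer="adam", loss="categorical_crossentropy")
```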

@umbertogriffo (Owner)

@abdullahshafin you're absolutely right.
I meant that you can apply this to a multi-class classification problem with a softmax output activation. Multi-label classification isn't supported yet.

@abdullahshafin (Author)

@umbertogriffo thanks a lot for your reply and for the clarification. I tried to use your code with a few modifications for multi-label classification. After looking at the code in detail, I strongly believe it should work. However, since I am not getting good results on my classification task, I cannot verify that yet. My task is already quite difficult to learn, and I haven't had success with a weighted binary CE loss either.

Once I try it on some other task and can verify whether it works for multi-label classification, I will update here.

@umbertogriffo (Owner)

@abdullahshafin thanks for opening this issue. It would be great if you could help me adapt the code for multi-label classification. I'll try to find a task that you can use for your experiments.

umbertogriffo added the enhancement (New feature or request) label on Mar 26, 2019
@abdullahshafin (Author)

abdullahshafin commented Apr 3, 2019

@umbertogriffo Sorry, I've been busy verifying whether my approach was right. As of now, it seems my loss function is not correct. Once I have a correct focal loss implementation for multi-label classification, I will definitely share it.
For now, I am approaching the problem with other methods: 1) a weighted binary CE loss, 2) under-/over-sampling the dataset.
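
For what it's worth, the weighted binary CE I mean is along these lines (a sketch; the pos_weight value is a hypothetical hyperparameter, not something from this repo):

```python
from tensorflow.keras import backend as K

def weighted_binary_crossentropy(pos_weight):
    # Elementwise BCE with the positive term scaled by pos_weight,
    # to counter label imbalance in multi-label targets.
    def loss(y_true, y_pred):
        y_pred = K.clip(y_pred, K.epsilon(), 1. - K.epsilon())
        bce = -(pos_weight * y_true * K.log(y_pred)
                + (1. - y_true) * K.log(1. - y_pred))
        return K.mean(bce, axis=-1)  # average over the labels
    return loss

# e.g. model.compile(optimizer="adam", loss=weighted_binary_crossentropy(5.0))
```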

@umbertogriffo (Owner)

umbertogriffo commented Apr 4, 2019

@abdullahshafin don't worry, let me know if I can help you somehow.

@oleksandrlazariev

@abdullahshafin you could just remove K.sum from the final return statement. That should work for a multi-label classification task.
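
Roughly, the idea is to keep the loss elementwise instead of collapsing it with a final K.sum, so each label contributes its own binary focal term. A sketch of what that could look like (this follows the standard binary focal-loss form, not the repo's code verbatim):

```python
from tensorflow.keras import backend as K

def multilabel_focal_loss(alpha=0.25, gamma=2.0):
    # Elementwise binary focal loss for sigmoid outputs, where several
    # labels can be 1 at the same time.
    def loss(y_true, y_pred):
        y_pred = K.clip(y_pred, K.epsilon(), 1. - K.epsilon())
        pos = -alpha * K.pow(1. - y_pred, gamma) * y_true * K.log(y_pred)
        neg = (-(1. - alpha) * K.pow(y_pred, gamma)
               * (1. - y_true) * K.log(1. - y_pred))
        # No final K.sum over everything: return the per-label loss and
        # let Keras handle the reduction.
        return K.mean(pos + neg, axis=-1)
    return loss
```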

@xingyi-li

@oleksandrlazariev Hi, I'm interested in your statement, but I don't clearly understand what you mean. Would you explain your idea in detail? Thanks a lot!

@bryanmooremd

@umbertogriffo My understanding is that with alpha = 1 and gamma = 0, the focal loss should produce results identical to cross-entropy. However, when I compile with loss=[categorical_focal_loss(alpha=.25, gamma=2)] vs. loss=sparse_categorical_crossentropy, I get very different results. Have you directly compared the two, and can you comment? I have 0/1 labels that are not one-hot encoded.
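
For reference, the gamma = 0, alpha = 1 reduction is easy to check numerically (a quick NumPy sketch with made-up probabilities). Note also that categorical_focal_loss expects one-hot targets while sparse_categorical_crossentropy expects integer class ids, which on its own makes the two compile calls above an apples-to-oranges comparison:

```python
import numpy as np

# With gamma = 0 and alpha = 1, the focal term vanishes:
# FL(p_t) = -alpha * (1 - p_t)**gamma * log(p_t)  ->  -log(p_t) = CE
y_true = np.array([[0., 1., 0.]])     # one-hot target
y_pred = np.array([[0.2, 0.7, 0.1]])  # made-up softmax output

eps = 1e-7
p = np.clip(y_pred, eps, 1. - eps)
ce = -np.sum(y_true * np.log(p), axis=-1)

alpha, gamma = 1.0, 0.0
fl = np.sum(alpha * (1. - p) ** gamma * (-y_true * np.log(p)), axis=-1)

print(ce, fl)  # both ~0.3567: identical when gamma = 0 and alpha = 1
```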

@jizhang02

Hello, for the multi-class loss function, do we need to do one-hot encoding?

@talhaanwarch

I am just checking whether focal loss for multi-label classification has been implemented or not.

@Sandeep418

> I tried to use your code with a few modifications for multi-label classification. [...] Once I try it on some other task and can verify whether it works for multi-label classification, I will update here.

Hi @abdullahshafin, have you succeeded in changing it to multi-label classification?

@gnai

gnai commented Jun 23, 2020

> Hello, for the multi-class loss function, do we need to do one-hot encoding?

As far as I know, yes. Check this link, it might be useful:
https://www.depends-on-the-definition.com/guide-to-multi-label-classification-with-neural-networks/
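
In short, the one-hot encoding can be done with Keras' built-in helper (a minimal sketch matching the 3-class example earlier in this thread):

```python
import numpy as np
from tensorflow.keras.utils import to_categorical

labels = np.array([2, 0, 1, 1, 0])  # integer class ids
one_hot = to_categorical(labels, num_classes=3)
# [[0. 0. 1.]
#  [1. 0. 0.]
#  [0. 1. 0.]
#  [0. 1. 0.]
#  [1. 0. 0.]]
```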

@umbertogriffo (Owner)

> My understanding is that with alpha = 1 and gamma = 0, the focal loss should produce results identical to cross-entropy. However, when I compile with loss=[categorical_focal_loss(alpha=.25, gamma=2)] vs. loss=sparse_categorical_crossentropy, I get very different results.

There was a bug; it has been fixed.

@thusinh1969

Try this: https://www.programmersought.com/article/60001511310/
It covers binary, multi-class, and multi-label cases, and it seems to work for me.

Steve

@longsc2603

Hi, I'm sorry to bump this old thread, but I came across this repo of yours and wonder if I can apply it to my case. I have a BiLSTM + CRF model whose output shape is (None, sequence_length, num_class). Since it is a CRF-extended model (I'm using keras-contrib for the CRF layer, by the way), the output is one-hot encoded, so I can't really use class_weight in model.fit. Is there any way I can use this loss in my case?
