Question

I'm looking for a solution to detect different moods/styles expressed by an image. Unfortunately, there is no multi-labeled dataset for this task.

The scenario of defining a multi-label classification model based on single labeled data doesn't seem too absurd to me but still I couldn't find any publications or other sources addressing this problem.

So I'm very thankful for any hint how scenarios like this could be tackled (e.g. deriving a multi-labeled dataset from available single label datasets in a weakly supervised manner).

Was it helpful?

Solution

I would train it using the loss appropriate for multi-label (sigmoid activation/binary cross-entropy loss instead of softmax activation/categorial cross-entropy loss). The model will give you probabilities per label, so it's up to you to decide how to interpret them in your application.

Having said that, some multi-label problems have mostly single labels, much fewer double labels, and almost no labels beyond that. For problems like this, you could try a single-label model and see if it works better than a multi-label one. Or have an ensemble :)

OTHER TIPS

If you go the route suggested by @andris (the only option you may have), the multi label classes remain arbitrary, with probability cut points chosen by you. This would not seem to add true value

Licensed under: CC-BY-SA with attribution
Not affiliated with datascience.stackexchange
scroll top