Vowpal Wabbit predictions for multi-label classification

Question 1

If you specify just one label in --csoaa (even in the -t test mode), it means that only that label is "available" for this example, so no other label can be predicted. This is another difference from --oaa (where you always specify just the correct label). See https://groups.yahoo.com/neo/groups/vowpal_wabbit/conversations/topics/2949.

If all labels are "available" (possible) for every test example, you must list all of them on each line. With -t you can omit the label costs if you only want the --predictions and do not need vw to compute the test loss. So your myTestFile.txt should look like:

1 2 3 |f 1:12 2:13
1 2 3 |f 3:23 4:234
1 2 3 |f 5:12 6:34

and your myTrainFile.txt should look like:

1:0 2:1 3:1 |f 1:12 2:13
1:1 2:0 3:1 |f 3:23 4:234
1:1 2:1 3:0 |f 5:12 6:34
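
If you generate these files from a script, the two layouts above can be sketched roughly as follows. This is only an illustration: the helper names csoaa_train_line and csoaa_test_line are made up here (they are not part of vw), and the 0/1 cost scheme (cost 0 for the correct label, 1 for every other label) is simply the one used in the example above.

```python
# Hypothetical helpers (not part of vw) that build --csoaa input lines.
# Assumption: cost 0 for the correct label and cost 1 for all others,
# as in the training file shown above.

def csoaa_train_line(correct, labels, features):
    """Training line: every label is listed together with its cost."""
    costs = " ".join(f"{k}:{0 if k == correct else 1}" for k in labels)
    return f"{costs} |f {features}"

def csoaa_test_line(labels, features):
    """Test line: every label is listed, costs are omitted."""
    available = " ".join(str(k) for k in labels)
    return f"{available} |f {features}"

labels = [1, 2, 3]
# (correct label, feature string) pairs taken from the example files
examples = [(1, "1:12 2:13"), (2, "3:23 4:234"), (3, "5:12 6:34")]

train_lines = [csoaa_train_line(c, labels, f) for c, f in examples]
test_lines = [csoaa_test_line(labels, f) for _, f in examples]

print("\n".join(train_lines))
print("\n".join(test_lines))
```

Listing every label on every line is the point: any label left out is treated as unavailable for that example.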

Question 2

So, for completeness' sake, here is how it works:

$ cat myTrainFile.txt
1:1.0 |f 1:12 2:13
2:1.0 |f 3:23 4:234
3:1.0 |f 5:12 6:34

$ cat myTestFile.txt
1 2 3 |f 1:12 2:13
1 2 3 |f 3:23 4:234
1 2 3 |f 5:12 6:34

$ vw -t -i myModel.model -p myPred.pred < myTestFile.txt 
only testing
...

$ cat myPred.pred 
2.000000
1.000000
2.000000

So it is maybe a bit surprising that none of the examples is classified correctly, but that is another problem.
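
To check that claim without eyeballing the file, a small sketch like the one below compares the predictions with the expected labels. It assumes the expected labels are 1, 2, 3 in file order (the test examples reuse the training features), and the function names are just for illustration.

```python
# Sketch: compare vw predictions with the expected labels.
# Assumption: the expected labels are 1, 2, 3 in file order, because the
# test examples reuse the features of the three training examples.

def parse_predictions(text):
    """vw writes one prediction per line, e.g. '2.000000'."""
    return [int(float(line)) for line in text.splitlines() if line.strip()]

def accuracy(predicted, gold):
    """Fraction of examples whose predicted label equals the gold label."""
    correct = sum(p == g for p, g in zip(predicted, gold))
    return correct / len(gold)

# Contents of myPred.pred as shown above
pred_text = "2.000000\n1.000000\n2.000000\n"
gold = [1, 2, 3]

predicted = parse_predictions(pred_text)
print(predicted, accuracy(predicted, gold))
```

With the predictions shown above this gives an accuracy of 0.0, i.e. none of the three examples gets its expected label.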

Thanks @Martin Popel!