labels_true
is the "true" assignment of points to labels: which cluster they should actually belong on. This is available because make_blobs
knows which "blob" it generated the point from.
You can't get that for your own arbitrary data X
, unless you have some kind of true labels for the points (in which case you wouldn't be doing clustering anyway). This just shows some measures of how well the clustering performed in a fake case where you know the true answer.