Pergunta

Assume, I have 100 text documents, and I want to cluster those documents.

The first step is the construct pairwise similarity matrix 100X100 for the documents

My question is:

what are common way to measure similarity between two documents?

Thanks,

Nenhuma solução correta

Licenciado em: CC-BY-SA com atribuição
scroll top