Question

I am working on a dataset of amazon alexa reviews and wish to cluster them in positive and negative clusters. I am using Word2Vec for vectorization so wanted to know the difference between Text Embedding and Word Embedding. Also, which one of them will be useful for my clustering of reviews (Please consider that I want to predict the cluster of any reviews that I enter.) Thanks in advance!

Was it helpful?

Solution

A Text embedding is a vector representation of a text. A trivial way to construct a text embedding is to average the word embeddings of each word in the text.

However using this method, you will lose contextual information.

Licensed under: CC-BY-SA with attribution
Not affiliated with datascience.stackexchange
scroll top