I am working on a dataset of amazon alexa reviews and wish to cluster them in positive and negative clusters. I am using Word2Vec for vectorization so wanted to know the difference between Text Embedding and Word Embedding. Also, which one of them will be useful for my clustering of reviews (Please consider that I want to predict the cluster of any reviews that I enter.) Thanks in advance!

有帮助吗?

解决方案

A Text embedding is a vector representation of a text. A trivial way to construct a text embedding is to average the word embeddings of each word in the text.

However using this method, you will lose contextual information.

许可以下: CC-BY-SA归因
scroll top