Question

I'm trying to index web pages with Lucene. To do so, I'm calling doc.add(new TextField("content", webPageContent, Store.YES)), where doc is the document about to be added to the index and webPageContent is a string holding the content of the web page as parsed with JSoup.

Is this the right way to do it, i.e., will Lucene compute the frequency of each token created from webPageContent?
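
To make concrete what "the frequency of each token" means here, the following is a minimal plain-Java sketch of per-token counting. It does not use Lucene and is not how Lucene is implemented: the class name TermFrequencySketch, the method countTerms, and the sample string are hypothetical, and the lowercase-and-split tokenizer is only a crude stand-in for a real analyzer. For reference, Lucene's TextField is a tokenized field that is indexed with term frequencies and positions, so counts of this kind are what the index records per document once the configured analyzer has produced the tokens.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

public class TermFrequencySketch {

    // Counts how often each token occurs in the given text.
    // Assumption: tokens are lowercased runs of letters and digits; a real
    // Lucene analyzer applies its own, more elaborate tokenization rules.
    static Map<String, Integer> countTerms(String content) {
        Map<String, Integer> freqs = new LinkedHashMap<>();
        String[] tokens = content.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+");
        for (String token : tokens) {
            if (token.isEmpty()) {
                continue; // split() can yield an empty leading token
            }
            freqs.merge(token, 1, Integer::sum);
        }
        return freqs;
    }

    public static void main(String[] args) {
        // Hypothetical stand-in for the text extracted from a web page
        String webPageContent = "Lucene indexes text. Lucene counts terms.";
        System.out.println(countTerms(webPageContent));
        // prints {lucene=2, indexes=1, text=1, counts=1, terms=1}
    }
}
```

The map is keyed by token in order of first appearance, which keeps the printed output stable for a given input.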

No correct solution

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow