سؤال

In SOLR 3.5 text field type the StopFilterFactory is listed before the PorterStemFilterFactory.

does this mean that if I wanted to stop for example "game" and "games" I would have to add both to stopwords?

if so would moving the StopFilterFactory after the PorterStemFilterFactory, and adding just "game" to stopwords cause occurrences of both "game" and "games" to be stripped?

I guess the true question is what is the best way to do this, and do I need to add all variations of the word to stopwords?

هل كانت مفيدة؟

المحلول

PorterStemFilterFactory provides agressive stemming and having is before stop word filter may not cause proper stop word removal as the root may be different then the one you are trying to stop.
You can add handling for plurals only you can use solr.EnglishMinimalStemFilterFactory before the stop word filter.
This would handle the plurals and the stop words would then need to handle just the singular version.
You can then add the PorterStemFilterFactory filter to handle the proper stemming.

مرخصة بموجب: CC-BY-SA مع الإسناد
لا تنتمي إلى StackOverflow
scroll top