I'm using StandardAnalyzer, and my Lucene index contains numeric terms too (e.g. "200"), so the number of terms in the index is too large.
Does anyone know of an Analyzer or Tokenizer that discards numeric terms?
If not, is there an easy way to build one?


Solution

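Antonio, I suggest you try SimpleAnalyzer. It splits text at every non-letter character and lowercases the result, so digits never end up in a term (note that a mixed token such as "level3" is then indexed as "level").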

If this does not work for you, you may have to write your own analyzer.
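
To make the trade-off concrete, here is a minimal plain-Java sketch of the two behaviours. It does not use Lucene and is not Lucene's code: the class and method names (NumericTermSketch, letterTokens, dropNumeric, isNumeric) are made up for illustration. letterTokens approximates letters-only tokenizing in the spirit of SimpleAnalyzer, while dropNumeric shows the kind of per-token test a custom analyzer could apply to keep mixed tokens and drop only purely numeric ones. In Lucene itself such a test would probably live in a token filter (for example a subclass of FilteringTokenFilter overriding accept()), but the exact classes and packages depend on your Lucene version, so treat that as an assumption to check.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only; none of these names come from Lucene.
final class NumericTermSketch {

    // True when the token is non-empty and every character is a digit, e.g. "200".
    static boolean isNumeric(String token) {
        if (token.isEmpty()) {
            return false;
        }
        for (int i = 0; i < token.length(); i++) {
            if (!Character.isDigit(token.charAt(i))) {
                return false;
            }
        }
        return true;
    }

    // Letters-only tokenizing: every non-letter ends the current token,
    // so digits never appear in the output and "level3" becomes "level".
    static List<String> letterTokens(String text) {
        List<String> out = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c)) {
                current.append(Character.toLowerCase(c));
            } else if (current.length() > 0) {
                out.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            out.add(current.toString());
        }
        return out;
    }

    // Whitespace tokenizing plus a filter step: purely numeric tokens are
    // discarded, mixed tokens such as "level3" are kept intact.
    static List<String> dropNumeric(String text) {
        List<String> out = new ArrayList<>();
        for (String token : text.trim().split("\\s+")) {
            if (!token.isEmpty() && !isNumeric(token)) {
                out.add(token.toLowerCase(Locale.ROOT));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        String text = "Room 200 on level3";
        System.out.println(letterTokens(text));
        System.out.println(dropNumeric(text));
    }
}
```

If losing the digits inside mixed tokens is acceptable, the letters-only behaviour is enough and no custom code is needed; the filter-style approach is only worth the effort when terms like "level3" must stay searchable.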

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow