Domanda

hi who can i configure solr to find everyting like an texteditor?

i have a field which contains

... text texct blah Computer-Zeitalter blah text m00h ...

i've configured it like this

<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
    <analyzer>
        <tokenizer class="solr.WhitespaceTokenizerFactory" />
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
        <filter class="solr.WordDelimiterFilterFactory" 
                generateWordParts="1" generateNumberParts="0" catenateWords="1"
                catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"
                splitOnNumerics="0" stemEnglishPossessive="1"/>
        <filter class="solr.LowerCaseFilterFactory" />
        <filter class="solr.TrimFilterFactory" />   
        <filter class="solr.EnglishPorterFilterFactory" protected="protwords.txt" />
        <filter class="solr.RemoveDuplicatesTokenFilterFactory" />
        <filter class="solr.ReversedWildcardFilterFactory" />
        <filter class="solr.LengthFilterFactory" min="2" max="100" />
    </analyzer>
</fieldType>

but

select/?q=volltext:*eitalter&version=2.2&start=0&rows=10&indent=on
select/?q=volltext:eitalter&version=2.2&start=0&rows=10&indent=on

give no results, why?

i'm using solr 1.4

È stato utile?

Soluzione

Since you use EnglishPorterFilterFactory, it eliminates the "er" suffix from the Zeitalter and index it as zeitalt. So, if you want to find this keyword then

  • you should either change your query as ...q=volltext:*eitalt&...
  • or you should remove EnglishPorterFilterFactory from the field definition and search using ...q=volltext:*eitalter&...
Autorizzato sotto: CC-BY-SA insieme a attribuzione
Non affiliato a StackOverflow
scroll top