Ranking of phonetic search results

Question 1

Your query should work as you specify. Since you set inject=true on your PhoneticFilter, an exact match should indeed produce more term matches (both a metaphone match and a plain-text match), and my testing bears this out.

The problem I do see is that your analysis leaves you with case-sensitive searching for exact matches. If you index "John" and search for "john", the phonetic matching will work out just fine, but you'll miss the exact match because of the case sensitivity.

Simply adding a LowerCaseFilter to your filter chain should fix that. I would recommend adding it directly above your PhoneticFilter, like:

filters = { 
        @TokenFilterDef(factory = StandardFilterFactory.class), 
        @TokenFilterDef(factory = LowerCaseFilterFactory.class),
        @TokenFilterDef(factory = PhoneticFilterFactory.class, params = {
            @Parameter(name = "encoder", value = "DoubleMetaphone"), 
            @Parameter(name = "inject", value = "true") 
        }) 
}

Placing it above the PhoneticFilterFactory keeps the metaphone codes in uppercase while the plain-text tokens are lowercased. This not only follows convention, but also ensures that a metaphone code can never match a plain-text token. I can't actually think of a case where that would be a concern, but it seems nice anyway.
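To make the effect of the filter order concrete, here is a small plain-Java sketch that needs no Lucene on the classpath. It only imitates the behaviour described above: toyEncode is a made-up stand-in for a phonetic encoder (it is not Double Metaphone), and analyze imitates a phonetic filter with inject=true by emitting both the original token and its code. All names in it are hypothetical.

```java
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.Locale;
import java.util.Set;

class InjectDemo {
    // Toy stand-in for a phonetic encoder (NOT Double Metaphone):
    // upper-cases the token and drops vowels and 'H'.
    static String toyEncode(String token) {
        return token.toUpperCase(Locale.ROOT).replaceAll("[AEIOUH]", "");
    }

    // Imitates a phonetic filter with inject=true: for every token,
    // emit the (optionally lowercased) token plus its upper-case code.
    static Set<String> analyze(String text, boolean lowercaseFirst) {
        Set<String> terms = new LinkedHashSet<>();
        for (String token : text.trim().split("\\s+")) {
            String t = lowercaseFirst ? token.toLowerCase(Locale.ROOT) : token;
            terms.add(t);
            terms.add(toyEncode(t));
        }
        return terms;
    }

    // Number of terms the indexed text and the query text have in common.
    static int sharedTerms(String indexed, String query, boolean lowercaseFirst) {
        Set<String> shared = new HashSet<>(analyze(indexed, lowercaseFirst));
        shared.retainAll(analyze(query, lowercaseFirst));
        return shared.size();
    }

    public static void main(String[] args) {
        System.out.println(sharedTerms("John", "john", false)); // 1: only the code matches
        System.out.println(sharedTerms("John", "john", true));  // 2: code and plain text match
        System.out.println(sharedTerms("Jon", "john", true));   // 1: only the code matches
    }
}
```

With lowercasing in front of the phonetic step, the exact spelling shares two terms with the query while the merely similar-sounding name shares one, which is the difference the scorer can pick up on.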

Question 2

Both jon and john are exactly the same from the point of view of a phonetic analyzer. Hibernate Search allows you to define multiple analyzers, and you can also index the same property multiple times using the plural form of the annotation, @Fields.
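A rough sketch of what indexing one property twice might look like. The Person entity, the analyzer name "phonetic", and the field names are only illustrative, and the factory imports assume a Hibernate Search version that still uses the Solr analysis package; newer versions ship the same factories under org.apache.lucene.analysis.*, so adjust the imports to your version.

```java
import javax.persistence.Entity;
import javax.persistence.GeneratedValue;
import javax.persistence.Id;

// Assumption: Solr-era factory packages; newer versions use org.apache.lucene.analysis.*
import org.apache.solr.analysis.LowerCaseFilterFactory;
import org.apache.solr.analysis.PhoneticFilterFactory;
import org.apache.solr.analysis.StandardTokenizerFactory;
import org.hibernate.search.annotations.Analyzer;
import org.hibernate.search.annotations.AnalyzerDef;
import org.hibernate.search.annotations.Field;
import org.hibernate.search.annotations.Fields;
import org.hibernate.search.annotations.Indexed;
import org.hibernate.search.annotations.Parameter;
import org.hibernate.search.annotations.TokenFilterDef;
import org.hibernate.search.annotations.TokenizerDef;

@Entity
@Indexed
@AnalyzerDef(name = "phonetic", // hypothetical analyzer name
    tokenizer = @TokenizerDef(factory = StandardTokenizerFactory.class),
    filters = {
        @TokenFilterDef(factory = LowerCaseFilterFactory.class),
        @TokenFilterDef(factory = PhoneticFilterFactory.class, params = {
            @Parameter(name = "encoder", value = "DoubleMetaphone")
        })
    })
public class Person { // hypothetical entity
    @Id
    @GeneratedValue
    private Long id;

    // The same property is indexed twice: once with the default analyzer,
    // once with the phonetic analyzer defined above.
    @Fields({
        @Field(name = "firstname_standard"),
        @Field(name = "firstname_phonetic", analyzer = @Analyzer(definition = "phonetic"))
    })
    private String firstname;
}
```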

Let's say you index the firstname in two fields named firstname_phonetic and firstname_standard. You can then create two Query instances, one targeting each field, and combine them in a BooleanQuery using SHOULD clauses. The scorer will then combine the scores from both queries, so that exact matches get ranked higher.
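The ranking effect can be illustrated without Lucene by a deliberately simplified model in plain Java. This is a toy, not Lucene's scoring formula: it assumes each matching SHOULD clause simply contributes its boost, and toyEncode is again a made-up stand-in for a phonetic encoder. The class and method names are hypothetical.

```java
import java.util.Locale;

class ShouldScoreDemo {
    // Toy stand-in for a phonetic encoder (NOT Double Metaphone).
    static String toyEncode(String token) {
        return token.toUpperCase(Locale.ROOT).replaceAll("[AEIOUH]", "");
    }

    // Toy score: every SHOULD clause that matches adds its boost.
    // Real Lucene scores also depend on term statistics and normalization.
    static float score(String indexedName, String queryName,
                       float standardBoost, float phoneticBoost) {
        float score = 0f;
        // Clause on firstname_standard: exact (case-insensitive) match.
        if (indexedName.equalsIgnoreCase(queryName)) {
            score += standardBoost;
        }
        // Clause on firstname_phonetic: phonetic codes match.
        if (toyEncode(indexedName).equals(toyEncode(queryName))) {
            score += phoneticBoost;
        }
        return score;
    }

    public static void main(String[] args) {
        System.out.println(score("John", "john", 2.0f, 1.0f)); // 3.0: both clauses match
        System.out.println(score("Jon", "john", 2.0f, 1.0f));  // 1.0: phonetic clause only
        System.out.println(score("Mary", "john", 2.0f, 1.0f)); // 0.0: no clause matches
    }
}
```

Because neither clause is required, a document matching only the phonetic field is still returned, just with a lower score than one that matches both fields.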

Question 3

Thanks for the answers. I have now used the annotation order suggested by "femtoRgon" and defined multiple analyzers using @Fields (default and phonetic). I then combine a query on the standard field with one on the phonetic field, using different boost values (a higher boost of 2.0f on the standard field).

Thank you all for the help.

Br, Shane