There is a port of the Python Nilsimsa code here.
Nilsimsa hash / digest in Java?
Question
I'm looking for a Nilsimsa or some other LSH function implementation in Java, for detecting similar strings in a stream of incoming strings.
Doesn't seem reasonable that there is no public implementation of it, and I hate re-inventing the wheel.
Thanks!
Solution
OTHER TIPS
you can find One LSH implementation in TarsosLSH.
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow