There is a port of the Python Nilsimsa code here.
Nilsimsa hash / digest in Java?
Вопрос
I'm looking for a Nilsimsa or some other LSH function implementation in Java, for detecting similar strings in a stream of incoming strings.
Doesn't seem reasonable that there is no public implementation of it, and I hate re-inventing the wheel.
Thanks!
Решение
Другие советы
you can find One LSH implementation in TarsosLSH.
Не связан с StackOverflow