I'm looking for a Nilsimsa or some other LSH function implementation in Java, for detecting similar strings in a stream of incoming strings.

Doesn't seem reasonable that there is no public implementation of it, and I hate re-inventing the wheel.

Thanks!

有帮助吗?

解决方案

There is a port of the Python Nilsimsa code here.

其他提示

you can find One LSH implementation in TarsosLSH.

许可以下: CC-BY-SA归因
不隶属于 StackOverflow
scroll top