Question

Is there a package that contains Levenshtein distance counting function which is implemented as a C or Fortran code? I have many strings to compare and stringMatch from MiscPsycho is too slow for this.

Was it helpful?

Solution

levenshteinDist (from the RecordLinkage package) calls compiled C code. Give it a try.

OTHER TIPS

And stringdist in the stringdist package does it too, even faster than levenshteinDist under certain conditions (1)

You could try stringDist from Biostrings as well

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top