Fast Levenshtein distance in R?
02-10-2019
Question
Is there a package that provides a Levenshtein distance function implemented in C or Fortran? I have many strings to compare, and stringMatch from MiscPsycho is too slow for this.
Solution
levenshteinDist (from the RecordLinkage package) calls compiled C code. Give it a try.
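A minimal sketch of how that call might look, assuming the RecordLinkage package is already installed. The example words are made up for illustration.

```r
# Assumes: install.packages("RecordLinkage") has been run beforehand
library(RecordLinkage)

# Distance for a single pair of strings
d <- levenshteinDist("kitten", "sitting")

# Vectorized use: words_a[i] is compared with words_b[i]
words_a <- c("kitten", "flaw", "gumbo")
words_b <- c("sitting", "lawn", "gambol")
d_pairs <- levenshteinDist(words_a, words_b)

print(d)
print(d_pairs)
```

Passing whole vectors in one call, rather than looping over pairs in R, is usually where the speedup over a pure-R implementation comes from.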
OTHER TIPS
stringdist in the stringdist package does this too, and is even faster than levenshteinDist under certain conditions (1).
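As a rough sketch, assuming the stringdist package is installed: the default method of stringdist() is "osa" (optimal string alignment), so method = "lv" has to be requested explicitly to get plain Levenshtein distance. The example words are illustrative only.

```r
# Assumes: install.packages("stringdist") has been run beforehand
library(stringdist)

# Plain Levenshtein distance for one pair
d_lv <- stringdist("kitten", "sitting", method = "lv")

# All pairwise distances between two character vectors
words <- c("kitten", "sitting", "mitten")
m <- stringdistmatrix(words, words, method = "lv")

print(d_lv)
print(m)
```

For many strings, stringdistmatrix() avoids building the pair list by hand. Whether it beats levenshteinDist on a given data set is worth checking with system.time().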
You could try stringDist from Biostrings as well.
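A short sketch of that option, assuming Biostrings has been installed from Bioconductor (for example via BiocManager::install("Biostrings")). It returns a dist object holding all pairwise distances within one vector. The example words are illustrative only.

```r
# Assumes Biostrings is installed from Bioconductor
library(Biostrings)

words <- c("kitten", "sitting", "mitten")

# Pairwise Levenshtein distances within a single character vector
d_bio <- stringDist(words, method = "levenshtein")

# Convert the dist object to a full symmetric matrix for lookup
m_bio <- as.matrix(d_bio)
print(m_bio)
```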
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow