Convert Levenshtein Distance to Error Rate

https://stackoverflow.com//questions/22018489

computer-science
machine-learning
levenshtein-distance
information-theory

21-12-2019
|

문제

Is their a way to convert levenstein distances to error rates?

With the error rate being the fraction of the sequence that is not exactly the same.

해결책

You mean you want to normalize Levenshtein distance to [0, 1]? That's

d(a,b) / max(len(a), len(b))

The denominator is an upper bound on Levenshtein distance, so this gives a figure between zero and one. Proof: assume (without loss of generality) that len(a) > len(b), then you can always transform a into b by substituting len(b) characters and deleting len(a) - len(b) of them, for a total of len(a) - len(b) + len(b) = len(a) operations.

라이센스 : CC-BY-SA ~와 함께 속성

제휴하지 않습니다 StackOverflow