Does anyone know how the sha1 sum in Wikipedia dumps is built? I just found: "These contain information like the sha1 sum of each revision text..." (http://meta.wikimedia.org/wiki/Data_dumps/Dump_format)

But when I calculate the sha1 sum of any revision text myself, I never get the same value. So I thought maybe something else influences this value. I took all the text between the "text" tags. Thanks


Solution

The sha1 sum is converted from a hexadecimal number to a base-36 number, and it is computed over just the revision text between the <text></text> tags. Thanks to MaxSem!
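
For anyone who wants to reproduce this, here is a minimal Python sketch of the hex-to-base-36 conversion. The function names are my own, and a few things are assumptions rather than anything stated above: that the text is hashed as UTF-8 bytes, that XML entities such as `&lt;` have already been unescaped (an XML parser does this for you), and that the result is left-padded with zeros to 31 characters, which is what the values in the dumps appear to use.

```python
import hashlib

# Digits 0-9 followed by lowercase a-z, i.e. the usual base-36 alphabet.
ALPHABET = "0123456789abcdefghijklmnopqrstuvwxyz"


def to_base36(n):
    """Convert a non-negative integer to a lowercase base-36 string."""
    if n == 0:
        return "0"
    digits = []
    while n:
        n, remainder = divmod(n, 36)
        digits.append(ALPHABET[remainder])
    return "".join(reversed(digits))


def revision_sha1_base36(text, width=31):
    """Hash the revision text with SHA-1 and return the digest in base 36.

    Assumptions: `text` is the already-unescaped content of the <text>
    element, it is encoded as UTF-8, and the result is zero-padded to
    `width` characters (31 is assumed here).
    """
    hex_digest = hashlib.sha1(text.encode("utf-8")).hexdigest()
    return to_base36(int(hex_digest, 16)).rjust(width, "0")
```

Calling `revision_sha1_base36(text)` on the content of a `<text>` element should then give a value you can compare with the `<sha1>` element of the same revision. If it still does not match, check for stripped whitespace or entities that were not unescaped.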

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow