Question

does anyone know how the sha1 sum in wikipedia dumps is build? I just found: "These contain information like the sha1 sum of each revision text..." (http://meta.wikimedia.org/wiki/Data_dumps/Dump_format)

But when I try to calculate the sum of any revision text, I never get the same sum. So I thought maybe there is something more influencing this value. I took all the text between the "text"-tags. Thanks

Was it helpful?

Solution

The sha1sum is converted from an hex- to a base36-number and it is just the revisiontext between the <text></text> -tags. Thanks to MaxSem!

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top