Question

How one can efficiently store the WebGraph in Relational databases such MySQL for playing with algorithms like PageRank? I think of creating two tables: one for URLs where only distinct URLs will be stored and another outgoing links table, for each url store its outgoing URL. Any ideas or any suggestions for efficient storage?

Était-ce utile?

La solution

There are specific databases which where created for such a purpose. Take a look at

Licencié sous: CC-BY-SA avec attribution
Non affilié à StackOverflow
scroll top