Question

How can one efficiently store the web graph in a relational database such as MySQL, in order to experiment with algorithms like PageRank? I am thinking of creating two tables: one for URLs, where each distinct URL is stored only once, and another for outgoing links, which stores for each URL the URLs it links to. Any ideas or suggestions for efficient storage?
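For concreteness, the two-table layout described above could be sketched roughly as follows. This is only an illustration under several assumptions: it uses Python's built-in sqlite3 module as a stand-in for MySQL, the table and column names (urls, links, src, dst) and the helper functions build_graph and pagerank are made up for this example, and the PageRank loop is a plain power iteration that pulls the edge list into memory, which would not scale to a large crawl.

```python
import sqlite3


def build_graph(conn, edges):
    """Create the two tables and load (source_url, target_url) pairs.

    Hypothetical schema: each distinct URL is stored once in `urls`,
    and `links` holds only pairs of integer ids.
    """
    conn.executescript("""
        CREATE TABLE IF NOT EXISTS urls (
            id  INTEGER PRIMARY KEY,
            url TEXT NOT NULL UNIQUE
        );
        CREATE TABLE IF NOT EXISTS links (
            src INTEGER NOT NULL REFERENCES urls(id),
            dst INTEGER NOT NULL REFERENCES urls(id),
            PRIMARY KEY (src, dst)
        );
        -- Secondary index so incoming links can be looked up quickly too.
        CREATE INDEX IF NOT EXISTS links_dst ON links(dst);
    """)
    for src, dst in edges:
        # The UNIQUE constraint plus OR IGNORE keeps URLs distinct.
        conn.execute("INSERT OR IGNORE INTO urls(url) VALUES (?)", (src,))
        conn.execute("INSERT OR IGNORE INTO urls(url) VALUES (?)", (dst,))
        # Duplicate links are dropped by the composite primary key.
        conn.execute(
            "INSERT OR IGNORE INTO links(src, dst) "
            "SELECT s.id, d.id FROM urls s, urls d "
            "WHERE s.url = ? AND d.url = ?",
            (src, dst),
        )
    conn.commit()


def pagerank(conn, damping=0.85, iterations=50):
    """Plain power-iteration PageRank over the stored graph."""
    ids = [row[0] for row in conn.execute("SELECT id FROM urls")]
    n = len(ids)
    if n == 0:
        return {}
    out_degree = dict(
        conn.execute("SELECT src, COUNT(*) FROM links GROUP BY src")
    )
    edges = conn.execute("SELECT src, dst FROM links").fetchall()
    rank = {i: 1.0 / n for i in ids}
    for _ in range(iterations):
        # Pages without outgoing links spread their rank over all pages.
        dangling = sum(rank[i] for i in ids if i not in out_degree)
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {i: base for i in ids}
        for src, dst in edges:
            new_rank[dst] += damping * rank[src] / out_degree[src]
        rank = new_rank
    urls = dict(conn.execute("SELECT id, url FROM urls"))
    return {urls[i]: r for i, r in rank.items()}


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    build_graph(conn, [("a", "b"), ("a", "c"), ("b", "c"), ("c", "a")])
    for url, score in sorted(pagerank(conn).items()):
        print(url, round(score, 4))
```

Storing integer ids rather than full URL strings in the links table keeps each edge row small, which matters because the link table is usually far larger than the URL table. In MySQL itself the same idea would presumably use an AUTO_INCREMENT key and INSERT IGNORE in place of SQLite's INSERT OR IGNORE.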


Solution

There are specific databases that were created for such a purpose. Take a look at

Licensed under: CC-BY-SA with attribution