How one can efficiently store the WebGraph in Relational databases such MySQL for playing with algorithms like PageRank? I think of creating two tables: one for URLs where only distinct URLs will be stored and another outgoing links table, for each url store its outgoing URL. Any ideas or any suggestions for efficient storage?

有帮助吗?

解决方案

There are specific databases which where created for such a purpose. Take a look at

许可以下: CC-BY-SA归因
不隶属于 StackOverflow
scroll top