Question

How can one efficiently store a web graph in a relational database such as MySQL in order to experiment with algorithms like PageRank? I am thinking of creating two tables: one for URLs, where each distinct URL is stored only once, and a second table of outgoing links that records, for each URL, the URLs it links to. Any ideas or suggestions for efficient storage?
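For concreteness, the two-table layout described above could look roughly like the following. This is only a sketch under assumptions: SQLite (via Python's built-in `sqlite3`) stands in for MySQL so that it runs without a server, and the table, column, and function names (`urls`, `links`, `src_id`, `dst_id`, `add_link`, `pagerank`) are made up for illustration. Links are stored as pairs of integer IDs rather than URL strings, and PageRank is computed in memory by plain power iteration.

```python
import sqlite3

# Assumption: SQLite stands in for MySQL. In MySQL the upsert below would be
# spelled "INSERT IGNORE" instead of "INSERT OR IGNORE".
# All table and column names here are hypothetical.
SCHEMA = """
CREATE TABLE urls (
    id  INTEGER PRIMARY KEY,
    url TEXT NOT NULL UNIQUE          -- each distinct URL stored once
);
CREATE TABLE links (
    src_id INTEGER NOT NULL REFERENCES urls(id),
    dst_id INTEGER NOT NULL REFERENCES urls(id),
    PRIMARY KEY (src_id, dst_id)      -- one row per distinct edge
);
CREATE INDEX links_by_dst ON links (dst_id);  -- for incoming-link lookups
"""

def open_graph():
    """Create an empty in-memory graph database."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn

def url_id(conn, url):
    """Return the integer id for a URL, inserting it if it is new."""
    conn.execute("INSERT OR IGNORE INTO urls (url) VALUES (?)", (url,))
    row = conn.execute("SELECT id FROM urls WHERE url = ?", (url,)).fetchone()
    return row[0]

def add_link(conn, src, dst):
    """Record the edge src -> dst; duplicate edges are ignored."""
    conn.execute(
        "INSERT OR IGNORE INTO links (src_id, dst_id) VALUES (?, ?)",
        (url_id(conn, src), url_id(conn, dst)),
    )

def pagerank(conn, damping=0.85, iterations=50):
    """Power-iteration PageRank over the stored graph, keyed by URL."""
    ids = [row[0] for row in conn.execute("SELECT id FROM urls")]
    n = len(ids)
    if n == 0:
        return {}
    out = {i: [] for i in ids}
    for src, dst in conn.execute("SELECT src_id, dst_id FROM links"):
        out[src].append(dst)
    rank = {i: 1.0 / n for i in ids}
    for _ in range(iterations):
        # Rank held by pages without outgoing links is spread evenly.
        dangling = sum(rank[i] for i in ids if not out[i])
        base = (1.0 - damping) / n + damping * dangling / n
        nxt = {i: base for i in ids}
        for src in ids:
            if out[src]:
                share = damping * rank[src] / len(out[src])
                for dst in out[src]:
                    nxt[dst] += share
        rank = nxt
    names = dict(conn.execute("SELECT id, url FROM urls"))
    return {names[i]: r for i, r in rank.items()}
```

Keeping the edges as two integer columns keeps the `links` rows small, and the composite primary key both deduplicates edges and serves as the index for "outgoing links of page X" queries. For graphs too large to load into memory, the iteration would have to be pushed into SQL or batched, which this sketch does not attempt.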


Solution

There are specific databases that were created for this purpose. Take a look at

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow