Sample Database for Full Text Searching
01-10-2019
Question
I want to benchmark full-text search indexes in PostgreSQL, SQL Server and Lucene.
Any ideas on where to find a good, large sample database to run queries against?
Thanks a lot in advance.
Solution
I think a great source would be Wikipedia's database dumps, since they contain a very large amount of text. They are available here: http://dumps.wikimedia.org/
You could also try a Usenet archive, but there it is harder to pick a target language, and the quality of the language used is also lower.
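To get a Wikipedia dump into any of the three engines, the article text first has to be pulled out of the XML export. Below is a minimal sketch of that step, assuming the usual MediaWiki export layout (a `<page>` element containing a `<title>` and a `<revision>` with a `<text>` child). The helper names `iter_pages` and `open_dump` are made up for illustration, and the namespace handling is deliberately loose so that it does not depend on a particular export schema version.

```python
import bz2
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(stream):
    """Yield (title, text) for each <page> in a MediaWiki XML export.

    Assumes each page has one <title> and that the last <text> element
    seen inside the page is the revision body we want.
    """
    title, text = None, None
    # iterparse streams the file, so the whole dump never sits in memory.
    for _event, elem in ET.iterparse(stream, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text or ""
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            yield title, text
            title, text = None, None
            elem.clear()  # drop the page's children once it has been emitted


def open_dump(path):
    """Open a dump file in binary mode, decompressing .bz2 on the fly."""
    if path.endswith(".bz2"):
        return bz2.open(path, "rb")
    return open(path, "rb")
```

With a downloaded dump, something like `for title, text in iter_pages(open_dump(path)):` would give you rows that can be inserted into a PostgreSQL or SQL Server table, or added to a Lucene index as documents. The text is raw wiki markup, so you may want to strip the markup before indexing if it would skew the benchmark.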
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow