Question

I want to get data from tables in web pages that appear to have been published with Excel's "Publish as Web Page" wizard. I know there are many scraping tools (which seem far too advanced for such a task), and I know I could write a script, but I'd like to be pointed to something more straightforward, especially since the data is published by an automated tool.

I would prefer a Rails solution.

The data is statistical health data, here are links to some samples.


Solution

Yes, the structure is really simple. In your place I'd just use any language you know and not spend time learning third-party scrapers. Personally, I use C# for such tasks.
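To illustrate how little code a simple table needs, here is a minimal sketch in Python using only the standard library. Python is just an illustration here; the same approach should port to Ruby or C#. The names `TableExtractor` and `extract_tables` and the sample HTML are made up for this example, and the sketch assumes the pages close their `<td>` and `<tr>` tags and do not nest tables, which may not hold for every exported page.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by row and by table.

    Hypothetical helper for illustration; assumes closed cell/row tags
    and no nested tables.
    """

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.tables = []    # each table is a list of rows (lists of strings)
        self._row = None    # cells of the row being read, or None
        self._cell = None   # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text outside a cell (whitespace between tags, captions) is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join fragments and collapse runs of whitespace to single spaces.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None


def extract_tables(html):
    """Return all tables found in the HTML string as nested lists."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    return parser.tables


if __name__ == "__main__":
    # Invented sample data, not taken from the pages in the question.
    sample = """
    <table>
      <tr><th>Region</th><th>Cases</th></tr>
      <tr><td>North</td><td>12</td></tr>
      <tr><td>South</td><td>7</td></tr>
    </table>
    """
    for row in extract_tables(sample)[0]:
        print(row)
```

Each table comes back as a list of rows, so writing the result to CSV or loading it into a database is a short follow-up step.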

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow