Question

I want to get data from tables in web pages, which seem to be published using "Excel Publish as Web Page Wizard". I'm aware there are a lot of scraping tools (seem much advanced for such a task) and aware I can write a script, but I'd like to be pointed to something much straightforward, especially that the data is published using an automated tool.

I prefer to have a rails solution.

The data is statistical health data, here are links to some samples.

Était-ce utile?

La solution

Yes, the structure is really simple. On your place I'd just use any language you know and don't spend time for learning third party scrapers. Personally, I use C# for such tasks.

Licencié sous: CC-BY-SA avec attribution
Non affilié à StackOverflow
scroll top