I want to get data from tables in web pages, which seem to be published using "Excel Publish as Web Page Wizard". I'm aware there are a lot of scraping tools (seem much advanced for such a task) and aware I can write a script, but I'd like to be pointed to something much straightforward, especially that the data is published using an automated tool.

I prefer to have a rails solution.

The data is statistical health data, here are links to some samples.

有帮助吗?

解决方案

Yes, the structure is really simple. On your place I'd just use any language you know and don't spend time for learning third party scrapers. Personally, I use C# for such tasks.

许可以下: CC-BY-SA归因
不隶属于 StackOverflow
scroll top