Does ScraperWiki somehow automatically rate limit scraping, or should I add something like sleep(1 * random.random()) to the loop?


Solution

There is no automatic rate limiting. You can add a sleep call in your scraper's language to throttle your own requests.
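
For example, here is a minimal sketch of a throttled scraping loop in Python, in the spirit of the `sleep(1 * random.random())` idea from the question. The URL list and the parsing step are placeholders; adjust the sleep range to whatever is polite for the target site:

```python
import random
import time
import urllib.request

# Hypothetical list of pages to scrape; replace with your real targets.
urls = ["http://example.com/page%d" % i for i in range(1, 6)]

for url in urls:
    html = urllib.request.urlopen(url).read()
    # ... parse and store `html` here ...

    # Pause 0.5-1.5 seconds between requests so the remote
    # server isn't hammered by back-to-back fetches.
    time.sleep(0.5 + random.random())
```

Randomizing the delay also avoids sending requests at a perfectly regular interval, which some sites flag as bot traffic.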

Very few servers enforce rate limits, and servers hosting public data usually don't.

It is, however, good practice to make sure you don't overrun the remote server. By default, scrapers run in a single thread, so there is a built-in limit to the load you can generate.
