Question

I have a project idea for which I want to mine publicly available data from another website, data that the site itself collected through crowd-sourcing, so that I have initial data for my own project. In other words, I want to write a robot that grabs data displayed on another website and use that data on my own website. Does anyone know the legality of this sort of thing? Does the original website own the data that was given to it by a crowd? Even if it does, can I use it?

Solution

Web scraping is a legally complicated issue. In practice, the hassle of legal action and the difficulty of enforcement often keep scrapers from getting in trouble. Outright duplication is considered actionable, although US courts have ruled that "duplication of facts" is permitted.

I advise you to read up here: http://en.wikipedia.org/wiki/Web_scraping#Legal_issues

Best,

OTHER TIPS

Legally, you should be fine as long as the data is made available, the people who contributed it have consented, you aren't hacking, and the other site has permission to share it. Check for a license on the other site; if there isn't one, ask, or be prepared for access to be denied at some point. And just because the data is publicly available doesn't mean the other site wants it to be.

Also, double-check that you don't inadvertently publish private data as well.
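
One way to guard against publishing private data by accident is to scrub each scraped record before it reaches your own site. The following is only a rough sketch: the field names in `PRIVATE_FIELDS`, the two regular expressions, and the `scrub_record` helper are all assumptions made up for illustration, and real private data takes many more forms than these patterns catch.

```python
import re

# Illustrative patterns only; they will miss many kinds of private data.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")

# Hypothetical field names that should never be republished.
PRIVATE_FIELDS = {"email", "phone", "ip_address"}


def scrub_record(record):
    """Return a copy of record with private fields dropped and contact details masked."""
    cleaned = {}
    for key, value in record.items():
        if key in PRIVATE_FIELDS:
            # Drop the whole field rather than trying to sanitize it.
            continue
        if isinstance(value, str):
            # Mask email addresses and phone-like numbers embedded in free text.
            value = EMAIL_RE.sub("[removed]", value)
            value = PHONE_RE.sub("[removed]", value)
        cleaned[key] = value
    return cleaned
```

A filter like this does not replace a manual review of what you publish; it just catches the obvious cases before they go live.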

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow