
Via Google Analytics I noticed that there is a website which is scraping my content automatically. Its content matches mine 100%. Is there any way I could block that website's host from accessing my server at all? Are there any solutions for this? I'm running a LAMP web host on CentOS.


If the IP address of the scraping host is static, you can use .htaccess to block this IP, like:

order allow,deny
deny from
allow from all
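
The snippet above leaves out the address after `deny from`, and the `Order`/`Deny`/`Allow` directives belong to Apache 2.2 (in 2.4 they are only available through mod_access_compat). As a rough sketch, assuming Apache 2.4 (which `httpd -v` will confirm) and using `203.0.113.45` purely as a placeholder address, the same rule could look like this:

```apache
# Sketch for Apache 2.4+, assuming .htaccess overrides (AllowOverride AuthConfig) are enabled.
# 203.0.113.45 is a placeholder from the documentation range, not the scraper's real IP.
<RequireAll>
    # Allow everyone by default...
    Require all granted
    # ...except requests coming from the scraper's address.
    Require not ip 203.0.113.45
</RequireAll>
```

Replace the placeholder with the address that shows up in your access log for the scraper's requests.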

If the IP address is variable, but the user agent is constant, you can use agent blocking:

BrowserMatchNoCase SpammerRobot bad_bot
BrowserMatchNoCase SecurityHoleRobot bad_bot
Order Deny,Allow
Deny from env=bad_bot
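
On Apache 2.4 the same idea can probably be expressed with `Require` instead of `Order`/`Deny`. This is a minimal sketch, assuming mod_setenvif and mod_authz_core are loaded; `SpammerRobot` and `SecurityHoleRobot` are the example agent names from above, not the real scraper's user agent:

```apache
# Tag requests whose User-Agent contains these patterns (case-insensitive).
BrowserMatchNoCase SpammerRobot bad_bot
BrowserMatchNoCase SecurityHoleRobot bad_bot

<RequireAll>
    # Allow everyone by default...
    Require all granted
    # ...except requests that were tagged with the bad_bot variable.
    Require not env bad_bot
</RequireAll>
```

Keep in mind that a user agent string is trivial to change, so this only stops scrapers that do not bother to disguise themselves.
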
License: CC-BY-SA with attribution