How can I identify ad links from a website? I am doing a research on malvertising. As a part of that, I need to extract all the advertisement urls from the website. How can I do that?

有帮助吗?

解决方案

(Of course it’s impossible to correctly identify all URLs.)

You could make use of the filter lists of various ad filtering tools. They typically contain absolute URLs (submitted by the community) and strings that often appear in such URLs.

For example, AdBlock Plus hosts some filter lists.

Example from EasyList (big text file):

  • &adbannerid=
  • .com/js/adsense
  • /2013/ads/*
  • /60x468.
  • /ad-rotator-
许可以下: CC-BY-SA归因
不隶属于 StackOverflow
scroll top