Question

How can I identify ad links from a website? I am doing a research on malvertising. As a part of that, I need to extract all the advertisement urls from the website. How can I do that?

Was it helpful?

Solution

(Of course it’s impossible to correctly identify all URLs.)

You could make use of the filter lists of various ad filtering tools. They typically contain absolute URLs (submitted by the community) and strings that often appear in such URLs.

For example, AdBlock Plus hosts some filter lists.

Example from EasyList (big text file):

  • &adbannerid=
  • .com/js/adsense
  • /2013/ads/*
  • /60x468.
  • /ad-rotator-
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top