Frage

How can I identify ad links from a website? I am doing a research on malvertising. As a part of that, I need to extract all the advertisement urls from the website. How can I do that?

War es hilfreich?

Lösung

(Of course it’s impossible to correctly identify all URLs.)

You could make use of the filter lists of various ad filtering tools. They typically contain absolute URLs (submitted by the community) and strings that often appear in such URLs.

For example, AdBlock Plus hosts some filter lists.

Example from EasyList (big text file):

  • &adbannerid=
  • .com/js/adsense
  • /2013/ads/*
  • /60x468.
  • /ad-rotator-
Lizenziert unter: CC-BY-SA mit Zuschreibung
Nicht verbunden mit StackOverflow
scroll top