If you are trying to restrict the crawler to only urls with the same domains as the seed urls, then:
Extract the domain names from the seed URLs.
Write your crawler class (that extends
WebCrawler
) with ashouldVisit
method to filter out any URLs whose domains are not in the set.Configure the controller, add the seeds and start it in the normal way ... as per the example here.