Question

I have a list of web URLS need to be crawl. Is that possible to crawl only the list of webpage s with out crawling it deep. If i add the url as seed it crawls full website with full depth.

Was it helpful?

Solution

To only crawl the pages which you added as a seed, set the MaxDepthOfCrawling to 0.

CrawlConfig config = new CrawlConfig();
config.setMaxDepthOfCrawling(0);
PageFetcher pageFetcher = new PageFetcher(config);
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top