Question

I am learning nutch and trying to carawl as per this tutorial .I am working on an ubuntu machinewith bash shell. But when I run the script, the execution happens, but nothing happens after ,

InjectorJob: starting at 2014-03-23 09:28:50
InjectorJob: Injecting urlDir: urls/seed.txt

I have waited for hours, I tried running the same with sudo. The same issue occurs. I have tried with default urls given in the tutorial as well. What can be the probable errors?

Was it helpful?

Solution

What was missing was I didnt add Proxy and port details in the nutch-site.xml, as I was accessing through proxy. setting up the same for Ant or JVM is not enough

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top