Question

Is there anyway i can install the only impala without cloudera manager and without cdh. I will be using the apache version of hadoop?

Was it helpful?

Solution

Yes, it is absolutely possible. Add the repository into your sources.list file and update the repository after that.

deb [arch=amd64] http://archive.cloudera.com/impala/ubuntu/precise/amd64/impala precise-impala1 contrib deb-src http://archive.cloudera.com/impala/ubuntu/precise/amd64/impala precise-impala1 contrib

After that, it's merely :

sudo apt-get install impala                (Binaries for daemons)
sudo apt-get install impala-server         (Service start/stop script)
sudo apt-get install impala-state-store    (Service start/stop script)

But do not forget to meet all the prerequisites. For a detailed info you can go here

OTHER TIPS

You can view detailed instructions on how to install and use Impala with Amazon EMR here: http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/emr-impala.html

EMR is based on a Amazon Hadoop distribution that runs on top of Debian squeeze. So, yes it's possible using Cloudera's DEB repo.

You will need to SSH to your EMR master node, find the address on EMR console.

You will also need to enable security rules on the security group you have assigned to your EMR cluster, if you intend to connect to Impala using a JDBC/ODBC client form the outside world.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top