Question

I created a simple four-node Hadoop cluster with CDH 4.7, including Impala 1.1. I'm able to copy CSV files to HDFS and create and query Impala tables over them as described in the tutorial. But I can't query the same table from a different data node:

[example.com:21000] > select * from tab1;
Query: select * from tab1
ERROR: AnalysisException: Table does not exist: default.tab1

I thought perhaps I needed to reissue the CREATE TABLE statement on the second node, but then it suddenly knows the table's there:

[example.com:21000] > CREATE EXTERNAL TABLE tab1
                    > (
                    >    id INT,
                    >    col_1 BOOLEAN,
                    >    col_2 DOUBLE,
                    >    col_3 TIMESTAMP
                    > )
                    > ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
                    > LOCATION '/user/dwheeler/sample_data/tab1';
Query: create EXTERNAL TABLE tab1
(
id INT,
col_1 BOOLEAN,
col_2 DOUBLE,
col_3 TIMESTAMP
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
LOCATION '/user/dwheeler/sample_data/tab1'
ERROR: AlreadyExistsException: Table tab1 already exists

So Impala knows the table is there, but I can't query it, or even refresh it:

[example.com:21000] > refresh tab1;
Query: refresh tab1
ERROR: AnalysisException: Table does not exist: default.tab1

Is there some command I need to execute to get all of the impalads running on the data nodes to recognize a newly created table so I can query it?

Solution

I filed a bug report and got back an answer:

In Impala 1.1 and earlier you need to issue an explicit "invalidate metadata" command to make tables created on other nodes visible to the local Impala daemon.

Starting with Impala 1.2 this won't be necessary; the new catalog service will take care of metadata distribution to all impalads in the cluster.

So it was INVALIDATE METADATA that I had failed to notice. Glad to hear it won't be necessary in 1.2.
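
For anyone stuck on 1.1, the fix looks like this from impala-shell on the node that couldn't see the table (a minimal sketch reusing the table from my session above):

[example.com:21000] > invalidate metadata;
[example.com:21000] > select * from tab1;

INVALIDATE METADATA forces the local impalad to reload the catalog from the metastore, after which tab1 resolves and the query runs.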

OTHER TIPS

I had what I thought was the same issue, but it wasn't resolved by

invalidate metadata;

It turned out that my Hive installation was using a local Derby database as its metastore, which Impala could not see.

The smoking gun:

On the system where I had imported the table through Hive, I had:

cat /etc/hive/conf/hive-site.xml
[...]
<property>
    <name>javax.jdo.option.ConnectionURL</name>
    <value>jdbc:derby:;databaseName=/var/lib/hive/metastore/metastore_db;create=true</value>
    <description>JDBC connect string for a JDBC metastore</description>
</property>
[...]
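
A quick way to spot this (assuming the stock CDH config path) is to grep for the connect string:

grep -A 1 'javax.jdo.option.ConnectionURL' /etc/hive/conf/hive-site.xml

A jdbc:derby: value means Hive is talking to an embedded metastore that Impala cannot see.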

The solution:

I re-deployed the Hive client configuration from Cloudera Manager.

Afterwards:

  cat /etc/hive/conf/hive-site.xml
  [...]
  <property>
    <name>hive.metastore.local</name>
    <value>false</value>
  </property>
  <property>
    <name>hive.metastore.uris</name>
    <value>thrift://[snipped-host-name]:[snipped-port]</value>
  </property>

Apparently Cloudera Manager is supposed to deploy the client configuration automatically, but some versions occasionally fail to do so.
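
After redeploying, you can confirm the fix from impala-shell by reloading the metadata and listing the tables (a minimal check, assuming the table was created through Hive as above):

[example.com:21000] > invalidate metadata;
[example.com:21000] > show tables;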
