Question

I've read a number of articles and forum posts on the placement of indexes/shards, but have not yet found a solution to my requirement.

Fundamentally, I want to use Logstash (plus Elasticsearch/Kibana) to build a globally distributed cluster. To reduce WAN traffic, I want to restrict the placement of primary and replica shards to the region they were created in, while still being able to query all data as a single dataset.

Example

Let's say I have two ES nodes in the UK (uknode1/uknode2) and two in the US (usnode1/usnode2). If Logstash sends some data to usnode1, I want it to place the replica on usnode2, and not send it across the WAN to the uknode* nodes.

I've tried playing around with index and routing allocation settings, but cannot stop the shards being distributed across all 4 nodes. It's slightly complicated by the fact that index names are dynamically built based on the "type", but that's another challenge for a later date. Even with one index, I can't work it out.
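For reference, this is the kind of thing I've been experimenting with: a sketch using shard allocation filtering, assuming a custom "region" node attribute (the attribute name and the index name are just my own illustrative choices).

On each US node's elasticsearch.yml (uknode* would use region: uk):

node.region: us

Then the filter has to be applied per index, which is awkward when Logstash creates index names dynamically:

curl -XPUT 'http://usnode1:9200/logstash-2015.01.01/_settings' -d '
{
  "index.routing.allocation.require.region": "us"
}'

This pins that one index entirely to the US nodes, but it doesn't give me one index with region-local copies on both sides, which is what I'm really after.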

I could split this into two separate clusters, but I want to be able to query all nodes as a single dataset (via Kibana), so I don't think that is a valid option at this stage, as Kibana can only query one cluster.

Is this even possible to achieve?

The reason I ask whether this is even possible is that I don't know what would happen if I wrote to an index called "myTest" on a UK node and to the same index on a US node. Ultimately it's the same index, and I'm not sure how ES would handle this.

So if anyone has any suggestions, or can simply confirm "not possible", that would be very helpful.


Solution

It's possible, but not recommended. Elasticsearch needs a reliable data connection between the nodes in a cluster to function, which is difficult to ensure for a geographically distributed cluster. A better solution would be to have two clusters, one in the UK and another in the US. If you need to search both of them at the same time, you can use a tribe node.

OTHER TIPS

Thanks. I looked into this a bit more and have the solution, which is indeed to use tribe nodes.

For anyone who isn't familiar with them, tribe nodes are a feature introduced in ES 1.0.0.

What you do is allocate a new ES node as a tribe node and configure it to connect to all your other clusters. When you run a query against it, it queries all of the clusters and returns a consolidated set of results.

So in my scenario, I have two distinct clusters, one in each region, something like this.

US Region

cluster.name: us-region

Two nodes in this region called usnode1 and usnode2

Both nodes are master/data nodes

UK Region

cluster.name: uk-region

Two nodes in this region called uknode1 and uknode2

Both nodes are master/data nodes
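As a sketch, each regional node's elasticsearch.yml then looks something like this (shown for usnode1; the uknode* config is identical apart from cluster.name: uk-region and the host names, and the unicast discovery list is my assumption, since any discovery mechanism would do):

cluster.name: us-region
node.name: usnode1
node.master: true
node.data: true
discovery.zen.ping.unicast.hosts: ["usnode1", "usnode2"]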

Then you create another ES node and add some configuration to make it a tribe node.

Edit elasticsearch.yml with something like this :

node.data: false
node.master: false
tribe.blocks.write: false
tribe.blocks.metadata: false
tribe.t1.cluster.name: us-region
tribe.t1.discovery.zen.ping.unicast.hosts: ["usnode1","usnode2"]
tribe.t2.cluster.name: uk-region
tribe.t2.discovery.zen.ping.unicast.hosts: ["uknode1","uknode2"]

You then point Kibana at the tribe node, and it works brilliantly: an excellent feature.
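To illustrate, any search sent to the tribe node fans out to both clusters, so something like this (the "tribenode" hostname and the index pattern are just placeholders for my setup) returns hits from both regions in a single response:

curl -XGET 'http://tribenode:9200/logstash-*/_search?pretty'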

Kibana dashboards still save, although I'm not sure yet how it picks which cluster to save them to. It seems to address my question, so with a bit more playing I think I'll have it sorted.

Licensed under: CC-BY-SA with attribution