Control data locality in Impala by partitioning

https://stackoverflow.com/questions/21797968

12-10-2022
|

Question

I would like to avoid Impala nodes unnecessarily requesting data from other nodes over the network in cases when the ideal data locality or layout is known at table creation time. This would be helpful with 'non-additive' operations where all records from a partition are needed at the same place (node) anyway (for ex. percentiles).

Is it possible to tell Impala that all data in a partition should always be co-located on a single node for any HDFS replica?

In Impala-SQL, I am not sure if the "PARTITIONED BY" clause provide this feature. In my understanding, Impala chunks its partitions into separate files on HDFS but HDFS does not guarantee the co-location of related files nor blocks by default (rather tries to achieve the opposite).

Found some information about Impala's impact on HDFS development but not clear if these are already implemented or still in plans:

http://www.slideshare.net/deview/aaron-myers-hdfs-impala (slides 23-24)

Thank you in advance for all.

Solution

About the slides you mention ("Co-located block replicas") - it's about an HDFS feature (HDFS-2576) implemented in Hadoop 2.1. It provides a Java API to give hints to HDFS as to where the blocks should be placed.

It's not used in Impala as of 2014, but it definitely seems like building some groundwork for that - as it would give Impala a performance equivalent of specifying distribution key in traditional MPP databases.

OTHER TIPS

No, that completely defeats the purpose of having a distributed file system and MPP computing. It also creates a single point of failure and a bottleneck especially if you're talking about a 250GB table that is joined to itself. Exactly the kind of problems that Hadoop was designed to solve. Partitioning data creates sub-directories in HDFS on the namenode and that data is then replicated throughout the datanodes in the cluster.

Licensed under: CC-BY-SA with attribution

Not affiliated with StackOverflow