Question

I am considering HDInsight with Hive and data loaded on Azure Blob Storage.
There is a combination of both historic and changing data.
Does the solution mentioned in Update , SET option in Hive work with blob storage too?
The below Hive statement change the data in blob storage which is my requirement too?

INSERT OVERWRITE TABLE _tableName_ PARTITION ...
Was it helpful?

Solution

INSERT OVERWRITE will write new file(s) into the cluster file system. In HDInsight the file system is backed by Azure blobs, the wasb://... and wasb:///... names. Everything Hive does to the cluster file system, like overwriting them, will accordingly be reflected in the Azure storage BLOBs. See Use Hive with HDInsight for more details.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top