I have a requirement to copy files from HDFS to local. Now, I have 2 options . 1) Either use Hadoop Native Java API (FileSystem) or 2) Use WebHDFS [ I don;t have any issues with enabling it to my cluster]

Can someone let me know what is preferred option and why?

有帮助吗?

解决方案

If you are using Java, I recommend the native Java APIs as it's more flexible and give you more control.

However, WebHDFS is better if you don't want to involve dozens of libraries required by Hadoop. It decouples your application and Hadoop. Of course, you need pay a little performance cost due to HTTP.

许可以下: CC-BY-SA归因
不隶属于 StackOverflow
scroll top