문제

I have a requirement to copy files from HDFS to local. Now, I have 2 options . 1) Either use Hadoop Native Java API (FileSystem) or 2) Use WebHDFS [ I don;t have any issues with enabling it to my cluster]

Can someone let me know what is preferred option and why?

도움이 되었습니까?

해결책

If you are using Java, I recommend the native Java APIs as it's more flexible and give you more control.

However, WebHDFS is better if you don't want to involve dozens of libraries required by Hadoop. It decouples your application and Hadoop. Of course, you need pay a little performance cost due to HTTP.

라이센스 : CC-BY-SA ~와 함께 속성
제휴하지 않습니다 StackOverflow
scroll top