質問

I want to chain 3 rounds of MapReduce and at the third one to be able to read the original input as well as the output of the second job. Is this at all possible?

役に立ちましたか?

解決

You could setup the last Job to use 2 mappers one of which can have the original file as the input. Assuming you would need to reduce both the inputs (input of first job & output of second job) on some common key. MultipleInputs

ライセンス: CC-BY-SA帰属
所属していません StackOverflow
scroll top