Question

I've built a mapper and a reducer in Ruby, and they run successfully as a streaming job. However, I need to run a second map and reduce on the output of the previous reduce.

Is there any way to define multiple Ruby files for mappers and reducers in a single streaming job, i.e. to chain them?


Solution

No.

You can chain two streaming jobs, though: run them one after the other and use the output directory of the first as the input directory of the second.
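As a rough illustration of that chaining, here is a minimal Ruby driver sketch. It assumes the standard `hadoop jar <streaming jar>` invocation with the `-files`, `-input`, `-output`, `-mapper` and `-reducer` options. The jar location, the `HADOOP_STREAMING_JAR` environment variable, the script names and the directory names are all placeholders, not anything taken from the question. The helper names (`streaming_command`, `chain_commands`, `run_chain`) are made up for this example.

```ruby
# Assumption: the streaming jar location varies by installation, so it is
# read from a (hypothetical) environment variable with a placeholder default.
STREAMING_JAR = ENV.fetch("HADOOP_STREAMING_JAR", "hadoop-streaming.jar")

# Build the argument list for one streaming job.
# Mapper and reducer are assumed to be script files in the current directory.
def streaming_command(input:, output:, mapper:, reducer:)
  [
    "hadoop", "jar", STREAMING_JAR,
    "-files", "#{mapper},#{reducer}", # ship both scripts with the job
    "-input", input,
    "-output", output,
    "-mapper", "ruby #{mapper}",
    "-reducer", "ruby #{reducer}"
  ]
end

# Build one command per stage. Each stage reads the output directory
# written by the stage before it; the first stage reads first_input.
def chain_commands(first_input, stages)
  input = first_input
  stages.map do |stage|
    cmd = streaming_command(input: input, output: stage[:output],
                            mapper: stage[:mapper], reducer: stage[:reducer])
    input = stage[:output]
    cmd
  end
end

# Run the commands in order and stop at the first one that fails.
# The runner is injectable so the chain can be checked without a cluster.
def run_chain(commands, runner: ->(cmd) { system(*cmd) })
  commands.all? { |cmd| runner.call(cmd) }
end
```

With two stages such as `{ mapper: "map1.rb", reducer: "reduce1.rb", output: "out/stage1" }` and `{ mapper: "map2.rb", reducer: "reduce2.rb", output: "out/stage2" }`, calling `run_chain(chain_commands("in/data", stages))` would launch the second job only if the first one succeeded, with `out/stage1` as its input. Note that a streaming job refuses to start if its output directory already exists, so intermediate directories need to be cleaned up between runs.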

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow