Pergunta

I've built a mapper and a reducer in Ruby and it runs successfully as a streaming job. However, I need to do a second map and reduce based on output of the last reduce.

Is there any way I can define multiple Ruby files for mappers and reducers in my Streaming job? Like chaining.

Foi útil?

Solução

No.

You can chain two streaming jobs, though, and just use the output directory from the first as the input directory for the second.

Licenciado em: CC-BY-SA com atribuição
Não afiliado a StackOverflow
scroll top