When using one of the aggregator function in place of a reducer, will Hadoop be smart enough to use same function for combiner phase?

有帮助吗?

解决方案

They fulfill partly the same purpose but the aggregator is more generic and can be used in cases where the combiner can not.

So it's a definite no, the aggregator will not be used as a combiner automatically. If you want it to be used as a combiner you will have to specify it as such.

Quote from cascade, "Combiners are limited to Associative and Commutative functions only, like 'sum' and 'max'. And in order to work, values emitted from the Map task must be serialized, sorted (deserialized and compared), deserialized again and operated on"

许可以下: CC-BY-SA归因
不隶属于 StackOverflow
scroll top