Question

I want to share some data between all the splits in the Hadoop framework. More specifically, I have a file that contains a lot of terms I want to search for, and I need to write out how many times each term appears in each document. The problem is that when a term does not appear in a given split, I still need to return 0 for it, but I can't find a way to pass the list of search terms to all the nodes that process the splits. Can anybody give me some ideas?

Solution

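Generally, the DistributedCache is the way to share read-only data, such as your terms file, across nodes. However, since that class is deprecated in Hadoop 2.x, check this answer for the replacement. In short, you register the file with Job.addCacheFile() in the driver and read it back through context.getCacheFiles() in the mapper's setup() method.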
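
Once every mapper can read the terms file, the "return 0" part comes down to seeding the counts with zero for every term before looking at the split. Here is a minimal sketch of that logic in plain Java, with no Hadoop dependency so it can be run on its own. The class TermCounter and its methods are hypothetical names made up for illustration, and the tokenization (lowercase, split on non-word characters) is an assumption. In a real job the term list would be loaded in setup() from the cached file and the counts emitted from the mapper.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper class, not part of Hadoop.
class TermCounter {

    // Start every term at 0 so terms that never occur are still reported.
    // In a real Mapper, the terms would be read in setup() from the file
    // registered with Job.addCacheFile(...) in the driver.
    static Map<String, Integer> zeroedCounts(List<String> terms) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String term : terms) {
            counts.put(term, 0);
        }
        return counts;
    }

    // Count occurrences of each (lowercase) term in the text of one split.
    // Assumption: tokens are lowercased and separated by non-word characters.
    static Map<String, Integer> countTerms(List<String> terms, String splitText) {
        Map<String, Integer> counts = zeroedCounts(terms);
        for (String token : splitText.toLowerCase().split("\\W+")) {
            if (counts.containsKey(token)) {
                counts.merge(token, 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        List<String> terms = Arrays.asList("hadoop", "spark", "split");
        String text = "Hadoop splits the input; each split goes to one mapper.";
        // Prints one line per term, including terms with a count of 0.
        countTerms(terms, text).forEach((term, n) -> System.out.println(term + "\t" + n));
    }
}
```

Because the map is filled from the term list rather than from the text, a term that is missing from a split still comes out with a count of 0 instead of being silently dropped.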

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow