Вопрос

I want to share some data between all the splits in the hadoop framework, more specifically, I have a file that contains alot of terms that I’m concerned to search for, and write how many times appear in each document, but the problem is that in case the word does not appear in some splits, i need to return 0 for this file, but the problem that I cant pass the terms I’m searching for to all the nodes in splits, can anybody give me some idea

Это было полезно?

Решение

Generally, the DistributedCache is the way to share data across nodes. However since it is deprecated, check this answer.

Лицензировано под: CC-BY-SA с атрибуция
Не связан с StackOverflow
scroll top