質問

I want to share some data between all the splits in the hadoop framework, more specifically, I have a file that contains alot of terms that I’m concerned to search for, and write how many times appear in each document, but the problem is that in case the word does not appear in some splits, i need to return 0 for this file, but the problem that I cant pass the terms I’m searching for to all the nodes in splits, can anybody give me some idea

役に立ちましたか?

解決

Generally, the DistributedCache is the way to share data across nodes. However since it is deprecated, check this answer.

ライセンス: CC-BY-SA帰属
所属していません StackOverflow
scroll top