質問

I am trying to build a recommendation engine, for that I am thinking of using apache mahout but I am unable to make out if mahout process the data in real time or does it pre-process the data when the server is idle and store the results somewhere in the database.

Also does anyone have any idea what approach do sites like amazon,netflix follow?

役に立ちましたか?

解決

Either/or, but not both. There are parts inside from an older project that are essentially real time for moderate scale. There are also Hadoop based implementations which are all offline. The two are not related.

I am a primary creator of these parts, and if you want a system that does both together, I suggest you look at my current project Myrrix (http://myrrix.com)

ライセンス: CC-BY-SA帰属
所属していません StackOverflow
scroll top