Question

The goal of Mahout, which is Java-based, is to build scalable machine learning libraries. Are there any equivalent libraries in Python?


Solution

scikit-learn is highly recommended: http://scikit-learn.sourceforge.net/
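To give a feel for the library, here is a minimal sketch of training and scoring a classifier with scikit-learn. The dataset (the bundled iris data), the choice of logistic regression, and the 75/25 split are assumptions made for illustration only; none of them come from the answer above.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Load a small bundled dataset (illustrative choice, not from the answer).
X, y = load_iris(return_X_y=True)

# Hold out a quarter of the samples for evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Fit a simple linear classifier; max_iter is raised so the solver converges.
model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)

# Mean accuracy on the held-out samples.
accuracy = model.score(X_test, y_test)
print("held-out accuracy:", accuracy)
```

Most scikit-learn estimators follow the same `fit` / `predict` / `score` pattern, so swapping in another model usually means changing only the line that constructs it.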

OTHER TIPS

Spark MLlib is recommended. It is a scalable machine learning library that can read data from HDFS and, of course, runs on top of Spark.

You can access it via PySpark (see the Programming Guide's Python examples).

Orange is reportedly pretty decent, though I've never used it personally. PyML might be worth a look as well, and so might Monte.

pysuggest is a Python wrapper for SUGGEST, a Top-N recommendation engine that implements a variety of recommendation algorithms for collaborative filtering.

An interesting library is crab.

As of this post, the library only has stable implementations for collaborative filtering algorithms: user-based and item-based.

An SVD implementation is included, but it is experimental. Content-based algorithms are on the roadmap.

Do check it out!
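For readers unfamiliar with the user-based collaborative filtering mentioned above, the idea can be sketched in plain Python. This is not crab's API: the `ratings` data and the `cosine_similarity`, `predict`, and `recommend` helpers are hypothetical names invented for this illustration, and cosine similarity over co-rated items is just one common choice of similarity measure.

```python
from math import sqrt

# Hypothetical toy data: user -> {item: rating}.
ratings = {
    "alice": {"A": 5.0, "B": 3.0, "C": 4.0},
    "bob": {"A": 5.0, "B": 3.0, "C": 4.0, "D": 5.0},
    "carol": {"A": 1.0, "B": 5.0, "D": 2.0},
}


def cosine_similarity(a, b):
    """Cosine similarity computed over the items both users have rated."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[i] * b[i] for i in shared)
    norm_a = sqrt(sum(a[i] ** 2 for i in shared))
    norm_b = sqrt(sum(b[i] ** 2 for i in shared))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def predict(ratings, user, item):
    """Similarity-weighted average of other users' ratings for `item`."""
    num = 0.0
    den = 0.0
    for other, their_ratings in ratings.items():
        if other == user or item not in their_ratings:
            continue
        sim = cosine_similarity(ratings[user], their_ratings)
        num += sim * their_ratings[item]
        den += abs(sim)
    return num / den if den else None


def recommend(ratings, user, n=3):
    """Top-n items the user has not rated, ordered by predicted rating."""
    all_items = {item for r in ratings.values() for item in r}
    candidates = all_items - set(ratings[user])
    scored = []
    for item in candidates:
        score = predict(ratings, user, item)
        if score is not None:
            scored.append((score, item))
    scored.sort(reverse=True)
    return [item for _, item in scored[:n]]


print(recommend(ratings, "alice"))
```

Item-based filtering follows the same pattern with the roles swapped: similarities are computed between items (over the users who rated both) rather than between users.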

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow