looking for a java library with a simple to calculate tf–idf, term frequency–inverse document frequency [closed]

StackOverflow https://stackoverflow.com/questions/17015658

  •  31-05-2022
  •  | 
  •  

I need to calculate tf-idf for a set of documents and am looking for a java library that does this.

NOTE: I am aware of Mahout but I really want is a library with a simple interface and one that does not require infrastructure setup.

有帮助吗?

解决方案

Mahout is easy to use and install. All you need is JDK environment and maven. how to install mahout

Also you could use hadoop with mahout, which is not a must (you could run mahout locally without hadoop). However you could find this blog helpful for install hadoop.

许可以下: CC-BY-SA归因
不隶属于 StackOverflow
scroll top