Question

I have tried using OpenNLP Tools 1.5 from SourceForge for getting the Named Entites from a text. I did find a JWNL file in the OpenNLP download. Does that imply that OpenNLP in turn is using WordNet for the Named Entity Recognition. (Specifically, does it mean that the Name Model files were generated with WordNet) ?

Was it helpful?

Solution

No, OpenNLP only uses WordNet for co-reference resolution, such as correlating "He" to "Steve Jones" in separate sentences.

WordNet is not used in Named Entity Recognition. The named entity training files are manually annotated fields from various conferences/tests, such as CoNLL.

Source: http://opennlp.sourceforge.net/README.html

OTHER TIPS

As I know OpenNLP use maximum entropy package to provide all their statistical models - pos tagging, sentence detection, NER even tokenization. Wordnet integration is not part of OpenNLP so I think its features were not used in training models.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top