Question

i am trying to extract the all the Named entities and its type in Freebase dump.

I have looked at How to extract Freebase Data Dump for a particular topic But i didn't get actual named entities in Freebase dump.

I'am trying to get the information something like this, Name: Barak Obama type: Person Name: New York type: City etc..

Freebase is already having all these entities in its database but i didn't find any of Freebase API or Google API to list all the entities and its type.

Could you please help me.

Thanks

Was it helpful?

Solution

The data dump format is described here. It's changed some since I wrote the answer that you reference, but the basic principals are still the same.

The MQLread and Search APIs can both be used to filter Freebase topics to a specific type (e.g. the /people/person type). Topics can have multiple types, but if what you want is the classic NER types, then the first thing to do is probably filter by the desired types since there are "only" about 3 million people in Freebase out of the tens of millions of topics.

Freebase contains a lot more information than just the name which would be useful in an NER task, so you might want to take a step back and think about what other properties you might want to extract as well.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top