Question

I want to search the google Api freebase. I want to get general amount of data. For example all Ids of songs, or films. I downloaded the data dumps gz file. I wonder what will be the best solution of parsing the file and getting the data I need. I am using .net c#.

Was it helpful?

Solution

There are a couple .NET libraries that can read the RDF format of the dumps:

The data dumps are also formatted as tab separated values so you should be able to use any CSV parser to parse each line as a triple.

Make sure that you read through the developer docs on how the data dumps are formatted. Basically, each line forms a triple that has a subject, predicate and object. To get all the data about films you'll be looking for triples that have a predicate that starts with /film/.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top