Question

I'm trying to use DBpedia Spotlight to spot special terms (which are not included in DBpedia) by using a local MediaWiki dump as input instead of the default index and spotter.dict. Any ideas would be much appreciated.


Solution

DBpedia Spotlight requires five files to build the index, as follows:

N-Triples format:

  • Instance Types: a list of URLs and their types (DBpedia, Freebase, etc.)

E.g:

   <YOUR_LINK> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <DBpedia:Type> .

  • Labels: a list of URLs and their labels

E.g:

   <YOUR_LINK> <http://www.w3.org/2000/01/rdf-schema#label> "Label"@en .

  • Redirects: a list of URLs and their redirect pages

E.g:

   <YOUR_LINK> <http://dbpedia.org/ontology/wikiPageRedirects> <YOUR_LINK> .

  • Disambiguations: a list of URLs and their disambiguation pages

E.g:

   <YOUR_LINK> <http://dbpedia.org/ontology/wikiPageDisambiguates> <YOUR_LINK> .
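The four N-Triples files described above can be generated with a short script. What follows is only a minimal sketch in Python, not part of DBpedia Spotlight itself: the `example.org` URIs, the type, the labels and the output file names are all hypothetical placeholders, and the `wikiPageDisambiguates` predicate is an assumption, since the answer does not show an example for disambiguations. Check the file names and predicates against what your Spotlight version expects.

```python
# All URIs, types, labels and file names below are hypothetical placeholders.
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
REDIRECTS = "http://dbpedia.org/ontology/wikiPageRedirects"
# Assumption: the disambiguation predicate is not shown in the answer above.
DISAMBIGUATES = "http://dbpedia.org/ontology/wikiPageDisambiguates"


def escape_literal(text):
    """Escape backslashes, quotes and line breaks for an N-Triples literal."""
    return (text.replace("\\", "\\\\")
                .replace('"', '\\"')
                .replace("\n", "\\n")
                .replace("\r", "\\r"))


def uri_triple(subject, predicate, obj):
    """Format a triple whose object is a URI."""
    return f"<{subject}> <{predicate}> <{obj}> ."


def label_triple(subject, label, lang="en"):
    """Format a triple whose object is a language-tagged label."""
    return f'<{subject}> <{RDFS_LABEL}> "{escape_literal(label)}"@{lang} .'


def build_files(types, labels, redirects, disambiguations):
    """Map each (hypothetical) file name to its list of N-Triples lines."""
    return {
        "instance_types.nt": [uri_triple(s, RDF_TYPE, t) for s, t in types],
        "labels.nt": [label_triple(s, text) for s, text in labels],
        "redirects.nt": [uri_triple(s, REDIRECTS, t) for s, t in redirects],
        "disambiguations.nt": [
            uri_triple(s, DISAMBIGUATES, t) for s, t in disambiguations
        ],
    }


if __name__ == "__main__":
    base = "http://example.org/resource/"
    files = build_files(
        types=[(base + "Foo_widget", "http://example.org/ontology/Device")],
        labels=[(base + "Foo_widget", "Foo widget")],
        redirects=[(base + "FooWidget", base + "Foo_widget")],
        disambiguations=[(base + "Foo", base + "Foo_widget")],
    )
    # Write one UTF-8 file per triple set, one triple per line.
    for name, lines in files.items():
        with open(name, "w", encoding="utf-8") as handle:
            handle.write("\n".join(lines) + "\n")
```

Each resource URI should correspond to a page title in your wiki dump, so that the triples and the dump describe the same entities.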

XML dump:

  • Wiki dump: an XML dump of your own wiki, in the same format as a Wikipedia dump.
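If your pages do not live in a MediaWiki instance that can export them, a dump-like file can be assembled by hand. The sketch below uses Python's standard `xml.etree.ElementTree` to write the core `page` / `revision` / `text` skeleton of a MediaWiki export. It is a simplified assumption of what is needed: real dumps also carry an XML namespace, a `siteinfo` header and further per-revision fields, which the Spotlight indexing tools may expect, and the page titles and wikitext here are made up.

```python
import xml.etree.ElementTree as ET


def build_dump(pages):
    """Build a minimal MediaWiki-style XML tree from {title: wikitext}."""
    root = ET.Element("mediawiki")
    for page_id, (title, wikitext) in enumerate(pages.items(), start=1):
        page = ET.SubElement(root, "page")
        ET.SubElement(page, "title").text = title
        ET.SubElement(page, "ns").text = "0"  # 0 is the main article namespace
        ET.SubElement(page, "id").text = str(page_id)
        revision = ET.SubElement(page, "revision")
        ET.SubElement(revision, "id").text = str(page_id)
        ET.SubElement(revision, "text").text = wikitext
    return ET.ElementTree(root)


if __name__ == "__main__":
    # Hypothetical pages; [[...]] links connect surface text to page titles.
    tree = build_dump({
        "Foo widget": "A '''Foo widget''' is a device used with a [[Bar engine]].",
        "Bar engine": "The '''Bar engine''' drives every [[Foo widget]].",
    })
    tree.write("my_wiki_dump.xml", encoding="utf-8", xml_declaration=True)
```

Exporting from a real MediaWiki installation, where that is possible, is the safer route, since it produces the full schema rather than this reduced one.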

After preparing these files with your own data, "just" follow the internationalization guide on the DBpedia Spotlight wiki to build the index from them.

All the best,

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow