I'm trying to use dbpedia spotlight to spot special terms (which is not included in dbpedia) by using a local mediawiki dump as an input instead of the default index and spotter.dict. Any ideas will be so appreciated

有帮助吗?

解决方案

DBpedia Spotlight requires 5(five) files to build the index as follows:

Format N Triples:

  • Instance Types: List of URLs and their types (DBpedia, Freebase etc)

E.g:

   <YOUR_LINK> <www.w3.org/1999/02/22-rdf-syntax-ns#type> <DBpedia:Type> .
  • Labels: List of URLs and Labels

E.g:

   <YOUR_LINK> <www.w3.org/2000/01/rdf-schema#label> "Label"@en .
  • Redirects: List of URLs and their redirect pages

E.g:

   <YOUR_LINK> <dbpedia.org/ontology/wikiPageRedirects> <YOUR_LINK> .
  • Disambiguations List of URLs and their disambiguations pages

    .

XML Dump:

  • Wiki dump - (like Wikipedia Dump).

After preparing these files with your own data, "just" follow the internationalization guide available in DBpedia Spotlight wiki to create the index with your own data.

All the best,

许可以下: CC-BY-SA归因
不隶属于 StackOverflow
scroll top