Question

How to find all German-Wikivoyage articles that are not Wikidata-linked to an English-Wikivoyage article?

For instance:

Could be in Python, but bash or any tool or even HTTP request is fine, the simpler the better.

Was it helpful?

Solution

Magnus Manske's Not in the other language can do this and outputs a tidy PagePile with around 12k items.

There are related tools if you also want to include the few articles not yet on Wikidata.

OTHER TIPS

I think the best way would be to access the SQL database for that.

For that, you can either get an account on Tool Labs or you can download the dump of the necessary tables of the database (here, that would be langlinks and page) and then import those into a local MySQL database.

The query would look something like:

select page_title
from page
where not exists
    (select * from langlinks where ll_from = page_id and ll_lang  = 'en')
and page_namespace = 0
and page_is_redirect = 0

This list also wouldn't contain pages that have a language link to English Wikivoyage using the old (in-page) method, but I believe those should be rare.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top