catexport

tool for working with Wikimedia data

Demo

Export Wikimedia Commons categories to your local machine with a convenient web user interface.

Try it here: http://catexport.herokuapp.com

Data

  • Wikimedia Commons API

Team

  • odi
  • and other team members

At #glamhack a number of categories were made available offline by loleg using this shell script and magnus-toolserver.

catexport

The aims of this projects are GLAM institutions that provide their data to the Wikimedia Commons projects and want to extract the categorization done by the community in a structured way. This tool uses the API of Wikimedia Commons in the background. It generates CSV with the following format

filename,category

If a file has multiple categories, there will be multiple entries for it. The first column acts as a unique identifier of the file.

This content is a preview from an external site.
 

Event finish

Add license and link to GitHub (@Stefan Oderbolz)

Added a bunch of extra information on the index page (@Stefan Oderbolz)

Fixed default text (@Stefan Oderbolz)

Get 100 records by each call (@Stefan Oderbolz)

Error handling (@Stefan Oderbolz)

Add index.html file (@Stefan Oderbolz)

Added README (@Stefan Oderbolz)

Add hostname to example (@Stefan Oderbolz)

Make sure app runs on heroku (@Stefan Oderbolz)

Recursive get all categories from Wikimedia Commons (@Stefan Oderbolz)

Initial commit (@Stefan Oderbolz)

Start