Sketching
60

WikiCommons metadata analysis tool

A metadata analysis tool comparing metadata of GLAM source systems with Wikimedia Commons.

⛶ Full screen

The Image Archive of the ETH Library is the largest Swiss GLAM image provider. Since the ETH-Bibliothek went Open Data in March 2015, the image archive has uploaded 60,000 out of more than 500'000 images to Wikimedia Commons. The tool Pattypan is currently used for uploading.

On the image database E-Pics Image Archive Online, volunteers have been able to comment on all images since January 2016, thereby improving the metadata. And they do it very diligently. More than 20,000 comments are received annually in the image archive and are incorporated into the metadata (see also our blog "Crowdsourcing").

However, the metadata on Wikimedia Commons is not updated and is therefore sometimes outdated, imprecise or even incorrect. The effort for the image archive to (manually) match the metadata would simply be too great. A tool does not yet exist.

Challenge

A general GLAM analysis tool that compares the metadata of the source system of the GLAMs (e. g. E-Pics Image Archive Online) and Wikimedia Commons and lists the differences. The analysis tool could highlight the differences (analogous to version control in Wikipedia), the user would have to manually choose for each hit whether the metadata is overwritten or not. Affected metadata fields: Title, date, description, ...

Automatic "overwriting" of metadata (update tool) is against the Wikimedia philosophy and is therefore undesirable. Furthermore, it is also possible that Wikipedians have made corrections themselves, which are not recorded by the image archive.

Data

Contact

Nicole Graf, Head Image Archive ETH Library

nicole.graf@library.ethz.ch

17.04.2021 17:30

Hackathon finished

17.04.2021 14:01 ~ nicole_graf

Cookbook

17.04.2021 13:39 ~ nicole_graf

Worked on documentation

17.04.2021 10:44 ~ nicole_graf

We're writing a cookbook for a 1) deluxe (within Pattypan) or 2) economy (outside Pattypan, i.e. OpenRefine?) solution

opendata.swiss

GLAM datasets on the open data platform of the Swiss Confederation

17.04.2021 10:41 ~ nicole_graf

Worked on documentation

16.04.2021 11:53 ~ nicole_graf

Looking for a Wikimedia Pro for a Wiki Commons Challenge focused on metadata adjustment

opendata.swiss

GLAM datasets on the open data platform of the Swiss Confederation

16.04.2021 11:51 ~ nicole_graf

Worked on documentation

16.04.2021 08:30

Hackathon started

29.03.2021 17:53

Team forming

GiFontenelle has joined!

23.03.2021 09:17 ~ nicole_graf

Worked on documentation

23.03.2021 08:58

Team forming

nicole_graf has joined!

23.03.2021 08:58

Project started

Initialized by nicole_graf 🎉

Connect to our community on: forum.opendata.ch | twitter | facebook

All attendees, sponsors, partners, volunteers and staff at our hackathon are required to agree with the Hack Code of Conduct. Organisers will enforce this code throughout the event. We expect cooperation from all participants to ensure a safe environment for everybody. For more details on how the event is run, see the Guidelines on our wiki.

Creative Commons LicenceThe contents of this website, unless otherwise stated, are licensed under a Creative Commons Attribution 4.0 International License.