OCR and linking of MEG inventories

Create a program to read and transcribe the handwritten MEG inventories and link the corresponding pages to the database

10

⛶  Fullscreen ↓  Download

The first inventories of the collection of MEG are old handwritten inventory books. These books contain some very important information regarding the objects and their provenance. Not only do they inform about the date at which the object was collected or about the name of the person who collected it. They also provide sometimes some very useful contextualizing information about the biography of the object. Plus, they show how the objects were used by European institutions to produce colonial knowledge on colonized people and their material heritage.

The information provided by these documents hasn’t always been transferred to the digital database. Thus, curators and researchers spend a lot of time searching the old inventories to potentially find some more information on the objects they’re working on. Such a program would be really helpful for all (provenance) research to be done in the future on the MEG objects.

Traces of colonial knowledge production

The City of Geneva's ethnography collection was brought together in 1901 in a single exhibition site, which became the Musée d'ethnographie. The collection is made up of several collections from different public and private museums, including the Musée des missions and the Musée archéologique. These collections are described in original handwritten inventories, with specific numbering. When the Musée d'ethnographie was created, a new inventory grouping together the various collections was handwritten until 1973. Between 1973 and 1993, the continuation of this register was typed. At the same time, to facilitate consultation and protect the handwritten registers, the entire MEG inventory register (1901-1971) was also typed. The paper registers were subsequently replaced by computerised systems.

Example:

The inventory entry of the object ETHAF 034815, Victory at the Battle of Adwa (1896) is linked with two inventory book (see bottom of the inventory page).

 Title

The first inventory Copie dactylographiée en 13 volumes de l'Inventaire original MEG manuscrit (which is a typed copy of the other one) was digitized with an OCR system. So, in this case, it was possible to link the exact page of the inventory related to this object.

When you click on the link, you access the corresponding inventory entry:

 Title

The handwritten inventories however wasn't yet digitized with an OCR system. Thus, the link leads to the whole handwritten inventory book.

 Title

Instead of leading to the whole inventory book, we'd like the link to lead to the corresponding page of the inventory:

 Title

The idea of this challenge would thus be

  • to create a program to read and transcribe the handwritten MEG inventories
  • to link the page containing an inventory entry with the corresponding object page in the museum database.

One problem case to be faced during the challenge: The inventory sheet of the number 025988 extends overs five pages of the register. (See inventory book number 11, page 12. Inventorybook 11 pp 12.JPG

Event finish

Edited (version 22)

1 year ago ~ marina_berazategui

Joined the team

1 year ago ~ Hugo

Start

Edited (version 19)

1 year ago ~ gaston

Edited (version 17)

1 year ago ~ marina_berazategui

Edited (version 16)

1 year ago ~ marina_berazategui

Joined the team

1 year ago ~ timtom

Edited (version 15)

1 year ago ~ liowalter

Joined the team

1 year ago ~ liowalter

Edited (version 14)

1 year ago ~ marina_berazategui

Edited (version 12)

1 year ago ~ marina_berazategui

Edited (version 10)

1 year ago ~ marina_berazategui

Edited (version 8)

1 year ago ~ marina_berazategui

Edited (version 6)

1 year ago ~ marina_berazategui

Edited (version 4)

1 year ago ~ marina_berazategui

Joined the team

1 year ago ~ marina_berazategui

Challenge shared
Tap here to review.

1 year ago ~ marina_berazategui
 
Contributed 1 year ago by marina_berazategui for GLAMhack 2023
All attendees, sponsors, partners, volunteers and staff at our hackathon are required to agree with the Hack Code of Conduct. Organisers will enforce this code throughout the event. We expect cooperation from all participants to ensure a safe environment for everybody.

Creative Commons LicenceThe contents of this website, unless otherwise stated, are licensed under a Creative Commons Attribution 4.0 International License.