This Challenge was posted 4 months ago

Challenge view

Back to Project

Sex and Crime und Kneippenschlägereien in Early Modern Zurich

Minutes reported by pastor in Early Modern Zurich

Demo

Make the "Stillstandsprotokolle" searchable, georeferenced and browsable and display them on a map.

For more Info see our Github Repository

Access the documents: archives-quickaccess.ch/search/stazh/stpzh

Data

Team

  • Ernst Rosser, ernst.rosser@gmail.com
  • Tobias Hodel, tobias.hodel@ji.zh.ch
  • Barbara Leimgruber, Barbara.Leimgruber@ji.zh.ch
  • Rebekka Plüss, Rebekka.Pluess@ji.zh.ch
  • Ismail Prada, ismail.prada@gmail.com
  • Matthias Mazenauer, matthias.mazenauer@statistik.ji.zh.ch

#glamhack2018

Sex and Crime und Kneipenschlägereien in der Frühen Neuzeit

Goal

Make the data ("Stillstandsprotokolle des 17. Jahrhunderts") better searchable and georeference it for visualization.

Team

  • Ernst Rosser, ernst.rosser@gmail.com
  • Barbara Leimgruber, Barbara.Leimgruber@ji.zh.ch
  • Rebekka Plüss, Rebekka.Pluess@ji.zh.ch
  • Ismail Prada, ismail.prada@gmail.com
  • Matthias Mazenauer, matthias.mazenauer@statistik.ji.zh.ch
  • Tobias Hodel, tobias.hodel@ji.zh.ch

Data sources:

Steps taken

  • Create lookup for normalized strings (https://github.com/mmznr/Staatsarchiv-GLAMhack/blob/master/woerterStillstand_Result.tsv)
  • Annotate named entities (normalization) -> places (also add BfS-data) -> persons (normalization to be used for auto-complete in search)
  • Cluster words -> based on "Frequenztabelle Stillstandsprotokolle", see https://github.com/mmznr/Staatsarchiv-GLAMhack/blob/master/README.md#frequency-list-of-word-cluster -> to be used to refer to topic/concept
  • Cluster documents -> to be used as keyword(s) in TEI header = Scripts for clustering, see folder "code"
  • Create script to add information as tags (in body) to write in XML (in work)

Lemmatization/Normalisation

  • Done: Wordlist and Frequencies

  • ToDo: POS tagging

Named Entities

  • Names of persons: done A-D

  • Names of places: done A-K

Visualization

Word-Cluster

Visualization

(using fasttext) https://github.com/mmznr/Staatsarchiv-GLAMhack/tree/master/Visualisierungen/clusters.png https://github.com/mmznr/Staatsarchiv-GLAMhack/tree/master/Visualisierungen/clusters2.png

Frequency list of Word-Cluster

https://docs.google.com/spreadsheets/d/1rFo7p9YsQRwJufMuWGw2677acOsWevcmm-lN5RVBJv4/edit?usp=sharing

GIS Visualization

https://beta.observablehq.com/@mmznrstat/sex-and-crime-und-kneipenschlagereien-in-der-fruhen-neuzei

  • Done: Borders from swisstopo via Linked Data, Matching of the settlements of the canton of Zurich

  • ToDo: Get List of old names of this settlements, match them and show all relating documents of a settlement (or municipality)

This content is a preview from an external site.