Sex and Crime und Kneipenschlägereien in Early Modern Zurich
Minutes kept by pastors in Early Modern Zurich
Make the "Stillstandsprotokolle" searchable, georeferenced, and browsable, and display them on a map.
For more information, see our GitHub repository: https://github.com/mmznr/Staatsarchiv-GLAMhack
Access the documents: archives-quickaccess.ch/search/stazh/stpzh
Data
- Primary Data
- Secondary data
- Siedlungsverzeichnis des Kantons Zürich: http://www.web.statistik.zh.ch/cms_siedlungsverzeichnis/daten.php
Team
- Ernst Rosser, ernst.rosser@gmail.com
- Tobias Hodel, tobias.hodel@ji.zh.ch
- Barbara Leimgruber, Barbara.Leimgruber@ji.zh.ch
- Rebekka Plüss, Rebekka.Pluess@ji.zh.ch
- Ismail Prada, ismail.prada@gmail.com
- Matthias Mazenauer, matthias.mazenauer@statistik.ji.zh.ch
#glamhack2018
Sex and Crime und Kneipenschlägereien in der Frühen Neuzeit
Goal
Make the data (the "Stillstandsprotokolle des 17. Jahrhunderts", i.e. the 17th-century Stillstand minutes) easier to search and georeference it for visualization.
Data sources:
Primary Data
Secondary data
Steps taken
- Create lookup for normalized strings (https://github.com/mmznr/Staatsarchiv-GLAMhack/blob/master/woerterStillstand_Result.tsv)
- Annotate named entities (normalization)
  - places (also add BfS data)
  - persons (normalization to be used for auto-complete in search)
- Cluster words
  - based on the "Frequenztabelle Stillstandsprotokolle", see https://github.com/mmznr/Staatsarchiv-GLAMhack/blob/master/README.md#frequency-list-of-word-cluster
  - to be used to refer to a topic/concept
- Cluster documents
  - to be used as keyword(s) in the TEI header
  - scripts for clustering: see the folder "code"
- Create a script that writes this information as tags (in the body) into the XML (in progress)
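The lookup and tagging steps above could be sketched roughly as follows. This is only an illustration, not the project's script: the two-column TSV layout (`original`, `normalized`), the sample entries, the helper names `load_lookup` and `tag_places`, and the choice of a TEI-style `<placeName key="...">` element are all assumptions.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical lookup data: historical spelling -> normalized form.
# The real woerterStillstand_Result.tsv may use different columns.
SAMPLE_TSV = "\n".join([
    "original\tnormalized",
    "Zurich\tZürich",
    "Wintterthur\tWinterthur",
])


def load_lookup(tsv_text):
    """Read a tab-separated lookup table into a dict."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return {row["original"]: row["normalized"] for row in reader}


def tag_places(text, lookup):
    """Return a <p> element in which known spellings are wrapped in <placeName>."""
    p = ET.Element("p")
    p.text = ""
    last = None  # most recent <placeName>; following text goes into its tail
    for i, token in enumerate(text.split()):
        sep = " " if i else ""
        core = token.rstrip(".,;:")       # ignore trailing punctuation
        trailing = token[len(core):]
        if core in lookup:
            if last is None:
                p.text += sep
            else:
                last.tail += sep
            last = ET.SubElement(p, "placeName", key=lookup[core])
            last.text = core
            last.tail = trailing
        elif last is None:
            p.text += sep + token
        else:
            last.tail += sep + token
    return p


lookup = load_lookup(SAMPLE_TSV)
paragraph = tag_places("Der Pfarrer von Wintterthur klagt.", lookup)
print(ET.tostring(paragraph, encoding="unicode"))
```

Keeping the original spelling as element text and the normalized form in an attribute leaves the transcription untouched while still making the normalized form available to the search.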
Lemmatization/Normalization
Done: Wordlist and Frequencies
ToDo: POS tagging
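A word list with frequencies, as marked done above, could be produced along these lines. This is a minimal sketch under assumptions: the tokenization rule (lowercasing, splitting on non-word characters), the function name `word_frequencies`, and the sample sentences are illustrative and not taken from the project.

```python
import re
from collections import Counter


def word_frequencies(texts):
    """Count lowercased word tokens across a list of text strings."""
    counts = Counter()
    for text in texts:
        # \w+ also matches letters with umlauts in Python 3 strings
        counts.update(re.findall(r"\w+", text.lower()))
    return counts


# Hypothetical sample sentences standing in for the minutes.
freqs = word_frequencies(["Der Pfarrer klagt", "der Stillstand klagt"])
for word, n in freqs.most_common(3):
    print(word, n)
```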
Named Entities
Names of persons: done A-D
Names of places: done A-K
Visualization
Word-Cluster
Visualization
(using fastText)
https://github.com/mmznr/Staatsarchiv-GLAMhack/tree/master/Visualisierungen/clusters.png
https://github.com/mmznr/Staatsarchiv-GLAMhack/tree/master/Visualisierungen/clusters2.png
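To give an idea of how word vectors can be grouped into clusters, here is a small sketch that uses greedy threshold clustering on cosine similarity. It is not the project's clustering script: the two-dimensional vectors are made-up toy values standing in for fastText embeddings, and the function names and the threshold of 0.9 are assumptions.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def cluster_words(vectors, threshold=0.9):
    """Put each word into the first cluster whose seed word is similar enough."""
    clusters = []  # each cluster is a list of words; the first word is the seed
    for word, vec in vectors.items():
        for cluster in clusters:
            if cosine(vec, vectors[cluster[0]]) >= threshold:
                cluster.append(word)
                break
        else:
            clusters.append([word])
    return clusters


# Toy vectors (hypothetical); real embeddings would have many more dimensions.
toy_vectors = {
    "wirt": (1.0, 0.1),
    "wirtshaus": (0.9, 0.2),
    "ehe": (0.1, 1.0),
    "ehebruch": (0.2, 0.9),
}
print(cluster_words(toy_vectors))
```

A greedy scheme like this depends on the order of the input words; it is shown only because it fits in a few lines.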
Frequency list of Word-Cluster
https://docs.google.com/spreadsheets/d/1rFo7p9YsQRwJufMuWGw2677acOsWevcmm-lN5RVBJv4/edit?usp=sharing
GIS Visualization
https://beta.observablehq.com/@mmznrstat/sex-and-crime-und-kneipenschlagereien-in-der-fruhen-neuzei
Done: Borders from swisstopo via Linked Data; matching of the settlements of the canton of Zurich
ToDo: Get a list of the old names of these settlements, match them, and show all documents related to a settlement (or municipality)
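One possible way to approach the open matching step is approximate string matching between historical spellings and modern settlement names. The sketch below uses Python's standard `difflib` module; the function name `match_settlement`, the sample names, and the cutoff of 0.8 are assumptions, and a real list of old names would likely need manual checking as well.

```python
import difflib


def match_settlement(old_name, modern_names, cutoff=0.8):
    """Return the modern name closest to old_name, or None if nothing is close enough."""
    by_lower = {name.lower(): name for name in modern_names}
    hits = difflib.get_close_matches(
        old_name.lower(), list(by_lower), n=1, cutoff=cutoff
    )
    return by_lower[hits[0]] if hits else None


# Hypothetical sample of modern settlement names.
modern = ["Winterthur", "Uster", "Wädenswil"]
print(match_settlement("Wintterthur", modern))
print(match_settlement("Basel", modern))
```

Documents could then be grouped under the matched modern name so that the map can list every document belonging to a settlement.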