Challenge view
Back to ProjectSex and Crime und Kneippenschlägereien in Early Modern Zurich
Minutes reported by pastor in Early Modern Zurich
Make the "Stillstandsprotokolle" searchable, georeferenced and browsable and display them on a map.
For more Info see our Github Repository
Access the documents: archives-quickaccess.ch/search/stazh/stpzh
Data
- Primary Data
- Secondary data
- Siedlungsverzeichnis des Kantons Zürich: http://www.web.statistik.zh.ch/cms_siedlungsverzeichnis/daten.php
Team
- Ernst Rosser, ernst.rosser@gmail.com
- Tobias Hodel, tobias.hodel@ji.zh.ch
- Barbara Leimgruber, Barbara.Leimgruber@ji.zh.ch
- Rebekka Plüss, Rebekka.Pluess@ji.zh.ch
- Ismail Prada, ismail.prada@gmail.com
- Matthias Mazenauer, matthias.mazenauer@statistik.ji.zh.ch
#glamhack2018
Sex and Crime und Kneipenschlägereien in der Frühen Neuzeit
Goal
Make the data ("Stillstandsprotokolle des 17. Jahrhunderts") better searchable and georeference it for visualization.
Team
- Ernst Rosser, ernst.rosser@gmail.com
- Barbara Leimgruber, Barbara.Leimgruber@ji.zh.ch
- Rebekka Plüss, Rebekka.Pluess@ji.zh.ch
- Ismail Prada, ismail.prada@gmail.com
- Matthias Mazenauer, matthias.mazenauer@statistik.ji.zh.ch
- Tobias Hodel, tobias.hodel@ji.zh.ch
Data sources:
-
Primary Data
-
Secondary data
Steps taken
- Create lookup for normalized strings (https://github.com/mmznr/Staatsarchiv-GLAMhack/blob/master/woerterStillstand_Result.tsv)
- Annotate named entities (normalization) -> places (also add BfS-data) -> persons (normalization to be used for auto-complete in search)
- Cluster words -> based on "Frequenztabelle Stillstandsprotokolle", see https://github.com/mmznr/Staatsarchiv-GLAMhack/blob/master/README.md#frequency-list-of-word-cluster -> to be used to refer to topic/concept
- Cluster documents -> to be used as keyword(s) in TEI header = Scripts for clustering, see folder "code"
- Create script to add information as tags (in body) to write in XML (in work)
Lemmatization/Normalisation
-
Done: Wordlist and Frequencies
-
ToDo: POS tagging
Named Entities
-
Names of persons: done A-D
-
Names of places: done A-K
Visualization
Word-Cluster
Visualization
(using fasttext) https://github.com/mmznr/Staatsarchiv-GLAMhack/tree/master/Visualisierungen/clusters.png https://github.com/mmznr/Staatsarchiv-GLAMhack/tree/master/Visualisierungen/clusters2.png
Frequency list of Word-Cluster
https://docs.google.com/spreadsheets/d/1rFo7p9YsQRwJufMuWGw2677acOsWevcmm-lN5RVBJv4/edit?usp=sharing
GIS Visualization
https://beta.observablehq.com/@mmznrstat/sex-and-crime-und-kneipenschlagereien-in-der-fruhen-neuzei
-
Done: Borders from swisstopo via Linked Data, Matching of the settlements of the canton of Zurich
-
ToDo: Get List of old names of this settlements, match them and show all relating documents of a settlement (or municipality)
Letterjongg
Open Cultural Data Hackathon 2018
SPARQLfish