Audio Analysis Challenge

(07) Retrieve as much information as possible from an audio collection, through various Machine Learning/Natural Language Processing methods

Challenge

Retrieve as much information as possible from an audio collection, through various Machine Learning/Natural Language Processing methods:

  • speech-to-text
  • speech emotion recognition / sentiment analysis (from the transcription text or directly on audio, if doable): classify and tag speech/speakers’ sentiment based on their polarity (positive, negative, or neutral) or beyond (different emotions)
  • eventually data visualizations based on the results (e.g., https://50-jahre-hitparade.ch/analysis/ - from where the chart above comes from)

Dataset

Collection “Radio pleine lune”: Radio Pleine Lune, was a feminist radio program in the Geneva region that started with pirate broadcasts in 1979. The collection has been deposited in the Archives contestataires in Geneva, which collects, preserves, and valorizes documents from social movements of the second half of the 20th century. The program existed from 1980 to 1999. It is of particular importance for the Archives contestataires insofar as it gives an account of the various media forms used by protest movements in the second half of the 20th century. The materials represent broadcasts, thus direct recordings in the studio, as well as some rush documents, essentially interviews.

Information about the collection:

http://inventaires.archivescontestataires.ch/index.php/fonds-radio-pleine-lune https://memobase.ch/fr/recordSet/acc-001

Metadata:

https://api.memobase.ch/record/advancedSearch?q=isPartOf:mbrs:acc-001 Metadata are in French. Most relevant fields are the title, the abstract and the keywords (hasSubject).

Data: 443 audio recordings.

Possible issues:

  • not enough training data
  • chaotic corpus (multiple voices, live speaking)

Needs: developers with experience with audio analysis algorithms; eventually, web designers.

Event finished

05.11.2022 16:30

Joined the team

05.11.2022 08:30 ~ loc_jaouen

Edited content version 22

05.11.2022 08:23 ~ roberta_padlina

Edited content version 20

05.11.2022 08:21 ~ roberta_padlina

Testing different solutions for speech-to-text

05.11.2022 08:20 ~ roberta_padlina

Ask

05.11.2022 08:19

Edited content version 16

05.11.2022 08:19 ~ roberta_padlina

Edited content version 14

04.11.2022 21:34 ~ Darienne

Event started

04.11.2022 09:00

Edited content version 12

03.11.2022 13:46 ~ jonaslendenmann

Ask

31.10.2022 09:13

Edited content version 8

31.10.2022 09:13 ~ roberta_padlina

Edited content version 6

31.10.2022 09:13 ~ roberta_padlina

Edited content version 4

31.10.2022 09:12 ~ roberta_padlina

Joined the team

31.10.2022 09:09 ~ roberta_padlina

Challenge posted

31.10.2022 09:09 ~ roberta_padlina
All attendees, sponsors, partners, volunteers and staff at our hackathon are required to agree with the Hack Code of Conduct. Organisers will enforce this code throughout the event. We expect cooperation from all participants to ensure a safe environment for everybody.

Creative Commons LicenceThe contents of this website, unless otherwise stated, are licensed under a Creative Commons Attribution 4.0 International License.

GLAMhack 2022