Challenge view
Back to Project07. Audio Analysis Challenge
Retrieve as much information as possible from an audio collection, through various Machine Learning/Natural Language Processing methods
Retrieve as much information as possible from an audio collection, through various Machine Learning/Natural Language Processing methods: * speech-to-text * speech emotion recognition / sentiment analysis (from the transcription text or directly on audio, if doable): classify and tag speech/speakers’ sentiment based on their polarity (positive, negative, or neutral) or beyond (different emotions) * eventually data visualizations based on the results (e.g., https://50-jahre-hitparade.ch/analysis/ - from where the chart above comes from)
Dataset:
Collection “Radio pleine lune”: Radio Pleine Lune, was a feminist radio program in the Geneva region that started with pirate broadcasts in 1979. The collection has been deposited in the Archives contestataires in Geneva, which collects, preserves, and valorizes documents from social movements of the second half of the 20th century. The program existed from 1980 to 1999. It is of particular importance for the Archives contestataires insofar as it gives an account of the various media forms used by protest movements in the second half of the 20th century. The materials represent broadcasts, thus direct recordings in the studio, as well as some rush documents, essentially interviews.
Information about the collection: http://inventaires.archivescontestataires.ch/index.php/fonds-radio-pleine-lune https://memobase.ch/fr/recordSet/acc-001
Metadata: https://api.memobase.ch/record/advancedSearch?q=isPartOf:mbrs:acc-001 Metadata are in French. Most relevant fields are the title, the abstract and the keywords (hasSubject).
Data: 443 audio recordings.
Possible issues: * not enough training data * chaotic corpus (multiple voices, live speaking)
Needs: developers with experience with audio analysis algorithms; eventually, web designers.