Edited (version 60)
Perceptual Plausibility
Build a game for classifying images as plausible (human-created) or implausible (machine-generated), that could also be "played" by an autonomous agent in order to automatically discern the provenance of content in a GLAM context, such as the images or other content shared on Wikimedia Commons. Make sure to include both the metadata (such as the Wikidata record) as well as the actual content in the assessment. A collection of synthetic and actual images from a Swiss municipality will be provided for this challenge.
To support investigation into the provenance of open cultural content, our challenge was to gamify data collection in Wikimedia - in particular, to decide the extent (e.g. a non-binary rating) to which an image is synthetic or generative in origin. The idea was to build on prior work like the Depiction Wikidata Game, which asks users to "Decide whether the image depicts the named item", or not. See Challenge Notes below for further details.
GLAMhack 2024 Project
We started with brainstorming options, developing a sensible mission statement, and planning the work in Excalidraw:
In the shared Etherpad we are collecting links to tools and resources.
Using a slightly adapted version of the WHY-NOW-HOW matrix, our features were quickly prioritized.
Pen and paper usually works best for nailing down a sketch (we went through a few iterations ..) with a vision that we can commit to.
We discovered WikiGuess by ComputerCow: a simple yet delightful game about guessing real/fake Wikipedia titles that helped us to test out our concept.
Then we set up a Flet/Python app open source project to flesh this out a bit more, and shared this in GitHub. More screenshots are in presentation slides and details of how to run it are in the README section below. Due to an open bug we have not yet been able to publish the prototype.
In parallel, we looked into how an AI evaluation or enhancement could be plugged into our app. Selecting several open source machine learning models, and testing them in the ComfyUI framework, we were able to demonstrate the potential of these techniques. Our model parameters are shared in this Switch drive folder. The workflow was deployed on Replicate, but we did not have time to connect the API.
Finally, a Wikipedia article on the topic of our challenge "Perceptual Plausibility" has been drafted, and we came up with the name Wiglitcher for our app (a portmanteau of Wiki and Glitch). A link on neuropsychological research on the topic "Reality Bending" may lead you to more interesting exploration of the topic!
Thanks to absolutely everyone at the hackathon, especially to the Pimp my portrait gallery team at the next table for their valuable support and collaboration.
Challenge Notes
As a dataset for this project, we propose to compile and curate additional media from artists in the municipality of Köniz, where a culture festival is being run in parallel to this year's GLAMhack. In addition to the 100+ existing Wikimedia Commons entries we can source additional materials to upload. We can then generate an equal amount of images using Stable Diffusion (with Community License) close to the subject matter of the Commons collection. Some of these images will be "uncanny", making it hard to decide their origins. The complete collection will be provided as a Data Package in open formats. A game could be prototyped with an open source framework used for citizen science (e.g. PyBossa) and inspired by other crowdsourcing projects (e.g. StreetComplete).
The output of this challenge should be a concept sketch, a prototype, or even playable game, that can engage the public and contribute to the debate on Perceptual Plausibility outlined below.
Screenshot of the Depiction Wikidata game, Sigalov and Nachmias 2023
The GLAM community is at the heart of academic and critical debate on the use of machine learning (ML) in creative domains. Whether generating text, images, sounds, video, or 3D metaverses, we explore the multimedia advances and augmentative potential of artificial intelligence (AI) with active interest and critical reflection on the impact to society.
With ML and AI now built into illustration tools, photocameras, and other recording devices, it may be argued that the lines of what is or what is not "generative" are indeed increasingly blurry. One aspect of this discussion that we might agree on as designers or critics alike, is that Perceptual Plausibility - such as photo-realism\, in the case of images - is an important differentiating factor. See the History of Text-to-image models and Uncanny Valley for further references on Wikipedia.
There are numerous tools and even ML-based benchmarks to test the generative provenance of uploaded content, with varying levels of accuracy and applicability. We even see increasing use of adversarial techniques like Nightshade. The Turing test begs the question of whether determining plausibility (realism) as a human observer or moderator will be possible for much longer. See also: Stable Diffusion and Why It Matters and Evaluating Diffusion Models.
Here is an excerpt of the recently posted official guidance on Commons:AI-generated media states:
Per the Commons project scope, only media that are realistically useful for an educational purpose should be hosted on Commons. Just because an AI image is interesting, pretty, or looks like a work of art, that doesn't mean that it is necessarily within the scope of Commons. While some AI-generated media fall within our scope, media that lack a realistic educational use may be nominated for deletion.
The legal discussion, a confusing cornucopia of legislative debates on AI and copyright around the world related to this, was neatly summed up last October by Creative Commons in Understanding CC Licenses and Generative AI:
We encourage the use of CC0 for those works that do not involve a significant degree of human creativity, to clarify the intellectual property status of the work and to ensure the public domain grows and thrives. ... Neither copyright nor CC licenses can or should address all of the ways that AI might impact people. There are no easy solutions, but it is clear we need to step outside of copyright to work together on governance, regulatory frameworks, societal norms, and many other mechanisms to enable us to harness AI technologies and practices for good.
In terms of content labelling, there have been several initiatives in this community:
- How should AI-generated media be handled? section of the Guidance mentioned above;
- Category:AI-generated images by Commons users;
- Wikimedia Commons AI/Essay, part of a rejected Proposal for a sister project (similar to WikiAI);
- AI Sauna/AI for Wikimedia Commons at a recent community event;
Recently the Vimeo video sharing platform, an early adopter of CC licenses, announced new AI guidelines, along with a revised Terms of Service.
Image: Vimeo Help Center
A project like this one could also be used to explore critical questions about the way online gigs and microtasking is used for building AI datasets: see Inside the AI Factory, and Are CAPTCHAs used to to train AI?
This image, and the one in the header, were rendered using Stable DIffusion XL 1.0.
Please feel free to leave a Comment here with any further links.
Wiglitcher App
For GLAMhack 2024..
To run the app:
Create a .env
file and put the Personal key that you made on Wikimedia API in there like this:
WIKIMEDIA_TOKEN=eyJ0eXAiOiJKV1QiLCJ...
Then you can:
flet run [app_directory]
Prototype
Added screenshots to project notes and uploaded final presentation
Event finish
MediaWiki API
video (@irmakoz1)
Prototype
Merge branch 'main' of https://github.com/snappy91/Wiglitcher (@irmakoz1)
photo (@irmakoz1)
images (@snappy91)
Replicate API
Merge branch 'main' of github.com:snappy91/Wiglitcher
birds (@irmakoz1)
text-ai (@irmakoz1)
Taking a break. In the library 🩵📚
Prototype
photo-of-nodes
nodes photo (@irmakoz1)
Merge branch 'main' of https://github.com/snappy91/Wiglitcher (@irmakoz1)
comfyai-api
image AI (@irmakoz1)
Prototype
README
Client storage
Routes navigation
comfyai api to the app : https://9elements.com/blog/hosting-a-comfyui-workflow-via-api/
Prototype
Navigation and API
Merged main
"improve" view (@snappy91)
Refactored into pages
Sketching
Merge branch 'main' of github.com:snappy91/Wiglitcher
page views added (@snappy91)
Good morning Lucerne! (8bit remix)
Sketching
Repository updated
Research
8-|
initial commit :) (@snappy91)
Initial commit (@snappy91)
Repository on GitHub: https://github.com/snappy91/Wiglitcher
Research
What, overengineering? We used Excalidraw and Etherpad to brainstorm, but it was a good old-fashioned pen & paper that helped us to focus the ideas and agree on our deliverables!
hard work only!
Thanks to our colleagues at the next table we have found our focus and fresh motivation! Nobody is too keen on the name though 😅 ideas welcome!
Start
Joined the team
Project
We are getting started in the library 🤫 Just added an Excalidraw mood board to our 📖 project page to refine the idea.
Joined the team
Looking forward to #GLAMhack! Just 2 weeks to go 🗺️ Here's a 100-year-old map to help find our way.
Joined the team