Perceptual Plausibility

Build a game for classifying images as plausible (human-created) or implausible (machine-generated), that could also be "played" by an autonomous agent in order to automatically discern the provenance of content in a GLAM context, such as the images or other content shared on Wikimedia Commons. Make sure to include both the metadata (such as the Wikidata record) as well as the actual content in the assessment. A collection of synthetic and actual images from a Swiss municipality will be provided for this challenge.

@Basement

Pitch Team Readme 62 Log

↓ Open ✨ Demo

To support investigation into the provenance of open cultural content, our challenge was to gamify data collection in Wikimedia - in particular, to decide the extent (e.g. a non-binary rating) to which an image is synthetic or generative in origin. The idea was to build on prior work like the Depiction Wikidata Game, which asks users to "Decide whether the image depicts the named item", or not. See Challenge Notes below for further details.

GLAMhack 2024 Project

We started with brainstorming options, developing a sensible mission statement, and planning the work in Excalidraw:

Writing out our goals in Excalidraw

In the shared Etherpad we are collecting links to tools and resources.

Design thinking tool

Using a slightly adapted version of the WHY-NOW-HOW matrix, our features were quickly prioritized.

Photo of a sketch

Pen and paper usually works best for nailing down a sketch (we went through a few iterations ..) with a vision that we can commit to.

Screenshot of Wikiguess game

We discovered WikiGuess by ComputerCow: a simple yet delightful game about guessing real/fake Wikipedia titles that helped us to test out our concept.

Screenshot of app

Then we set up a Flet/Python app open source project to flesh this out a bit more, and shared this in GitHub. More screenshots are in presentation slides and details of how to run it are in the README section below. Due to an open bug we have not yet been able to publish the prototype.

ComfyUI screenshot

In parallel, we looked into how an AI evaluation or enhancement could be plugged into our app. Selecting several open source machine learning models, and testing them in the ComfyUI framework, we were able to demonstrate the potential of these techniques. Our model parameters are shared in this Switch drive folder. The workflow was deployed on Replicate, but we did not have time to connect the API.

Wikipedia draft

Finally, a Wikipedia article on the topic of our challenge "Perceptual Plausibility" has been drafted, and we came up with the name Wiglitcher for our app (a portmanteau of Wiki and Glitch). A link on neuropsychological research on the topic "Reality Bending" may lead you to more interesting exploration of the topic!

Thanks to absolutely everyone at the hackathon, especially to the Pimp my portrait gallery team at the next table for their valuable support and collaboration.

Challenge Notes

As a dataset for this project, we propose to compile and curate additional media from artists in the municipality of Köniz, where a culture festival is being run in parallel to this year's GLAMhack. In addition to the 100+ existing Wikimedia Commons entries we can source additional materials to upload. We can then generate an equal amount of images using Stable Diffusion (with Community License) close to the subject matter of the Commons collection. Some of these images will be "uncanny", making it hard to decide their origins. The complete collection will be provided as a Data Package in open formats. A game could be prototyped with an open source framework used for citizen science (e.g. PyBossa) and inspired by other crowdsourcing projects (e.g. StreetComplete).

The output of this challenge should be a concept sketch, a prototype, or even playable game, that can engage the public and contribute to the debate on Perceptual Plausibility outlined below.

Screenshot of the Depiction Wikidata game, Sigalov and Nachmias 2023

The GLAM community is at the heart of academic and critical debate on the use of machine learning (ML) in creative domains. Whether generating text, images, sounds, video, or 3D metaverses, we explore the multimedia advances and augmentative potential of artificial intelligence (AI) with active interest and critical reflection on the impact to society.

With ML and AI now built into illustration tools, photocameras, and other recording devices, it may be argued that the lines of what is or what is not "generative" are indeed increasingly blurry. One aspect of this discussion that we might agree on as designers or critics alike, is that Perceptual Plausibility - such as photo-realism, in the case of images - is an important differentiating factor. See the History of Text-to-image models and Uncanny Valley for further references on Wikipedia.

There are numerous tools and even ML-based benchmarks to test the generative provenance of uploaded content, with varying levels of accuracy and applicability. We even see increasing use of adversarial techniques like Nightshade. The Turing test begs the question of whether determining plausibility (realism) as a human observer or moderator will be possible for much longer. See also: Stable Diffusion and Why It Matters and Evaluating Diffusion Models.

Here is an excerpt of the recently posted official guidance on Commons:AI-generated media states:

Per the Commons project scope, only media that are realistically useful for an educational purpose should be hosted on Commons. Just because an AI image is interesting, pretty, or looks like a work of art, that doesn't mean that it is necessarily within the scope of Commons. While some AI-generated media fall within our scope, media that lack a realistic educational use may be nominated for deletion.

The legal discussion, a confusing cornucopia of legislative debates on AI and copyright around the world related to this, was neatly summed up last October by Creative Commons in Understanding CC Licenses and Generative AI:

We encourage the use of CC0 for those works that do not involve a significant degree of human creativity, to clarify the intellectual property status of the work and to ensure the public domain grows and thrives. ... Neither copyright nor CC licenses can or should address all of the ways that AI might impact people. There are no easy solutions, but it is clear we need to step outside of copyright to work together on governance, regulatory frameworks, societal norms, and many other mechanisms to enable us to harness AI technologies and practices for good.

In terms of content labelling, there have been several initiatives in this community:

How should AI-generated media be handled? section of the Guidance mentioned above;
Category:AI-generated images by Commons users;
Wikimedia Commons AI/Essay, part of a rejected Proposal for a sister project (similar to WikiAI);
AI Sauna/AI for Wikimedia Commons at a recent community event;

Recently the Vimeo video sharing platform, an early adopter of CC licenses, announced new AI guidelines, along with a revised Terms of Service.

Image: Vimeo Help Center

A project like this one could also be used to explore critical questions about the way online gigs and microtasking is used for building AI datasets: see Inside the AI Factory, and Are CAPTCHAs used to to train AI?

This image, and the one in the header, were rendered using Stable DIffusion XL 1.0.

Please feel free to leave a Comment here with any further links.

Wiglitcher App

For GLAMhack 2024..

To run the app:

Create a .env file and put the Personal key that you made on Wikimedia API in there like this:

WIKIMEDIA_TOKEN=eyJ0eXAiOiJKV1QiLCJ...

Then you can:

flet run [app_directory]

This content is a preview from an external site.

👋 Contact ✨ Demo 💻 Source

⎌

Edited (version 60)

4 months ago ~ loleg

Prototype

Added screenshots to project notes and uploaded final presentation

4 months ago ~ loleg

⎌

Edited (version 58)

4 months ago ~ loleg

⎌

Edited (version 57)

4 months ago ~ loleg

Event finish

MediaWiki API

GLAMhack 2024

Perceptual Plausibility

GLAMhack 2024 Project

Challenge Notes

Wiglitcher App

Prototype

Event finish

Prototype

Prototype

Prototype

Prototype

Sketching

Sketching

Research

Research

Start

Project