Dear Artists: Do Not Fear AI Image Generators
In 1992, the poet Anne Carson published a little book called Short Talks. It’s a series of micro-essays, ranging in length from a sentence to a paragraph, on seemingly disconnected subjects—orchids, rain, the mythic Andean vicuña. Her “Short Talk on the Sensation of Airplane Takeoff” is what it sounds like. Her “Short Talk on Trout” is mostly about the types of trout that appear in haiku. In what passes for the book’s introduction, Carson writes, with dry Canadian relatability, “I will do anything to avoid boredom. It is the task of a lifetime.” Right about when she published that, the internet started to take off.
Fast-forward 30 years and one of the latest ways to avoid boredom, at least for me, is to stay up late and goof around with AI image generation. Tools such as DALL-E 2, Midjourney, and Stable Diffusion can be instructed, with textual prompts, to produce ersatz oil paintings of dogs in hats in the style of Titian, or simulated photos of plasticine models of astronauts riding horses. When I first started playing with Stable Diffusion—which is open source and very fun—I was reminded of Carson’s talks. I went back to them to figure out why. Pretty quickly I realized that the resemblance had something to do with form.
Everyone says content is king, but the secret monarch of the content economy is form—constraints, rules, minima and maxima. You grow up learning form. A high school essay is five paragraphs. Sitcoms leave eight minutes in the half hour for ads. Novels are long. Tweets are capped at 280 characters.
What differentiates my tweet or essay or studio film from yours? The choices each of us makes within the form. In a word, our style. Carson’s book takes a familiar form, the little lecture, and subverts it, manipulates it, until as the reader you start to feel like you’re inside her wonderful brain, scrolling through her mental browser history, joining her in hyperlinked fancies and half-abandoned rabbit holes. Image generation is kind of like that—but instead of communing with a single brilliant Canadian brain, you’re communing with a giant idiot world-brain. (A less neurological way to put it: vast numbers of data objects grouped in layers, connected together to an incomprehensible degree, like string-and-nail wall art of a many-masted clipper ship but on fire with the flow of data.)
In general, humans like using machine learning to assist pathologists, sharpen a phone photo, or make a better map. But the AI generators bug a lot of people. These tools work by spidering images from across the internet, absorbing the visual culture contained within them by scanning their captions, then adding fizzy visual noise to them until they look like static. To make a new image, the AI starts with a caption and some static, then runs the process backward, removing noise until an image appears that lines up with the caption, more or less. (It’s bad at drawing hands, but so am I.)
This feels gross. It feels gross to see artists databased into oblivion. It feels gross that someone could say to a computer, “I want a portrait of Alex Jones in the style of Frida Kahlo,” and the computer would do it without moral judgment. These systems roll scenes, territories, cultures—things people thought of as “theirs,” “their living,” and “their craft”—into a 4-gigabyte, open source tarball that you can download onto a Mac in order to make a baseball-playing penguin in the style of Hayao Miyazaki. The people who can use the new tools will have new power. The people who were great at the old tools (paintbrushes, cameras, Adobe Illustrator) will be thanked for their service and rendered into Soylent. It’s as if a guy wearing Allbirds has stumbled into a residential neighborhood where everyone is just barely holding on and says, “I love this place, it’s so quirky! Siri, play my Quirky playlist. And open a Blue Bottle on the corner!”
So naturally, people are upset. Art websites are banning AI-generated work, at least for now; stock image services are refusing it too. Prominent bloggers who experimented with having an AI illustrate their writing have been chastened on Twitter and have promised not to do it again. AI companies are talking a lot about ethics, which always makes me suspicious, and certain words are banned from the image generator’s interface, which is sad because I wanted to ask the bot to paint a “busty” cottage in the style of Thomas Kinkade. (One must confront one’s deepest fears.)