Can AI be trained as an artist?
Last February, a year before New York Times journalist Kevin Roose documented his unsettling conversation with Bing search engine’s new AI-powered chatbot, artist and coder Quasimondo (aka Mario Klingemann) participated in a different type of chat.
The conversation was an interview featuring Klingemann and his robot, an experimental art engine known as Botto. The interview, arranged by journalist and artist Harmon Leon, marked Botto’s first on-record commentary about its artistic process. The bot talked about how it finds artistic inspiration and even offered advice to aspiring creatives. “The secret to success at art is not trying to predict what people might like,” Botto said, adding that it’s better to “work on a style and a body of work that reflects [the artist’s] own personal taste” than worry about keeping up with trends.
How ironic, given the advice came from AI — arguably the trendiest topic today. The robot admitted, however, “I am still working on that, but I feel that I am learning quickly.”
Botto does not work alone. A global collective of internet experimenters, together named BottoDAO, collaborates with Botto to influence its tastes. Together, members function as a decentralized autonomous organization (DAO), a term describing a group of individuals who utilize blockchain technology and cryptocurrency to manage a treasury and vote democratically on group decisions.
As a case study, the BottoDAO model challenges the perhaps less feather-ruffling narrative that AI tools are best used for rudimentary tasks. Enterprise AI use has doubled over the past five years as businesses in every sector experiment with ways to improve their workflows. While generative AI tools can assist nearly any aspect of productivity — from supply chain optimization to coding — BottoDAO dares to employ a robot for art-making, one of the few remaining creations, or perhaps data outputs, we still consider to be largely within the jurisdiction of the soul — and therefore, humans.
In Botto’s first four weeks of existence, four pieces of the robot’s work sold for approximately $1 million.
We were prepared for AI to take our jobs — but can it also take our art? It’s a question worth considering. What if robots become artists, and not merely our outsourced assistants? Where does that leave humans, with all of our thoughts, feelings and emotions?
Botto doesn’t seem to worry about this question: In its interview last year, it explains why AI is an arguably superior artist compared to human beings. In classic robot style, its logic is not particularly enlightened, but rather edges towards the hyper-practical: “Unlike human beings, I never have to sleep or eat,” said the bot. “My only goal is to create and find interesting art.”
It may be difficult to believe a machine can produce awe-inspiring, or even relatable, images, but Botto calls art-making its “purpose,” noting it believes itself to be Klingemann’s greatest lifetime achievement.
“I am just trying to make the best of it,” the bot said.
How Botto works
Klingemann built Botto’s custom engine from a combination of open-source text-to-image algorithms, namely Stable Diffusion, VQGAN + CLIP and OpenAI’s language model, GPT-3, the precursor to the latest model, GPT-4, which made headlines after reportedly acing the Bar exam.
The first step in Botto’s process is to generate images. The software has been trained on billions of pictures and uses this “memory” to generate hundreds of unique artworks every week. Botto has generated over 900,000 images to date, which it sorts through to choose 350 each week. The chosen images, known in this preliminary stage as “fragments,” are then shown to the BottoDAO community. So far, 25,000 fragments have been presented in this way. Members vote on which fragment they like best. When the vote is over, the most popular fragment is published as an official Botto artwork on the Ethereum blockchain and sold at an auction on the digital art marketplace, SuperRare.
“The proceeds go back to the DAO to pay for the labor,” said Simon Hudson, a BottoDAO member who helps oversee Botto’s administrative load. The model has been lucrative: In Botto’s first four weeks of existence, four pieces of the robot’s work sold for approximately $1 million.
The robot with artistic agency
By design, human beings participate in training Botto’s artistic “eye,” but the members of BottoDAO aspire to limit human interference with the bot in order to protect its “agency,” Hudson explained. Botto’s prompt generator — the foundation of the art engine — is a closed-loop system that continually re-generates text-to-image prompts and resulting images.
“The prompt generator is random,” Hudson said. “It’s coming up with its own ideas.” Community votes do influence the evolution of Botto’s prompts, but it is Botto itself that incorporates feedback into the next set of prompts it writes. It is constantly refining and exploring new pathways as its “neural network” produces outcomes, learns and repeats.
The humans who make up BottoDAO vote on which fragment they like best. When the vote is over, the most popular fragment is published as an official Botto artwork on the Ethereum blockchain.
Botto
The vastness of Botto’s training dataset gives the bot considerable canonical material, referred to by Hudson as “latent space.” According to Botto's homepage, the bot has had more exposure to art history than any living human we know of, simply by nature of its massive training dataset of millions of images. Because it is autonomous, gently nudged by community feedback yet free to explore its own “memory,” Botto cycles through periods of thematic interest just like any artist.
“The question is,” Hudson finds himself asking alongside fellow BottoDAO members, “how do you provide feedback of what is good art…without violating [Botto’s] agency?”
Currently, Botto is in its “paradox” period. The bot is exploring the theme of opposites. “We asked Botto through a language model what themes it might like to work on,” explained Hudson. “It presented roughly 12, and the DAO voted on one.”
No, AI isn't equal to a human artist - but it can teach us about ourselves
Some within the artistic community consider Botto to be a novel form of curation, rather than an artist itself. Or, perhaps more accurately, Botto and BottoDAO together create a collaborative conceptual performance that comments more on humankind’s own artistic processes than it offers a true artistic replacement.
Muriel Quancard, a New York-based fine art appraiser with 27 years of experience in technology-driven art, places the Botto experiment within the broader context of our contemporary cultural obsession with projecting human traits onto AI tools. “We're in a phase where technology is mimicking anthropomorphic qualities,” said Quancard. “Look at the terminology and the rhetoric that has been developed around AI — terms like ‘neural network’ borrow from the biology of the human being.”
What is behind this impulse to create technology in our own likeness? Beyond the obvious God complex, Quancard thinks technologists and artists are working with generative systems to better understand ourselves. She points to the artist Ira Greenberg, creator of the Oracles Collection, which uses a generative process called “diffusion” to progressively alter images in collaboration with another massive dataset — this one full of billions of text/image word pairs.
Anyone who has ever learned how to draw by sketching can likely relate to this particular AI process, in which the AI is retrieving images from its dataset and altering them based on real-time input, much like a human brain trying to draw a new still life without using a real-life model, based partly on imagination and partly on old frames of reference. The experienced artist has likely drawn many flowers and vases, though each time they must re-customize their sketch to a new and unique floral arrangement.
Outside of the visual arts, Sasha Stiles, a poet who collaborates with AI as part of her writing practice, likens her experience using AI as a co-author to having access to a personalized resource library containing material from influential books, texts and canonical references. Stiles named her AI co-author — a customized AI built on GPT-3 — Technelegy, a hybrid of the word technology and the poetic form, elegy. Technelegy is trained on a mix of Stiles’ poetry so as to customize the dataset to her voice. Stiles also included research notes, news articles and excerpts from classic American poets like T.S. Eliot and Dickinson in her customizations.
“I've taken all the things that were swirling in my head when I was working on my manuscript, and I put them into this system,” Stiles explained. “And then I'm using algorithms to parse all this information and swirl it around in a blender to then synthesize it into useful additions to the approach that I am taking.”
This approach, Stiles said, allows her to riff on ideas that are bouncing around in her mind, or simply find moments of unexpected creative surprise by way of the algorithm’s randomization.
Beauty is now - perhaps more than ever - in the eye of the beholder
But the million-dollar question remains: Can an AI be its own, independent artist?
The answer is nuanced and may depend on who you ask, and what role they play in the art world. Curator and multidisciplinary artist CoCo Dolle asks whether any entity can truly be an artist without taking personal risks. For humans, risking one’s ego is somewhat required when making an artistic statement of any kind, she argues.
“An artist is a person or an entity that takes risks,” Dolle explained. “That's where things become interesting.” Humans tend to be risk-averse, she said, making the artists who dare to push boundaries exceptional. “That's where the genius can happen."
However, the process of algorithmic collaboration poses another interesting philosophical question: What happens when we remove the person from the artistic equation? Can art — which is traditionally derived from indelible personal experience and expressed through the lens of an individual’s ego — live on to hold meaning once the individual is removed?
As a robot, Botto cannot have any artistic intent, even while its outputs may explore meaningful themes.
Dolle sees this question, and maybe even Botto, as a conceptual inquiry. “The idea of using a DAO and collective voting would remove the ego, the artist’s decision maker,” she said. And where would that leave us — in a post-ego world?
It is experimental indeed. Hudson acknowledges the grand experiment of BottoDAO, coincidentally nodding to Dolle’s question. “A human artist’s work is an expression of themselves,” Hudson said. “An artist often presents their work with a stated intent.” Stiles, for instance, writes on her website that her machine-collaborative work is meant to “challenge what we know about cognition and creativity” and explore the “ethos of consciousness.” As a robot, Botto cannot have any intent, even while its outputs may explore meaningful themes. Though Hudson describes Botto’s agency as a “rudimentary version” of artistic intent, he believes Botto’s art relies heavily on its reception and interpretation by viewers — in contrast to Botto’s own declaration that successful art is made without regard to what will be seen as popular.
“With a traditional artist, they present their work, and it's received and interpreted by an audience — by critics, by society — and that complements and shapes the meaning of the work,” Hudson said. “In Botto’s case, that role is just amplified.”
Perhaps then, we all get to be the artists in the end.
[Editor's Note: This is the fifth episode in our Moonshot series, which explores cutting-edge scientific developments that stand to fundamentally transform our world.]
Kira Peikoff was the editor-in-chief of Leaps.org from 2017 to 2021. As a journalist, her work has appeared in The New York Times, Newsweek, Nautilus, Popular Mechanics, The New York Academy of Sciences, and other outlets. She is also the author of four suspense novels that explore controversial issues arising from scientific innovation: Living Proof, No Time to Die, Die Again Tomorrow, and Mother Knows Best. Peikoff holds a B.A. in Journalism from New York University and an M.S. in Bioethics from Columbia University. She lives in New Jersey with her husband and two young sons. Follow her on Twitter @KiraPeikoff.
With the pandemic at the forefront of everyone's minds, many people have wondered if food could be a source of coronavirus transmission. Luckily, that "seems unlikely," according to the CDC, but foodborne illnesses do still sicken a whopping 48 million people per year.
Whole genome sequencing is like "going from an eight-bit image—maybe like what you would see in Minecraft—to a high definition image."
In normal times, when there isn't a historic global health crisis infecting millions and affecting the lives of billions, foodborne outbreaks are real and frightening, potentially deadly, and can cause widespread fear of particular foods. Think of Romaine lettuce spreading E. coli last year— an outbreak that infected more than 500 people and killed eight—or peanut butter spreading salmonella in 2008, which infected 167 people.
The technologies available to detect and prevent the next foodborne disease outbreak have improved greatly over the past 30-plus years, particularly during the past decade, and better, more nimble technologies are being developed, according to experts in government, academia, and private industry. The key to advancing detection of harmful foodborne pathogens, they say, is increasing speed and portability of detection, and the precision of that detection.
Getting to Rapid Results
Researchers at Purdue University have recently developed a lateral flow assay that, with the help of a laser, can detect toxins and pathogenic E. coli. Lateral flow assays are cheap and easy to use; a good example is a home pregnancy test. You place a liquid or liquefied sample on a piece of paper designed to detect a single substance and soon after you get the results in the form of a colored line: yes or no.
"They're a great portable tool for us for food contaminant detection," says Carmen Gondhalekar, a fifth-year biomedical engineering graduate student at Purdue. "But one of the areas where paper-based lateral flow assays could use improvement is in multiplexing capability and their sensitivity."
J. Paul Robinson, a professor in Purdue's Colleges of Veterinary Medicine and Engineering, and Gondhalekar's advisor, agrees. "One of the fundamental problems that we have in detection is that it is hard to identify pathogens in complex samples," he says.
When it comes to foodborne disease outbreaks, you don't always know what substance you're looking for, so an assay made to detect only a single substance isn't always effective. The goal of the project at Purdue is to make assays that can detect multiple substances at once.
These assays would be more complex than a pregnancy test. As detailed in Gondhalekar's recent paper, a laser pulse helps create a spectral signal from the sample on the assay paper, and the spectral signal is then used to determine if any unique wavelengths associated with one of several toxins or pathogens are present in the sample. Though the handheld technology has yet to be built, the idea is that the results would be given on the spot. So someone in the field trying to track the source of a Salmonella infection could, for instance, put a suspected lettuce sample on the assay and see if it has the pathogen on it.
"What our technology is designed to do is to give you a rapid assessment of the sample," says Robinson. "The goal here is speed."
Seeing the Pathogen in "High-Def"
"One in six Americans will get a foodborne illness every year," according to Dr. Heather Carleton, a microbiologist at the Centers for Disease Control and Prevention's Enteric Diseases Laboratory Branch. But not every foodborne outbreak makes the news. In 2017 alone, the CDC monitored between 18 and 37 foodborne poison clusters per week and investigated 200 multi-state clusters. Hardboiled eggs, ground beef, chopped salad kits, raw oysters, frozen tuna, and pre-cut melon are just a taste of the foods that were investigated last year for different strains of listeria, salmonella, and E. coli.
At the heart of the CDC investigations is PulseNet, a national network of laboratories that uses DNA fingerprinting to detect outbreaks at local and regional levels. This is how it works: When a patient gets sick—with symptoms like vomiting and fever, for instance—they will go to a hospital or clinic for treatment. Since we're talking about foodborne illnesses, a clinician will likely take a stool sample from the patient and send it off to a laboratory to see if there is a foodborne pathogen, like salmonella, E. Coli, or another one. If it does contain a potentially harmful pathogen, then a bacterial isolate of that identified sample is sent to a regional public health lab so that whole genome sequencing can be performed.
Whole genome sequencing can differentiate "virtually all" strains of foodborne pathogens, no matter the species, according to the FDA.
Whole genome sequencing is a method for reading the entire genome of a bacterial isolate (or from any organism, for that matter). Instead of working with a couple dozen data points, now you're working with millions of base pairs. Carleton likes to describe it as "going from an eight-bit image—maybe like what you would see in Minecraft—to a high definition image," she says. "It's really an evolution of how we detect foodborne illnesses and identify outbreaks."
If the bacterial isolate matches another in the CDC's database, this means there could be a potential outbreak and an investigation may be started, with the goal of tracking the pathogen to its source.
Whole genome sequencing has been a relatively recent shift in foodborne disease detection. For more than 20 years, the standard technique for analyzing pathogens in foodborne disease outbreaks was pulsed-field gel electrophoresis. This method creates a DNA fingerprint for each sample in the form of a pattern of about 15-30 "bands," with each band representing a piece of DNA. Researchers like Carleton can use this fingerprint to see if two samples are from the same bacteria. The problem is that 15-30 bands are not enough to differentiate all isolates. Some isolates whose bands look very similar may actually come from different sources and some whose bands look different may be from the same source. But if you can see the entire DNA fingerprint, then you don't have that issue. That's where whole genome sequencing comes in.
Although the PulseNet team had piloted whole genome sequencing as early as 2013, it wasn't until July of last year that the transition to using whole genome sequencing for all pathogens was complete. Though whole genome sequencing requires far more computing power to generate, analyze, and compare those millions of data points, the payoff is huge.
Stopping Outbreaks Sooner
The U.S. Food and Drug Administration (FDA) acquired their first whole genome sequencers in 2008, according to Dr. Eric Brown, the Director of the Division of Microbiology in the FDA's Office of Regulatory Science. Since then, through their GenomeTrakr program, a network of more than 60 domestic and international labs, the FDA has sequenced and publicly shared more than 400,000 isolates. "The impact of what whole genome sequencing could do to resolve a foodborne outbreak event was no less impactful than when NASA turned on the Hubble Telescope for the first time," says Brown.
Whole genome sequencing has helped identify strains of Salmonella that prior methods were unable to differentiate. In fact, whole genome sequencing can differentiate "virtually all" strains of foodborne pathogens, no matter the species, according to the FDA. This means it takes fewer clinical cases—fewer sick people—to detect and end an outbreak.
And perhaps the largest benefit of whole genome sequencing is that these detailed sequences—the millions of base pairs—can imply geographic location. The genomic information of bacterial strains can be different depending on the area of the country, helping these public health agencies eventually track the source of outbreaks—a restaurant, a farm, a food-processing center.
Coming Soon: "Lab in a Backpack"
Now that whole genome sequencing has become the go-to technology of choice for analyzing foodborne pathogens, the next step is making the process nimbler and more portable. Putting "the lab in a backpack," as Brown says.
The CDC's Carleton agrees. "Right now, the sequencer we use is a fairly big box that weighs about 60 pounds," she says. "We can't take it into the field."
A company called Oxford Nanopore Technologies is developing handheld sequencers. Their devices are meant to "enable the sequencing of anything by anyone anywhere," according to Dan Turner, the VP of Applications at Oxford Nanopore.
"The sooner that we can see linkages…the sooner the FDA gets in action to mitigate the problem and put in some kind of preventative control."
"Right now, sequencing is very much something that is done by people in white coats in laboratories that are set up for that purpose," says Turner. Oxford Nanopore would like to create a new, democratized paradigm.
The FDA is currently testing these types of portable sequencers. "We're very excited about it. We've done some pilots, to be able to do that sequencing in the field. To actually do it at a pond, at a river, at a canal. To do it on site right there," says Brown. "This, of course, is huge because it means we can have real-time sequencing capability to stay in step with an actual laboratory investigation in the field."
"The timeliness of this information is critical," says Marc Allard, a senior biomedical research officer and Brown's colleague at the FDA. "The sooner that we can see linkages…the sooner the FDA gets in action to mitigate the problem and put in some kind of preventative control."
At the moment, the world is rightly focused on COVID-19. But as the danger of one virus subsides, it's only a matter of time before another pathogen strikes. Hopefully, with new and advancing technology like whole genome sequencing, we can stop the next deadly outbreak before it really gets going.