How AI helped make mRNA vaccines
Story by Big Think
For most of history, artificial intelligence (AI) has been relegated almost entirely to the realm of science fiction. Then, in late 2022, it burst into reality — seemingly out of nowhere — with the popular launch of ChatGPT, the generative AI chatbot that solves tricky problems, designs rockets, has deep conversations with users, and even aces the Bar exam.
But the truth is that before ChatGPT nabbed the public’s attention, AI was already here, and it was doing more important things than writing essays for lazy college students. Case in point: It was key to saving the lives of tens of millions of people.
AI-designed mRNA vaccines
As Dave Johnson, chief data and AI officer at Moderna, told MIT Technology Review‘s In Machines We Trust podcast in 2022, AI was integral to creating the company’s highly effective mRNA vaccine against COVID. Moderna and Pfizer/BioNTech’s mRNA vaccines collectively saved between 15 and 20 million lives, according to one estimate from 2022.
Johnson described how AI was hard at work at Moderna, well before COVID arose to infect billions. The pharmaceutical company focuses on finding mRNA therapies to fight off infectious disease, treat cancer, or thwart genetic illness, among other medical applications. Messenger RNA molecules are essentially molecular instructions for cells that tell them how to create specific proteins, which do everything from fighting infection, to catalyzing reactions, to relaying cellular messages.
Johnson and his team put AI and automated robots to work making lots of different mRNAs for scientists to experiment with. Moderna quickly went from making about 30 per month to more than one thousand. They then created AI algorithms to optimize mRNA to maximize protein production in the body — more bang for the biological buck.
For Johnson and his team’s next trick, they used AI to automate science, itself. Once Moderna’s scientists have an mRNA to experiment with, they do pre-clinical tests in the lab. They then pore over reams of data to see which mRNAs could progress to the next stage: animal trials. This process is long, repetitive, and soul-sucking — ill-suited to a creative scientist but great for a mindless AI algorithm. With scientists’ input, models were made to automate this tedious process.
“We don’t think about AI in the context of replacing humans,” says Dave Johnson, chief data and AI officer at Moderna. “We always think about it in terms of this human-machine collaboration, because they’re good at different things. Humans are really good at creativity and flexibility and insight, whereas machines are really good at precision and giving the exact same result every single time and doing it at scale and speed.”
All these AI systems were in put in place over the past decade. Then COVID showed up. So when the genome sequence of the coronavirus was made public in January 2020, Moderna was off to the races pumping out and testing mRNAs that would tell cells how to manufacture the coronavirus’s spike protein so that the body’s immune system would recognize and destroy it. Within 42 days, the company had an mRNA vaccine ready to be tested in humans. It eventually went into hundreds of millions of arms.
Biotech harnesses the power of AI
Moderna is now turning its attention to other ailments that could be solved with mRNA, and the company is continuing to lean on AI. Scientists are still coming to Johnson with automation requests, which he happily obliges.
“We don’t think about AI in the context of replacing humans,” he told the Me, Myself, and AI podcast. “We always think about it in terms of this human-machine collaboration, because they’re good at different things. Humans are really good at creativity and flexibility and insight, whereas machines are really good at precision and giving the exact same result every single time and doing it at scale and speed.”
Moderna, which was founded as a “digital biotech,” is undoubtedly the poster child of AI use in mRNA vaccines. Moderna recently signed a deal with IBM to use the company’s quantum computers as well as its proprietary generative AI, MoLFormer.
Moderna’s success is encouraging other companies to follow its example. In January, BioNTech, which partnered with Pfizer to make the other highly effective mRNA vaccine against COVID, acquired the company InstaDeep for $440 million to implement its machine learning AI across its mRNA medicine platform. And in May, Chinese technology giant Baidu announced an AI tool that designs super-optimized mRNA sequences in minutes. A nearly countless number of mRNA molecules can code for the same protein, but some are more stable and result in the production of more proteins. Baidu’s AI, called “LinearDesign,” finds these mRNAs. The company licensed the tool to French pharmaceutical company Sanofi.
Writing in the journal Accounts of Chemical Research in late 2021, Sebastian M. Castillo-Hair and Georg Seelig, computer engineers who focus on synthetic biology at the University of Washington, forecast that AI machine learning models will further accelerate the biotechnology research process, putting mRNA medicine into overdrive to the benefit of all.
This article originally appeared on Big Think, home of the brightest minds and biggest ideas of all time.
Scientists make progress with growing organs for transplants
Story by Big Think
For over a century, scientists have dreamed of growing human organs sans humans. This technology could put an end to the scarcity of organs for transplants. But that’s just the tip of the iceberg. The capability to grow fully functional organs would revolutionize research. For example, scientists could observe mysterious biological processes, such as how human cells and organs develop a disease and respond (or fail to respond) to medication without involving human subjects.
Recently, a team of researchers from the University of Cambridge has laid the foundations not just for growing functional organs but functional synthetic embryos capable of developing a beating heart, gut, and brain. Their report was published in Nature.
The organoid revolution
In 1981, scientists discovered how to keep stem cells alive. This was a significant breakthrough, as stem cells have notoriously rigorous demands. Nevertheless, stem cells remained a relatively niche research area, mainly because scientists didn’t know how to convince the cells to turn into other cells.
Then, in 1987, scientists embedded isolated stem cells in a gelatinous protein mixture called Matrigel, which simulated the three-dimensional environment of animal tissue. The cells thrived, but they also did something remarkable: they created breast tissue capable of producing milk proteins. This was the first organoid — a clump of cells that behave and function like a real organ. The organoid revolution had begun, and it all started with a boob in Jello.
For the next 20 years, it was rare to find a scientist who identified as an “organoid researcher,” but there were many “stem cell researchers” who wanted to figure out how to turn stem cells into other cells. Eventually, they discovered the signals (called growth factors) that stem cells require to differentiate into other types of cells.
For a human embryo (and its organs) to develop successfully, there needs to be a “dialogue” between these three types of stem cells.
By the end of the 2000s, researchers began combining stem cells, Matrigel, and the newly characterized growth factors to create dozens of organoids, from liver organoids capable of producing the bile salts necessary for digesting fat to brain organoids with components that resemble eyes, the spinal cord, and arguably, the beginnings of sentience.
Synthetic embryos
Organoids possess an intrinsic flaw: they are organ-like. They share some characteristics with real organs, making them powerful tools for research. However, no one has found a way to create an organoid with all the characteristics and functions of a real organ. But Magdalena Żernicka-Goetz, a developmental biologist, might have set the foundation for that discovery.
Żernicka-Goetz hypothesized that organoids fail to develop into fully functional organs because organs develop as a collective. Organoid research often uses embryonic stem cells, which are the cells from which the developing organism is created. However, there are two other types of stem cells in an early embryo: stem cells that become the placenta and those that become the yolk sac (where the embryo grows and gets its nutrients in early development). For a human embryo (and its organs) to develop successfully, there needs to be a “dialogue” between these three types of stem cells. In other words, Żernicka-Goetz suspected the best way to grow a functional organoid was to produce a synthetic embryoid.
As described in the aforementioned Nature paper, Żernicka-Goetz and her team mimicked the embryonic environment by mixing these three types of stem cells from mice. Amazingly, the stem cells self-organized into structures and progressed through the successive developmental stages until they had beating hearts and the foundations of the brain.
“Our mouse embryo model not only develops a brain, but also a beating heart [and] all the components that go on to make up the body,” said Żernicka-Goetz. “It’s just unbelievable that we’ve got this far. This has been the dream of our community for years and major focus of our work for a decade and finally we’ve done it.”
If the methods developed by Żernicka-Goetz’s team are successful with human stem cells, scientists someday could use them to guide the development of synthetic organs for patients awaiting transplants. It also opens the door to studying how embryos develop during pregnancy.
Scientists find enzymes in nature that could replace toxic chemicals
Some 900 miles off the coast of Portugal, nine major islands rise from the mid-Atlantic. Verdant and volcanic, the Azores archipelago hosts a wealth of biodiversity that keeps field research scientist, Marlon Clark, returning for more. “You’ve got this really interesting biogeography out there,” says Clark. “There’s real separation between the continents, but there’s this inter-island dispersal of plants and seeds and animals.”
It’s a visual paradise by any standard, but on a microscopic level, there’s even more to see. The Azores’ nutrient-rich volcanic rock — and its network of lagoons, cave systems, and thermal springs — is home to a vast array of microorganisms found in a variety of microclimates with different elevations and temperatures.
Clark works for Basecamp Research, a biotech company headquartered in London, and his job is to collect samples from ecosystems around the world. By extracting DNA from soil, water, plants, microbes and other organisms, Basecamp is building an extensive database of the Earth’s proteins. While DNA itself isn’t a protein, the information stored in DNA is used to create proteins, so extracting, sequencing, and annotating DNA allows for the discovery of unique protein sequences.
Using what they’re finding in the middle of the Atlantic and beyond, Basecamp’s detailed database is constantly growing. The outputs could be essential for cleaning up the damage done by toxic chemicals and finding alternatives to these chemicals.
Catalysts for change
Proteins provide structure and function in all living organisms. Some of these functional proteins are enzymes, which quite literally make things happen.
“Industrial chemistry is heavily polluting, especially the chemistry done in pharmaceutical drug development. Biocatalysis is providing advantages, both to make more complex drugs and to be more sustainable, reducing the pollution and toxicity of conventional chemistry," says Ahir Pushpanath, who heads partnerships for Basecamp.
“Enzymes are perfectly evolved catalysts,” says Ahir Pushpanath, a partnerships lead at Basecamp. ”Enzymes are essentially just a polymer, and polymers are made up of amino acids, which are nature’s building blocks.” He suggests thinking about it like Legos — if you have a bunch of Lego pieces and use them to build a structure that performs a function, “that’s basically how an enzyme works. In nature, these monuments have evolved to do life’s chemistry. If we didn’t have enzymes, we wouldn’t be alive.”
In our own bodies, enzymes catalyze everything from vision to digesting food to regrowing muscles, and these same types of enzymes are necessary in the pharmaceutical, agrochemical and fine chemical industries. But industrial conditions differ from those inside our bodies. So, when scientists need certain chemical reactions to create a particular product or substance, they make their own catalysts in their labs — generally through the use of petroleum and heavy metals.
These petrochemicals are effective and cost-efficient, but they’re wasteful and often hazardous. With growing concerns around sustainability and long-term public health, it's essential to find alternative solutions to toxic chemicals. “Industrial chemistry is heavily polluting, especially the chemistry done in pharmaceutical drug development,” Pushpanath says.
Basecamp is trying to replace lab-created catalysts with enzymes found in the wild. This concept is called biocatalysis, and in theory, all scientists have to do is find the right enzymes for their specific need. Yet, historically, researchers have struggled to find enzymes to replace petrochemicals. When they can’t identify a suitable match, they turn to what Pushpanath describes as “long, iterative, resource-intensive, directed evolution” in the laboratory to coax a protein into industrial adaptation. But the latest scientific advances have enabled these discoveries in nature instead.
Marlon Clark, a research scientist at Basecamp Research, looks for novel biochemistries in the Azores.
Glen Gowers
Enzyme hunters
Whether it’s Clark and a colleague setting off on an expedition, or a local, on-the-ground partner gathering and processing samples, there’s a lot to be learned from each collection. “Microbial genomes contain complete sets of information that define an organism — much like how letters are a code allowing us to form words, sentences, pages, and books that contain complex but digestible knowledge,” Clark says. He thinks of the environmental samples as biological libraries, filled with thousands of species, strains, and sequence variants. “It’s our job to glean genetic information from these samples.”
“We can actually dream up new proteins using generative AI," Pushpanath says.
Basecamp researchers manage this feat by sequencing the DNA and then assembling the information into a comprehensible structure. “We’re building the ‘stories’ of the biota,” Clark says. The more varied the samples, the more valuable insights his team gains into the characteristics of different organisms and their interactions with the environment. Sequencing allows scientists to examine the order of nucleotides — the organic molecules that form DNA — to identify genetic makeups and find changes within genomes. The process used to be too expensive, but the cost of sequencing has dropped from $10,000 a decade ago to as low as $100. Notably, biocatalysis isn’t a new concept — there have been waves of interest in using natural enzymes in catalysis for over a century, Pushpanath says. “But the technology just wasn’t there to make it cost effective,” he explains. “Sequencing has been the biggest boon.”
AI is probably the second biggest boon.
“We can actually dream up new proteins using generative AI,” Pushpanath says, which means that biocataylsis now has real potential to scale.
Glen Gowers, the co-founder of Basecamp, compares the company’s AI approach to that of social networks and streaming services. Consider how these platforms suggest connecting with the friends of your friends, or how watching one comedy film from the 1990s leads to a suggestion of three more.
“They’re thinking about data as networks of relationships as opposed to lists of items,” says Gowers. “By doing the same, we’re able to link the metadata of the proteins — by their relationships to each other, the environments in which they’re found, the way those proteins might look similar in sequence and structure, their surrounding genome context — really, this just comes down to creating a searchable network of proteins.”
On an Azores island, this volcanic opening may harbor organisms that can help scientists identify enzymes for biocatalysis to replace toxic chemicals.
Emma Bolton
Uwe Bornscheuer, professor at the Institute of Biochemistry at the University of Greifswald, and co-founder of Enzymicals, another biocatalysis company, says that the development of machine learning is a critical component of this work. “It’s a very hot topic, because the challenge in protein engineering is to predict which mutation at which position in the protein will make an enzyme suitable for certain applications,” Bornscheuer explains. These predictions are difficult for humans to make at all, let alone quickly. “It is clear that machine learning is a key technology.”
Benefiting from nature’s bounty
Biodiversity commonly refers to plants and animals, but the term extends to all life, including microbial life, and some regions of the world are more biodiverse than others. Building relationships with global partners is another key element to Basecamp’s success. Doing so in accordance with the access and benefit sharing principles set forth by the Nagoya Protocol — an international agreement that seeks to ensure the benefits of using genetic resources are distributed in a fair and equitable way — is part of the company's ethos. “There's a lot of potential for us, and there’s a lot of potential for our partners to have exactly the same impact in building and discovering commercially relevant proteins and biochemistries from nature,” Clark says.
Bornscheuer points out that Basecamp is not the first company of its kind. A former San Diego company called Diversa went public in 2000 with similar work. “At that time, the Nagoya Protocol was not around, but Diversa also wanted to ensure that if a certain enzyme or microorganism from Costa Rica, for example, were used in an industrial process, then people in Costa Rica would somehow profit from this.”
An eventual merger turned Diversa into Verenium Corporation, which is now a part of the chemical producer BASF, but it laid important groundwork for modern companies like Basecamp to continue to scale with today’s technologies.
“To collect natural diversity is the key to identifying new catalysts for use in new applications,” Bornscheuer says. “Natural diversity is immense, and over the past 20 years we have gained the advantages that sequencing is no longer a cost or time factor.”
This has allowed Basecamp to rapidly grow its database, outperforming Universal Protein Resource or UniProt, which is the public repository of protein sequences most commonly used by researchers. Basecamp’s database is three times larger, totaling about 900 million sequences. (UniProt isn’t compliant with the Nagoya Protocol, because, as a public database, it doesn’t provide traceability of protein sequences. Some scientists, however, argue that Nagoya compliance hinders progress.)
“Eventually, this work will reduce chemical processes. We’ll have cleaner processes, more sustainable processes," says Uwe Bornscheuer, a professor at the University of Greifswald.
With so much information available, Basecamp’s AI has been trained on “the true dictionary of protein sequence life,” Pushpanath says, which makes it possible to design sequences for particular applications. “Through deep learning approaches, we’re able to find protein sequences directly from our database, without the need for further laboratory-directed evolution.”
Recently, a major chemical company was searching for a specific transaminase — an enzyme that catalyzes a transfer of amino groups. “They had already spent a year-and-a-half and nearly two million dollars to evolve a public-database enzyme, and still had not reached their goal,” Pushpanath says. “We used our AI approaches on our novel database to yield 10 candidates within a week, which, when validated by the client, achieved the desired target even better than their best-evolved candidate.”
Basecamp’s other huge potential is in bioremediation, where natural enzymes can help to undo the damage caused by toxic chemicals. “Biocatalysis impacts both sides,” says Gowers. “It reduces the usage of chemicals to make products, and at the same time, where contamination sites do exist from chemical spills, enzymes are also there to kind of mop those up.”
So far, Basecamp's round-the-world sampling has covered 50 percent of the 14 major biomes, or regions of the planet that can be distinguished by their flora, fauna, and climate, as defined by the World Wildlife Fund. The other half remains to be catalogued — a key milestone for understanding our planet’s protein diversity, Pushpanath notes.
There’s still a long road ahead to fully replace petrochemicals with natural enzymes, but biocatalysis is on an upward trajectory. "Eventually, this work will reduce chemical processes,” Bornscheuer says. “We’ll have cleaner processes, more sustainable processes.”