COVID Variants Are Like “a Thief Changing Clothes” – and Our Camera System Barely Exists
Whether it's "natural selection" as Darwin called it, or it's "mutating" as the X-Men called it, living organisms change over time, developing thumbs or more efficient protein spikes, depending on the organism and the demands of its environment. The coronavirus that causes COVID-19, SARS-CoV-2, is not an exception, and now, after the virus has infected millions of people around the globe for more than a year, scientists are beginning to see those changes.
The notorious variants that have popped up include B.1.1.7, sometimes called the UK variant, as well as P.1 and B.1.351, which seem to have emerged in Brazil and South Africa respectively. As vaccinations are picking up pace, officials are warning that now
is not the time to become complacent or relax restrictions because the variants aren't well understood.
Some appear to be more transmissible, and deadlier, while others can evade the immune system's defenses better than earlier versions of the virus, potentially undermining the effectiveness of vaccines to some degree. Genomic surveillance, the process of sequencing the genetic code of the virus widely to observe changes and patterns, is a critical way that scientists can keep track of its evolution and work to understand how the variants might affect humans.
"It's like a thief changing clothes"
It's important to note that viruses mutate all the time. If there were funding and personnel to sequence the genome of every sample of the virus, scientists would see thousands of mutations. Not every variant deserves our attention. The vast majority of mutations are not important at all, but recognizing those that are is a crucial tool in getting and staying ahead of the virus. The work of sequencing, analyzing, observing patterns, and using public health tools as necessary is complicated and confusing to those without years of specialized training.
Jeremy Kamil, associate professor of microbiology and immunology at LSU Health Shreveport, in Louisiana, says that the variants developing are like a thief changing clothes. The thief goes in your house, steals your stuff, then leaves and puts on a different shirt and a wig, in the hopes you won't recognize them. Genomic surveillance catches the "thief" even in those different clothes.
One of the tricky things about variants is recognizing the point at which they move from interesting, to concerning at a local level, to dangerous in a larger context.
Understanding variants, both the uninteresting ones and the potentially concerning ones, gives public health officials and researchers at different levels a useful set of tools. Locally, knowing which variants are circulating in the community helps leaders know whether mask mandates and similar measures should be implemented or discontinued, or whether businesses and schools can open relatively safely.
There's more to it than observing new variants
Analysis is complex, particularly when it comes to understanding which variants are of concern. "So the question is always if a mutation becomes common, is that a random occurrence?" says Phoebe Lostroh, associate professor of molecular biology at Colorado College. "Or is the variant the result of some kind of selection because the mutation changes some property about the virus that makes it reproduce more quickly than variants of the virus that don't have that mutation? For a virus, [mutations can affect outcomes like] how much it replicates inside a person's body, how much somebody breathes it out, whether the particles that somebody might breathe in get smaller and can lead to greater transmission."
Along with all of those factors, accurate and useful genomic surveillance requires an understanding of where variants are occurring, how they are related, and an examination of why they might be prevalent.
For example, if a potentially worrisome variant appears in a community and begins to spread very quickly, it's not time to raise a public health alarm until several important questions have been answered, such as whether the variant is spreading due to specific events, or if it's happening because the mutation has allowed the virus to infect people more efficiently. Kamil offered a hypothetical scenario to explain: Imagine that a member of a community became infected and the virus mutated. That person went to church and three more people were infected, but one of them went to a karaoke bar and while singing infected 100 other people. Examining the conditions under which the virus has spread is, therefore, an essential part of untangling whether a mutation itself made the virus more transmissible or if an infected person's behaviors contributed to a local outbreak.
One of the tricky things about variants is recognizing the point at which they move from interesting, to concerning at a local level, to dangerous in a larger context. Genomic sequencing can help with that, but only when it's coordinated. When the same mutation occurs frequently, but is localized to one region, it's a concern, but when the same mutation happens in different places at the same time, it's much more likely that the "virus is learning that's a good mutation," explains Kamil.
The process is called convergent evolution, and it was a fascinating topic long before COVID. Just as your heritage can be traced through DNA, so can that of viruses, and when separate lineages develop similar traits it's almost like scientists can see evolution happening in real time. A mutation to SARS-CoV-2 that happens in more than one place at once is a mutation that makes it easier in some way for the virus to survive and that is when it may become alarming. The widespread, documented variants P.1 and B.1.351 are examples of convergence because they share some of the same virulent mutations despite having developed thousands of miles apart.
However, even variants that are emerging in different places at the same time don't present the kind of threat SARS-CoV-2 did in 2019. "This is nature," says Kamil. "It just means that this virus will not easily be driven to extinction or complete elimination by vaccines." Although a person who has already had COVID-19 can be reinfected with a variant, "it is almost always much milder disease" than the original infection, Kamil adds. Rather than causing full-fledged disease, variants have the potiental to "penetrate herd immunity, spreading relatively quietly among people who have developed natural immunity or been vaccinated, until the virus finds someone who has no immunity yet, and that person would be at risk of hospitalization-grade severe disease or death."
Surveillance and predictions
According to Lostroh, genomic surveillance can help scientists predict what's going to happen. "With the British strain, for instance, that's more transmissible, you can measure how fast it's doubling in the population and you can sort of tell whether we should take more measures against this mutation. Should we shut things down a little longer because that mutation is present in the population? That could be really useful if you did enough sampling in the population that you knew where it was," says Lostroh. If, for example, the more transmissible strain was present in 50 percent of cases, but in another county or state it was barely present, it would allow for rolling lockdowns instead of sweeping measures.
Variants are also extremely important when it comes to the development, manufacture, and distribution of vaccines. "You're also looking at medical countermeasures, such as whether your vaccine is still effective, or if your antiviral needs to be updated," says Lane Warmbrod, a senior analyst and research associate at Johns Hopkins Center for Health Security.
Properly funded and extensive genomic surveillance could eventually help control endemic diseases, too, like the seasonal flu, or other common respiratory infections. Kamil says he envisions a future in which genomic surveillance allows for prediction of sickness just as the weather is predicted today. "It's a 51 for infection today at the San Francisco Airport. There's been detection of some respiratory viruses," he says, offering an example. He says that if you're a vulnerable person, if you're immune-suppressed for some reason, you may want to wear a mask based on the sickness report.
The U.S. has the ability, but lacks standards
The benefits of widespread genomic surveillance are clear, and the United States certainly has the necessary technology, equipment, and personnel to carry it out. But, it's not happening at the speed and extent it needs to for the country to gain the benefits.
"The numbers are improving," said Kamil. "We're probably still at less than half a percent of all the samples that have been taken have been sequenced since the beginning of the pandemic."
Although there's no consensus on how many sequences is ideal for a robust surveillance program, modeling performed by the company Illumina suggests about 5 percent of positive tests should be sequenced. The reasons the U.S. has lagged in implementing a sequencing program are complex and varied, but solvable.
Perhaps the most important element that is currently missing is leadership. In order to conduct an effective genomic surveillance program, there need to be standards. The Johns Hopkins Center for Health Security recently published a paper with recommendations as to what kinds of elements need to be standardized in order to make the best use of sequencing technology and analysis.
"Along with which bioinformatic pipelines you're going to use to do the analyses, which sequencing strategy protocol are you going to use, what's your sampling strategy going to be, how is the data is going to be reported, what data gets reported," says Warmbrod. Currently, there's no guidance from the CDC on any of those things. So, while scientists can collect and report information, they may be collecting and reporting different information that isn't comparable, making it less useful for public health measures and vaccine updates.
Globally, one of the most important tools in making the information from genomic surveillance useful is GISAID, a platform designed for scientists to share -- and, importantly, to be credited for -- their data regarding genetic sequences of influenza. Originally, it was launched as a database of bird flu sequences, but has evolved to become an essential tool used by the WHO to make flu vaccine virus recommendations each year. Scientists who share their credentials have free access to the database, and anyone who uses information from the database must credit the scientist who uploaded that information.
Safety, logistics, and funding matter
Scientists at university labs and other small organizations have been uploading sequences to GISAID almost from the beginning of the pandemic, but their funding is generally limited, and there are no standards regarding information collection or reporting. Private, for-profit labs haven't had motivation to set up sequencing programs, although many of them have the logistical capabilities and funding to do so. Public health departments are understaffed, underfunded, and overwhelmed.
University labs may also be limited by safety concerns. The SARS-CoV-2 virus is dangerous, and there's a question of how samples should be transported to labs for sequencing.
Larger, for-profit organizations often have the tools and distribution capabilities to safely collect and sequence samples, but there hasn't been a profit motive. Genomic sequencing is less expensive now than ever before, but even at $100 per sample, the cost adds up -- not to mention the cost of employing a scientist with the proper credentials to analyze the sequence.
The path forward
The recently passed COVID-19 relief bill does have some funding to address genomic sequencing. Specifically, the American Rescue Plan Act includes $1.75 billion in funding for the Centers for Disease Control and Prevention's Advanced Molecular Detection (AMD) program. In an interview last month, CDC Director Rochelle Walensky said that the additional funding will be "a dial. And we're going to need to dial it up." AMD has already announced a collaboration called the Sequencing for Public Health Emergency Response, Epidemiology, and Surveillance (SPHERES) Initiative that will bring together scientists from public health, academic, clinical, and non-profit laboratories across the country with the goal of accelerating sequencing.
Such a collaboration is a step toward following the recommendations in the paper Warmbrod coauthored. Building capacity now, creating a network of labs, and standardizing procedures will mean improved health in the future. "I want to be optimistic," she says. "The good news is there are a lot of passionate, smart, capable people who are continuing to work with government and work with different stakeholders." She cautions, however, that without a national strategy we won't succeed.
"If we maximize the potential and create that framework now, we can also use it for endemic diseases," she says. "It's a very helpful system for more than COVID if we're smart in how we plan it."
[Editor's Note: This is the fifth episode in our Moonshot series, which explores cutting-edge scientific developments that stand to fundamentally transform our world.]
Kira Peikoff was the editor-in-chief of Leaps.org from 2017 to 2021. As a journalist, her work has appeared in The New York Times, Newsweek, Nautilus, Popular Mechanics, The New York Academy of Sciences, and other outlets. She is also the author of four suspense novels that explore controversial issues arising from scientific innovation: Living Proof, No Time to Die, Die Again Tomorrow, and Mother Knows Best. Peikoff holds a B.A. in Journalism from New York University and an M.S. in Bioethics from Columbia University. She lives in New Jersey with her husband and two young sons. Follow her on Twitter @KiraPeikoff.
With the pandemic at the forefront of everyone's minds, many people have wondered if food could be a source of coronavirus transmission. Luckily, that "seems unlikely," according to the CDC, but foodborne illnesses do still sicken a whopping 48 million people per year.
Whole genome sequencing is like "going from an eight-bit image—maybe like what you would see in Minecraft—to a high definition image."
In normal times, when there isn't a historic global health crisis infecting millions and affecting the lives of billions, foodborne outbreaks are real and frightening, potentially deadly, and can cause widespread fear of particular foods. Think of Romaine lettuce spreading E. coli last year— an outbreak that infected more than 500 people and killed eight—or peanut butter spreading salmonella in 2008, which infected 167 people.
The technologies available to detect and prevent the next foodborne disease outbreak have improved greatly over the past 30-plus years, particularly during the past decade, and better, more nimble technologies are being developed, according to experts in government, academia, and private industry. The key to advancing detection of harmful foodborne pathogens, they say, is increasing speed and portability of detection, and the precision of that detection.
Getting to Rapid Results
Researchers at Purdue University have recently developed a lateral flow assay that, with the help of a laser, can detect toxins and pathogenic E. coli. Lateral flow assays are cheap and easy to use; a good example is a home pregnancy test. You place a liquid or liquefied sample on a piece of paper designed to detect a single substance and soon after you get the results in the form of a colored line: yes or no.
"They're a great portable tool for us for food contaminant detection," says Carmen Gondhalekar, a fifth-year biomedical engineering graduate student at Purdue. "But one of the areas where paper-based lateral flow assays could use improvement is in multiplexing capability and their sensitivity."
J. Paul Robinson, a professor in Purdue's Colleges of Veterinary Medicine and Engineering, and Gondhalekar's advisor, agrees. "One of the fundamental problems that we have in detection is that it is hard to identify pathogens in complex samples," he says.
When it comes to foodborne disease outbreaks, you don't always know what substance you're looking for, so an assay made to detect only a single substance isn't always effective. The goal of the project at Purdue is to make assays that can detect multiple substances at once.
These assays would be more complex than a pregnancy test. As detailed in Gondhalekar's recent paper, a laser pulse helps create a spectral signal from the sample on the assay paper, and the spectral signal is then used to determine if any unique wavelengths associated with one of several toxins or pathogens are present in the sample. Though the handheld technology has yet to be built, the idea is that the results would be given on the spot. So someone in the field trying to track the source of a Salmonella infection could, for instance, put a suspected lettuce sample on the assay and see if it has the pathogen on it.
"What our technology is designed to do is to give you a rapid assessment of the sample," says Robinson. "The goal here is speed."
Seeing the Pathogen in "High-Def"
"One in six Americans will get a foodborne illness every year," according to Dr. Heather Carleton, a microbiologist at the Centers for Disease Control and Prevention's Enteric Diseases Laboratory Branch. But not every foodborne outbreak makes the news. In 2017 alone, the CDC monitored between 18 and 37 foodborne poison clusters per week and investigated 200 multi-state clusters. Hardboiled eggs, ground beef, chopped salad kits, raw oysters, frozen tuna, and pre-cut melon are just a taste of the foods that were investigated last year for different strains of listeria, salmonella, and E. coli.
At the heart of the CDC investigations is PulseNet, a national network of laboratories that uses DNA fingerprinting to detect outbreaks at local and regional levels. This is how it works: When a patient gets sick—with symptoms like vomiting and fever, for instance—they will go to a hospital or clinic for treatment. Since we're talking about foodborne illnesses, a clinician will likely take a stool sample from the patient and send it off to a laboratory to see if there is a foodborne pathogen, like salmonella, E. Coli, or another one. If it does contain a potentially harmful pathogen, then a bacterial isolate of that identified sample is sent to a regional public health lab so that whole genome sequencing can be performed.
Whole genome sequencing can differentiate "virtually all" strains of foodborne pathogens, no matter the species, according to the FDA.
Whole genome sequencing is a method for reading the entire genome of a bacterial isolate (or from any organism, for that matter). Instead of working with a couple dozen data points, now you're working with millions of base pairs. Carleton likes to describe it as "going from an eight-bit image—maybe like what you would see in Minecraft—to a high definition image," she says. "It's really an evolution of how we detect foodborne illnesses and identify outbreaks."
If the bacterial isolate matches another in the CDC's database, this means there could be a potential outbreak and an investigation may be started, with the goal of tracking the pathogen to its source.
Whole genome sequencing has been a relatively recent shift in foodborne disease detection. For more than 20 years, the standard technique for analyzing pathogens in foodborne disease outbreaks was pulsed-field gel electrophoresis. This method creates a DNA fingerprint for each sample in the form of a pattern of about 15-30 "bands," with each band representing a piece of DNA. Researchers like Carleton can use this fingerprint to see if two samples are from the same bacteria. The problem is that 15-30 bands are not enough to differentiate all isolates. Some isolates whose bands look very similar may actually come from different sources and some whose bands look different may be from the same source. But if you can see the entire DNA fingerprint, then you don't have that issue. That's where whole genome sequencing comes in.
Although the PulseNet team had piloted whole genome sequencing as early as 2013, it wasn't until July of last year that the transition to using whole genome sequencing for all pathogens was complete. Though whole genome sequencing requires far more computing power to generate, analyze, and compare those millions of data points, the payoff is huge.
Stopping Outbreaks Sooner
The U.S. Food and Drug Administration (FDA) acquired their first whole genome sequencers in 2008, according to Dr. Eric Brown, the Director of the Division of Microbiology in the FDA's Office of Regulatory Science. Since then, through their GenomeTrakr program, a network of more than 60 domestic and international labs, the FDA has sequenced and publicly shared more than 400,000 isolates. "The impact of what whole genome sequencing could do to resolve a foodborne outbreak event was no less impactful than when NASA turned on the Hubble Telescope for the first time," says Brown.
Whole genome sequencing has helped identify strains of Salmonella that prior methods were unable to differentiate. In fact, whole genome sequencing can differentiate "virtually all" strains of foodborne pathogens, no matter the species, according to the FDA. This means it takes fewer clinical cases—fewer sick people—to detect and end an outbreak.
And perhaps the largest benefit of whole genome sequencing is that these detailed sequences—the millions of base pairs—can imply geographic location. The genomic information of bacterial strains can be different depending on the area of the country, helping these public health agencies eventually track the source of outbreaks—a restaurant, a farm, a food-processing center.
Coming Soon: "Lab in a Backpack"
Now that whole genome sequencing has become the go-to technology of choice for analyzing foodborne pathogens, the next step is making the process nimbler and more portable. Putting "the lab in a backpack," as Brown says.
The CDC's Carleton agrees. "Right now, the sequencer we use is a fairly big box that weighs about 60 pounds," she says. "We can't take it into the field."
A company called Oxford Nanopore Technologies is developing handheld sequencers. Their devices are meant to "enable the sequencing of anything by anyone anywhere," according to Dan Turner, the VP of Applications at Oxford Nanopore.
"The sooner that we can see linkages…the sooner the FDA gets in action to mitigate the problem and put in some kind of preventative control."
"Right now, sequencing is very much something that is done by people in white coats in laboratories that are set up for that purpose," says Turner. Oxford Nanopore would like to create a new, democratized paradigm.
The FDA is currently testing these types of portable sequencers. "We're very excited about it. We've done some pilots, to be able to do that sequencing in the field. To actually do it at a pond, at a river, at a canal. To do it on site right there," says Brown. "This, of course, is huge because it means we can have real-time sequencing capability to stay in step with an actual laboratory investigation in the field."
"The timeliness of this information is critical," says Marc Allard, a senior biomedical research officer and Brown's colleague at the FDA. "The sooner that we can see linkages…the sooner the FDA gets in action to mitigate the problem and put in some kind of preventative control."
At the moment, the world is rightly focused on COVID-19. But as the danger of one virus subsides, it's only a matter of time before another pathogen strikes. Hopefully, with new and advancing technology like whole genome sequencing, we can stop the next deadly outbreak before it really gets going.