Scientists Are Building an “AccuWeather” for Germs to Predict Your Risk of Getting the Flu
Applied mathematician Sara del Valle works at the U.S.'s foremost nuclear weapons lab: Los Alamos. Once colloquially called Atomic City, it's a hidden place 45 minutes into the mountains northwest of Santa Fe. Here, engineers developed the first atomic bomb.
Like AccuWeather, an app for disease prediction could help people alter their behavior to live better lives.
Today, Los Alamos still a small science town, though no longer a secret, nor in the business of building new bombs. Instead, it's tasked with, among other things, keeping the stockpile of nuclear weapons safe and stable: not exploding when they're not supposed to (yes, please) and exploding if someone presses that red button (please, no).
Del Valle, though, doesn't work on any of that. Los Alamos is also interested in other kinds of booms—like the explosion of a contagious disease that could take down a city. Predicting (and, ideally, preventing) such epidemics is del Valle's passion. She hopes to develop an app that's like AccuWeather for germs: It would tell you your chance of getting the flu, or dengue or Zika, in your city on a given day. And like AccuWeather, it could help people alter their behavior to live better lives, whether that means staying home on a snowy morning or washing their hands on a sickness-heavy commute.
Sara del Valle of Los Alamos is working to predict and prevent epidemics using data and machine learning.
Since the beginning of del Valle's career, she's been driven by one thing: using data and predictions to help people behave practically around pathogens. As a kid, she'd always been good at math, but when she found out she could use it to capture the tentacular spread of disease, and not just manipulate abstractions, she was hooked.
When she made her way to Los Alamos, she started looking at what people were doing during outbreaks. Using social media like Twitter, Google search data, and Wikipedia, the team started to sift for trends. Were people talking about hygiene, like hand-washing? Or about being sick? Were they Googling information about mosquitoes? Searching Wikipedia for symptoms? And how did those things correlate with the spread of disease?
It was a new, faster way to think about how pathogens propagate in the real world. Usually, there's a 10- to 14-day lag in the U.S. between when doctors tap numbers into spreadsheets and when that information becomes public. By then, the world has moved on, and so has the disease—to other villages, other victims.
"We found there was a correlation between actual flu incidents in a community and the number of searches online and the number of tweets online," says del Valle. That was when she first let herself dream about a real-time forecast, not a 10-days-later backcast. Del Valle's group—computer scientists, mathematicians, statisticians, economists, public health professionals, epidemiologists, satellite analysis experts—has continued to work on the problem ever since their first Twitter parsing, in 2011.
They've had their share of outbreaks to track. Looking back at the 2009 swine flu pandemic, they saw people buying face masks and paying attention to the cleanliness of their hands. "People were talking about whether or not they needed to cancel their vacation," she says, and also whether pork products—which have nothing to do with swine flu—were safe to buy.
At the latest meeting with all the prediction groups, del Valle's flu models took first and second place.
They watched internet conversations during the measles outbreak in California. "There's a lot of online discussion about anti-vax sentiment, and people trying to convince people to vaccinate children and vice versa," she says.
Today, they work on predicting the spread of Zika, Chikungunya, and dengue fever, as well as the plain old flu. And according to the CDC, that latter effort is going well.
Since 2015, the CDC has run the Epidemic Prediction Initiative, a competition in which teams like de Valle's submit weekly predictions of how raging the flu will be in particular locations, along with other ailments occasionally. Michael Johannson is co-founder and leader of the program, which began with the Dengue Forecasting Project. Its goal, he says, was to predict when dengue cases would blow up, when previously an area just had a low-level baseline of sick people. "You'll get this massive epidemic where all of a sudden, instead of 3,000 to 4,000 cases, you have 20,000 cases," he says. "They kind of come out of nowhere."
But the "kind of" is key: The outbreaks surely come out of somewhere and, if scientists applied research and data the right way, they could forecast the upswing and perhaps dodge a bomb before it hit big-time. Questions about how big, when, and where are also key to the flu.
A big part of these projects is the CDC giving the right researchers access to the right information, and the structure to both forecast useful public-health outcomes and to compare how well the models are doing. The extra information has been great for the Los Alamos effort. "We don't have to call departments and beg for data," says del Valle.
When data isn't available, "proxies"—things like symptom searches, tweets about empty offices, satellite images showing a green, wet, mosquito-friendly landscape—are helpful: You don't have to rely on anyone's health department.
At the latest meeting with all the prediction groups, del Valle's flu models took first and second place. But del Valle wants more than weekly numbers on a government website; she wants that weather-app-inspired fortune-teller, incorporating the many diseases you could get today, standing right where you are. "That's our dream," she says.
This plot shows the the correlations between the online data stream, from Wikipedia, and various infectious diseases in different countries. The results of del Valle's predictive models are shown in brown, while the actual number of cases or illness rates are shown in blue.
(Courtesy del Valle)
The goal isn't to turn you into a germophobic agoraphobe. It's to make you more aware when you do go out. "If you know it's going to rain today, you're more likely to bring an umbrella," del Valle says. "When you go on vacation, you always look at the weather and make sure you bring the appropriate clothing. If you do the same thing for diseases, you think, 'There's Zika spreading in Sao Paulo, so maybe I should bring even more mosquito repellent and bring more long sleeves and pants.'"
They're not there yet (don't hold your breath, but do stop touching your mouth). She estimates it's at least a decade away, but advances in machine learning could accelerate that hypothetical timeline. "We're doing baby steps," says del Valle, starting with the flu in the U.S., dengue in Brazil, and other efforts in Colombia, Ecuador, and Canada. "Going from there to forecasting all diseases around the globe is a long way," she says.
But even AccuWeather started small: One man began predicting weather for a utility company, then helping ski resorts optimize their snowmaking. His influence snowballed, and now private forecasting apps, including AccuWeather's, populate phones across the planet. The company's progression hasn't been without controversy—privacy incursions, inaccuracy of long-term forecasts, fights with the government—but it has continued, for better and for worse.
Disease apps, perhaps spun out of a small, unlikely team at a nuclear-weapons lab, could grow and breed in a similar way. And both the controversies and public-health benefits that may someday spin out of them lie in the future, impossible to predict with certainty.
Podcast: The Friday Five Weekly Roundup in Health Research
The Friday Five covers five stories in research that you may have missed this week. There are plenty of controversies and troubling ethical issues in science – and we get into many of them in our online magazine – but this news roundup focuses on scientific creativity and progress to give you a therapeutic dose of inspiration headed into the weekend.
Here are the promising studies covered in this week's Friday Five:
- A new mask can detect Covid and send an alert to your phone
- More promising research for a breakthrough drug to treat schizophrenia
- AI tool can create new proteins
- Connections between an unhealthy gut and breast cancer
- Progress on the longevity drug, rapamycin
And an honorable mention this week: Certain exercises may benefit some types of memory more than others
Life is Emerging: Review of Siddhartha Mukherjee’s Song of the Cell
The DNA double helix is often the image spiraling at the center of 21st century advances in biomedicine and the growing bioeconomy. And yet, DNA is molecularly inert. DNA, the code for genes, is not alive and is not strictly necessary for life. Ought life be at the center of our communication of living systems? Is not the Cell a superior symbol of life and our manipulation of living systems?
A code for life isn’t a code without the life that instantiates it. A code for life must be translated. The cell is the basic unit of that translation. The cell is the minimal viable package of life as we know it. Therefore, cell biology is at the center of biomedicine’s greatest transformations, suggests Pulitzer-winning physician-scientist Siddhartha Mukherjee in his latest book, The Song of the Cell: The Exploration of Medicine and the New Human.
The Song of the Cell begins with the discovery of cells and of germ theory, featuring characters such as Louis Pasteur and Robert Koch, who brought the cell “into intimate contact with pathology and medicine.” This intercourse would transform biomedicine, leading to the insight that we can treat disease by thinking at the cellular level. The slightest rearrangement of sick cells might be the path toward alleviating suffering for the organism: eroding the cell walls of a bacterium while sparing our human cells; inventing a medium that coaxes sperm and egg to dance into cellular union for in vitro fertilization (IVF); designing molecular missiles that home to the receptors decorating the exterior of cancer cells; teaching adult skin cells to remember their embryonic state for regenerative medicines.
Mukherjee uses the bulk of the book to elucidate key cell types in the human body, along with their “connective relationships” that enable key organs and organ systems to function. This includes the immune system, the heart, the brain, and so on. Mukherjee’s distinctive style features compelling anecdotes and human stories that animate the scientific (and unscientific) processes that have led to our current state of understanding. In his chapter on neurons and the brain, for example, he integrates Santiago Ramon y Cajal’s meticulous black ink sketches of neurons into Mukherjee’s own personal encounter with clinical depression. In one lucid section, he interviews Dr. Helen Mayberg, a pioneering neurologist who takes seriously the descriptive power of her patients’ metaphors, as they suffer from “caves,” “holes,” “voids,” and “force fields” that render their lives gray. Dr. Mayberg aims to stimulate patients’ neuronal cells in a manner that brings back the color.
Beyond exposing the insight and inventiveness that has arisen out of cell-based thinking, it seems that Mukherjee’s bigger project is an epistemological one. The early chapters of The Song of the Cell continually hint at the potential for redefining the basic unit of biology as the cell rather than the gene. The choice to center biomedicine around cells is, above all, a conspicuous choice not to center it around genes (the subject of Mukherjee’s previous book, The Gene), because genes dominate popular science communication.
This choice of cells over genes is most welcome. Cells are alive. Genes are not. Letters—such as the As, Cs, Gs, and Ts that represent the nucleotides of DNA, which make up our genes—must be synthesized into a word or poem or song that offers a glimpse into deeper truths. A key idea embedded in this thinking is that of emergence. Whether in ancient myth or modern art, creation tends to be an emergent process, not a linearly coded script. The cell is our current best guess for the basic unit of life’s emergence, turning a finite set of chemical building blocks—nucleic acids, proteins, sugars, fats—into a replicative, evolving system for fighting stasis and entropy. The cell’s song is one for our times, for it is the song of biology’s emergence out of chemistry and physics, into the “frenetically active process” of homeostasis.
Re-centering our view of biology has practical consequences, too, for how we think about diagnosing and treating disease, and for inventing new medicines. Centering cells presents a challenge: which type of cell to place at the center? Rather than default to the apparent simplicity of DNA as a symbol because it represents the one master code for life, the tension in defining the diversity of cells—a mapping process still far from complete in cutting-edge biology laboratories—can help to create a more thoughtful library of cellular metaphors to shape both the practice and communication of biology.
Further, effective problem solving is often about operating at the right level, or the right scale. The cell feels like appropriate level at which to interrogate many of the diseases that ail us, because the senses that guide our own perceptions of sickness and health—the smoldering pain of inflammation, the tunnel vision of a migraine, the dizziness of a fluttering heart—are emergent.
This, unfortunately, is sort of where Mukherjee leaves the reader, under-exploring the consequences of a biology of emergence. Many practical and profound questions have to do with the ways that each scale of life feeds back on the others. In a tome on Cells and “the future human” I wished that Mukherjee had created more space for seeking the ways that cells will shape and be shaped by the future, of humanity and otherwise.
We are entering a phase of real-world bioengineering that features the modularization of cellular parts within cells, of cells within organs, of organs within bodies, and of bodies within ecosystems. In this reality, we would be unwise to assume that any whole is the mere sum of its parts.
For example, when discussing the regenerative power of pluripotent stem cells, Mukherjee raises the philosophical thought experiment of the Delphic boat, also known as the Ship of Theseus. The boat is made of many pieces of wood, each of which is replaced for repairs over the years, with the boat’s structure unchanged. Eventually none of the boat’s original wood remains: Is it the same boat?
Mukherjee raises the Delphic boat in one paragraph at the end of the chapter on stem cells, as a metaphor related to the possibility of stem cell-enabled regeneration in perpetuity. He does not follow any of the threads of potential answers. Given the current state of cellular engineering, about which Mukherjee is a world expert from his work as a physician-scientist, this book could have used an entire section dedicated to probing this question and, importantly, the ways this thought experiment falls apart.
We are entering a phase of real-world bioengineering that features the modularization of cellular parts within cells, of cells within organs, of organs within bodies, and of bodies within ecosystems. In this reality, we would be unwise to assume that any whole is the mere sum of its parts. Wholeness at any one of these scales of life—organelle, cell, organ, body, ecosystem—is what is at stake if we allow biological reductionism to assume away the relation between those scales.
In other words, Mukherjee succeeds in providing a masterful and compelling narrative of the lives of many of the cells that emerge to enliven us. Like his previous books, it is a worthwhile read for anyone curious about the role of cells in disease and in health. And yet, he fails to offer the broader context of The Song of the Cell.
As leading agronomist and essayist Wes Jackson has written, “The sequence of amino acids that is at home in the human cell, when produced inside the bacterial cell, does not fold quite right. Something about the E. coli internal environment affects the tertiary structure of the protein and makes it inactive. The whole in this case, the E. coli cell, affects the part—the newly made protein. Where is the priority of part now?” [1]
Beyond the ways that different kingdoms of life translate the same genetic code, the practical situation for humanity today relates to the ways that the different disciplines of modern life use values and culture to influence our genes, cells, bodies, and environment. It may be that humans will soon become a bit like the Delphic boat, infused with the buzz of fresh cells to repopulate different niches within our bodies, for healthier, longer lives. But in biology, as in writing, a mixed metaphor can cause something of a cacophony. For we are not boats with parts to be replaced piecemeal. And nor are whales, nor alpine forests, nor topsoil. Life isn’t a sum of parts, and neither is a song that rings true.
[1] Wes Jackson, "Visions and Assumptions," in Nature as Measure (p. 52-53).