Is It Possible to Predict Your Face, Voice, and Skin Color from Your DNA?
Renowned genetics pioneer Dr. J Craig Venter is no stranger to controversy.
Back in 2000, he famously raced the public Human Genome Project to decode all three billion letters of the human genome for the first time. A decade later, he ignited a new debate when his team created a bacterial cell with a synthesized genome.
Most recently, he's jumped back into the fray with a study in the September issue of the Proceedings of the National Academy of Sciences about the predictive potential of genomic data to identify individual traits such as voice, facial structure and skin color.
The new study raises significant questions about the privacy of genetic data.
His study applied whole-genome sequencing and statistical modeling to predict traits in 1,061 people of diverse ancestry. His approach aimed to reconstruct a person's physical characteristics based on DNA, and 74 percent of the time, his algorithm could correctly identify the individual in a random lineup of 10 people from his company's database.
While critics have been quick to cast doubt on the plausibility of his claims, the ability to discern people's observable traits, or phenotypes, from their genomes may grow more precise as technology improves, raising significant questions about the privacy and usage of genetic information in the long term.
J. Craig Venter showing slides from his recent study on facial prediction at the Summit Conference in Los Angeles on Nov. 3, 2017.
(Courtesy of Kira Peikoff)
Critics: Study Was Incomplete, Problematic
Before even redressing these potential legal and ethical considerations, some scientists simply said the study's main result was invalid. They pointed out that the methodology worked much better in distinguishing between people of different ethnicities than those of the same ethnicity. One of the most outspoken critics, Yaniv Erlich, a geneticist at Columbia University, said, "The method doesn't work. The results were like, 'If you have a lineup of ten people, you can predict eight."
Erlich, who reviewed Venter's paper for Science, where it was rejected, said that he came up with the same results—correctly predicting eight of ten people—by just looking at demographic factors such as age, gender and ethnicity. He added that Venter's recent rebuttal to his criticism was that 'Once we have thousands of phenotypes, it might work better.' But that, Erlich argued, would be "a major breach of privacy. Nobody has thousands of phenotypes for people."
Other critics suggested that the study's results discourage the sharing of genetic data, which is becoming increasingly important for medical research. They go one step further and imply that people's possible hesitation to share their genetic information in public databases may actually play into Venter's hands.
Venter's own company, Human Longevity Inc., aims to build the world's most comprehensive private database on human genotypes and phenotypes. The vastness of this information stands to improve the accuracy of whole genome and microbiome sequencing for individuals—analyses that come at a hefty price tag. Today, Human Longevity Inc. will sequence your genome and perform a battery of other health-related tests at an entry cost of $4900, going up to $25,000. Venter initially agreed to comment for this article, but then could not be reached.
"The bigger issue is how do we understand and use genetic information and avoid harming people."
Opens Up Pandora's Box of Ethical Issues
Whether Venter's study is valid may not be as important as the Pandora's box of potential ethical and legal issues that it raises for future consideration. "I think this story is one along a continuum of stories we've had on the issue of identifiability based on genomic information in the past decade," said Amy McGuire, a biomedical ethics professor at Baylor College of Medicine. "It does raise really interesting and important questions about privacy, and socially, how we respond to these types of scientific advancements. A lot of our focus from a policy and ethics perspective is to protect privacy."
McGuire, who is also the Director of the Center for Medical Ethics and Health Policy at Baylor, added that while protecting privacy is very important, "the bigger issue is how do we understand and use genetic information and avoid harming people." While we've taken "baby steps," she said, towards enacting laws in the U.S. that fight genetic determinism—such as the Genetic Information and Nondiscrimination Act, which prohibits discrimination based on genetic information in health insurance and employment—some areas remain unprotected, such as for life insurance and disability.
J. Craig Venter showing slides from his recent study on facial prediction at the Summit Conference in Los Angeles on Nov. 3, 2017.
(Courtesy of Kira Peikoff)
Physical reconstructions like those in Venter's study could also be inappropriately used by law enforcement, said Leslie Francis, a law and philosophy professor at the University of Utah, who has written about the ethical and legal issues related to sharing genomic data.
"If [Venter's] findings, or findings like them, hold up, the implications would be significant," Francis said. Law enforcement is increasingly using DNA identification from genetic material left at crime scenes to weed out innocent and guilty suspects, she explained. This adds another potentially complicating layer.
"There is a shift here, from using DNA sequencing techniques to match other DNA samples—as when semen obtained from a rape victim is then matched (or not) with a cheek swab from a suspect—to using DNA sequencing results to predict observable characteristics," Francis said. She added that while the former necessitates having an actual DNA sample for a match, the latter can use DNA to pre-emptively (and perhaps inaccurately) narrow down suspects.
"My worry is that if this [the study's methodology] turns out to be sort-of accurate, people will think it is better than what it is," said Francis. "If law enforcement comes to rely on it, there will be a host of false positives and false negatives. And we'll face new questions, [such as] 'Which is worse? Picking an innocent as guilty, or failing to identify someone who is guilty?'"
Risking Privacy Involves a Tradeoff
When people voluntarily risk their own privacy, that involves a tradeoff, McGuire said. A 2014 study that she conducted among people who were very sick, or whose children were very sick, found that more than half were willing to share their health information, despite concerns about privacy, because they saw a big benefit in advancing research on their conditions.
"We've focused a lot of our policy attention on restricting access, but we don't have a system of accountability when there's a breach."
"To make leaps and bounds in medicine and genomics, we need to create a database of millions of people signing on to share their genetic and health information in order to improve research and clinical care," McGuire said. "They are going to risk their privacy, and we have a social obligation to protect them."
That also means "punishing bad actors," she continued. "We've focused a lot of our policy attention on restricting access, but we don't have a system of accountability when there's a breach."
Even though most people using genetic information have good intentions, the consequences if not are troubling. "All you need is one bad actor who decimates the trust in the system, and it has catastrophic consequences," she warned. That hasn't happened on a massive scale yet, and even if it did, some experts argue that obtaining the data is not the real risk; what is more concerning is hacking individuals' genetic information to be used against them, such as to prove someone is unfit for a particular job because of a genetic condition like Alzheimer's, or that a parent is unfit for custody because of a genetic disposition to mental illness.
Venter, in fact, told an audience at the recent Summit conference in Los Angeles that his new study's approach could not only predict someone's physical appearance from their DNA, but also some of their psychological traits, such as the propensity for an addictive personality. In the future, he said, it will be possible to predict even more about mental health from the genome.
What is most at risk on a massive scale, however, is not so much genetic information as demographic identifiers included in medical records, such as birth dates and social security numbers, said Francis, the law and philosophy professor. "The much more interesting and lucrative security breaches typically involve not people interested in genetic information per se, but people interested in the information in health records that you can't change."
Hospitals have been hacked for this kind of information, including an incident at the Veterans Administration in 2006, in which the laptop and external hard drive of an agency employee that contained unencrypted information on 26.5 million patients were stolen from the employee's house.
So, what can people do to protect themselves? "Don't share anything you wouldn't want the world to see," Francis said. "And don't click 'I agree' without actually reading privacy policies or terms and conditions. They may surprise you."
Catching colds may help protect kids from Covid
A common cold virus causes the immune system to produce T cells that also provide protection against SARS-CoV-2, according to new research. The study, published last month in PNAS, shows that this effect is most pronounced in young children. The finding may help explain why most young people who have been exposed to the cold-causing coronavirus have not developed serious cases of COVID-19.
One curiosity stood out in the early days of the COVID-19 pandemic – why were so few kids getting sick. Generally young children and the elderly are the most vulnerable to disease outbreaks, particularly viral infections, either because their immune systems are not fully developed or they are starting to fail.
But solid information on the new infection was so scarce that many public health officials acted on the precautionary principle, assumed a worst-case scenario, and applied the broadest, most restrictive policies to all people to try to contain the coronavirus SARS-CoV-2.
One early thought was that lockdowns worked and kids (ages 6 months to 17 years) simply were not being exposed to the virus. So it was a shock when data started to come in showing that well over half of them carried antibodies to the virus, indicating exposure without getting sick. That trend grew over time and the latest tracking data from the CDC shows that 96.3 percent of kids in the U.S. now carry those antibodies.
Antibodies are relatively quick and easy to measure, but some scientists are exploring whether the reactions of T cells could serve as a more useful measure of immune protection.
But that couldn't be the whole story because antibody protection fades, sometimes as early as a month after exposure and usually within a year. Additionally, SARS-CoV-2 has been spewing out waves of different variants that were more resistant to antibodies generated by their predecessors. The resistance was so significant that over time the FDA withdrew its emergency use authorization for a handful of monoclonal antibodies with earlier approval to treat the infection because they no longer worked.
Antibodies got most of the attention early on because they are part of the first line response of the immune system. Antibodies can bind to viruses and neutralize them, preventing infection. They are relatively quick and easy to measure and even manufacture, but as SARS-CoV-2 showed us, often viruses can quickly evolve to become more resistant to them. Some scientists are exploring whether the reactions of T cells could serve as a more useful measure of immune protection.
Kids, colds and T cells
T cells are part of the immune system that deals with cells once they have become infected. But working with T cells is much more difficult, takes longer, and is more expensive than working with antibodies. So studies often lags behind on this part of the immune system.
A group of researchers led by Annika Karlsson at the Karolinska Institute in Sweden focuses on T cells targeting virus-infected cells and, unsurprisingly, saw that they can play a role in SARS-CoV-2 infection. Other labs have shown that vaccination and natural exposure to the virus generates different patterns of T cell responses.
The Swedes also looked at another member of the coronavirus family, OC43, which circulates widely and is one of several causes of the common cold. The molecular structure of OC43 is similar to its more deadly cousin SARS-CoV-2. Sometimes a T cell response to one virus can produce a cross-reactive response to a similar protein structure in another virus, meaning that T cells will identify and respond to the two viruses in much the same way. Karlsson looked to see if T cells for OC43 from a wide age range of patients were cross-reactive to SARS-CoV-2.
And that is what they found, as reported in the PNAS study last month; there was cross-reactive activity, but it depended on a person’s age. A subset of a certain type of T cells, called mCD4+,, that recognized various protein parts of the cold-causing virus, OC43, expressed on the surface of an infected cell – also recognized those same protein parts from SARS-CoV-2. The T cell response was lower than that generated by natural exposure to SARS-CoV-2, but it was functional and thus could help limit the severity of COVID-19.
“One of the most politicized aspects of our pandemic response was not accepting that children are so much less at risk for severe disease with COVID-19,” because usually young children are among the most vulnerable to pathogens, says Monica Gandhi, professor of medicine at the University of California San Francisco.
“The cross-reactivity peaked at age six when more than half the people tested have a cross-reactive immune response,” says Karlsson, though their sample is too small to say if this finding applies more broadly across the population. The vast majority of children as young as two years had OC43-specific mCD4+ T cell responses. In adulthood, the functionality of both the OC43-specific and the cross-reactive T cells wane significantly, especially with advanced age.
“Considering that the mortality rate in children is the lowest from ages five to nine, and higher in younger children, our results imply that cross-reactive mCD4+ T cells may have a role in the control of SARS-CoV-2 infection in children,” the authors wrote in their paper.
“One of the most politicized aspects of our pandemic response was not accepting that children are so much less at risk for severe disease with COVID-19,” because usually young children are among the most vulnerable to pathogens, says Monica Gandhi, professor of medicine at the University of California San Francisco and author of the book, Endemic: A Post-Pandemic Playbook, to be released by the Mayo Clinic Press this summer. The immune response of kids to SARS-CoV-2 stood our expectations on their head. “We just haven't seen this before, so knowing the mechanism of protection is really important.”
Why the T cell immune response can fade with age is largely unknown. With some viruses such as measles, a single vaccination or infection generates life-long protection. But respiratory tract infections, like SARS-CoV-2, cause a localized infection - specific to certain organs - and that response tends to be shorter lived than systemic infections that affect the entire body. Karlsson suspects the elderly might be exposed to these localized types of viruses less often. Also, frequent continued exposure to a virus that results in reactivation of the memory T cell pool might eventually result in “a kind of immunosenescence or immune exhaustion that is associated with aging,” Karlsson says. https://leaps.org/scientists-just-started-testing-a-new-class-of-drugs-to-slow-and-even-reverse-aging/particle-3 This fading protection is why older people need to be repeatedly vaccinated against SARS-CoV-2.
Policy implications
Following the numbers on COVID-19 infections and severity over the last three years have shown us that healthy young people without risk factors are not likely to develop serious disease. This latest study points to a mechanism that helps explain why. But the inertia of existing policies remains. How should we adjust policy recommendations based on what we know today?
The World Health Organization (WHO) updated their COVID-19 vaccination guidance on March 28. It calls for a focus on vaccinating and boosting those at risk for developing serious disease. The guidance basically shrugged its shoulders when it came to healthy children and young adults receiving vaccinations and boosters against COVID-19. It said the priority should be to administer the “traditional essential vaccines for children,” such as those that protect against measles, rubella, and mumps.
“As an immunologist and a mother, I think that catching a cold or two when you are a kid and otherwise healthy is not that bad for you. Children have a much lower risk of becoming severely ill with SARS-CoV-2,” says Karlsson. She has followed public health guidance in Sweden, which means that her young children have not been vaccinated, but being older, she has received the vaccine and boosters. Gandhi and her children have been vaccinated, but they do not plan on additional boosters.
The WHO got it right in “concentrating on what matters,” which is getting traditional childhood immunizations back on track after their dramatic decline over the last three years, says Gandhi. Nor is there a need for masking in schools, according to a study from the Catalonia region of Spain. It found “no difference in masking and spread in schools,” particularly since tracking data indicate that nearly all young people have been exposed to SARS-CoV-2.
Both researchers lament that public discussion has overemphasized the quickly fading antibody part of the immune response to SARS-CoV-2 compared with the more durable T cell component. They say developing an efficient measure of T cell response for doctors to use in the clinic would help to monitor immunity in people at risk for severe cases of COVID-19 compared with the current method of toting up potential risk factors.
The Friday Five covers five stories in research that you may have missed this week. There are plenty of controversies and troubling ethical issues in science – and we get into many of them in our online magazine – but this news roundup focuses on new scientific theories and progress to give you a therapeutic dose of inspiration headed into the weekend.
Listen on Apple | Listen on Spotify | Listen on Stitcher | Listen on Amazon | Listen on Google
Here are the stories covered this week:
- The eyes are the windows to the soul - and biological aging?
- What bean genes mean for health and the planet
- This breathing practice could lower levels of tau proteins
- AI beats humans at assessing heart health
- Should you get a nature prescription?