Genomic Data Has a Diversity Problem, But Global Efforts Are Underway to Fix It
Genomics has begun its golden age. Just 20 years ago, sequencing a single genome cost nearly $3 billion and took over a decade. Today, the same feat can be achieved for a few hundred dollars and the better part of a day . Suddenly, the prospect of sequencing not just individuals, but whole populations, has become feasible.
The genetic differences between humans may seem meager, only around 0.1 percent of the genome on average, but this variation can have profound effects on an individual's risk of disease, responsiveness to medication, and even the dosage level that would work best.
Already, initiatives like the U.K.'s 100,000 Genomes Project - now expanding to 1 million genomes - and other similarly massive sequencing projects in Iceland and the U.S., have begun collecting population-scale data in order to capture and study this variation.
The resulting data sets are immensely valuable to researchers and drug developers working to design new 'precision' medicines and diagnostics, and to gain insights that may benefit patients. Yet, because the majority of this data comes from developed countries with well-established scientific and medical infrastructure, the data collected so far is heavily biased towards Western populations with largely European ancestry.
This presents a startling and fast-emerging problem: groups that are under-represented in these datasets are likely to benefit less from the new wave of therapeutics, diagnostics, and insights, simply because they were tailored for the genetic profiles of people with European ancestry.
We may indeed be approaching a golden age of genomics-enabled precision medicine. But if the data bias persists then there is a risk, as with most golden ages throughout history, that the benefits will not be equally accessible to all, and existing inequalities will only be exacerbated.
To remedy the situation, a number of initiatives have sprung up to sequence genomes of under-represented groups, adding them to the datasets and ensuring that they too will benefit from the rapidly unfolding genomic revolution.
Global Gene Corp
The idea behind Global Gene Corp was born eight years ago in Harvard when Sumit Jamuar, co-founder and CEO, met up with his two other co-founders, both experienced geneticists, for a coffee.
"They were discussing the limitless applications of understanding your genetic code," said Jamuar, a business executive from New Delhi.
"And so, being a technology enthusiast type, I was excited and I turned to them and said hey, this is incredible! Could you sequence me and give me some insights? And they actually just turned around and said no, because it's not going to be useful for you - there's not enough reference for what a good Sumit looks like."
What started as a curiosity-driven conversation on the power of genomics ended with a commitment to tackle one of the field's biggest roadblocks - its lack of global representation.
Jamuar set out to begin with India, which has about 20 percent of the world's population, including over 4000 different ethnicities, but contributes less than 2 percent of genomic data, he told Leaps.org.
Eight years later, Global Gene Corp's sequencing initiative is well underway, and is the largest in the history of the Indian subcontinent. The program is being carried out in collaboration with biotech giant Regeneron, with support from the Indian government, local communities, and the Indian healthcare ecosystem. In August 2020, Global Gene Corp's work was recognized through the $1 million 2020 Roddenberry award for organizations that advance the vision of 'Star Trek' creator Gene Roddenberry to better humanity.
This problem has already begun to manifest itself in, for example, much higher levels of genetic misdiagnosis among non-Europeans tested for their risk of certain diseases, such as hypertrophic cardiomyopathy - an inherited disease of the heart muscle.
Global Gene Corp also focuses on developing and implementing AI and machine learning tools to make sense of the deluge of genomic data. These tools are increasingly used by both industry and academia to guide future research by identifying particularly promising or clinically interesting genetic variants. But if the underlying data is skewed European, then the effectiveness of the computational analysis - along with the future advances and avenues of research that emerge from it - will be skewed towards Europeans too.
This problem has already begun to manifest itself in, for example, much higher levels of genetic misdiagnosis among non-Europeans tested for their risk of certain diseases, such as hypertrophic cardiomyopathy - an inherited disease of the heart muscle. Most of the genetic variants used in these tests were identified as being causal for the disease from studies of European genomes. However, many of these variants differ both in their distribution and clinical significance across populations, leading to many patients of non-European ancestry receiving false-positive test results - as their benign genetic variants were misclassified as pathogenic. Had even a small number of genomes from other ethnicities been included in the initial studies, these misdiagnoses could have been avoided.
"Unless we have a data set which is unbiased and representative, we're never going to achieve the success that we want," Jamuar says.
"When Siri was first launched, she could hardly recognize an accent which was not of a certain type, so if I was trying to speak to Siri, I would have to repeat myself multiple times and try to mimic an accent which wasn't my accent so that she could understand it.
"But over time the voice recognition technology improved tremendously because the training data was expanded to include people of very diverse backgrounds and their accents, so the algorithms were trained to be able to pick that up and it dramatically improved the technology. That's the way we have to think about it - without that good-quality diverse data, we will never be able to achieve the full potential of the computational tools."
While mapping India's rich genetic diversity has been the organization's primary focus so far, they plan, in time, to expand their work to other under-represented groups in Asia, the Middle East, Africa, and Latin America.
"As other like-minded people and partners join the mission, it just accelerates the achievement of what we have set out to do, which is to map out and organize the world's genomic diversity so that we can enable high-quality life and longevity benefits for everyone, everywhere," Jamuar says.
Empowering African Genomics
Africa is the birthplace of our species, and today still retains an inordinate amount of total human genetic diversity. Groups that left Africa and went on to populate the rest of the world, some 50 to 100,000 years ago, were likely small in number and only took a fraction of the total genetic diversity with them. This ancient bottleneck means that no other group in the world can match the level of genetic diversity seen in modern African populations.
Despite Africa's central importance in understanding the history and extent of human genetic diversity, the genomics of African populations remains wildly understudied. Addressing this disparity has become a central focus of the H3Africa Consortium, an initiative formally launched in 2012 with support from the African Academy of Sciences, the U.S. National Institutes of Health, and the UK's Wellcome Trust. Today, H3Africa supports over 50 projects across the continent, on an array of different research areas in genetics relevant to the health and heredity of Africans.
"Africa is the cradle of Humankind. So what that really means is that the populations that are currently living in Africa are among some of the oldest populations on the globe, and we know that the longer populations have had to go through evolutionary phases, the more variation there is in the genomes of people who live presently," says Zane Lombard, a principal investigator at H3Africa and Associate Professor of Human Genetics at the University of the Witwatersrand in Johannesburg, South Africa.
"So for that reason, African populations carry a huge amount of genetic variation and diversity, which is pretty much uncaptured. There's still a lot to learn as far as novel variation is concerned by looking at and studying African genomes."
A recent landmark H3Africa study, led by Lombard and published in Nature in October, sequenced the genomes of over 400 African individuals from 50 ethno-linguistic groups - many of which had never been sampled before.
Despite the relatively modest number of individuals sequenced in the study, over three million previously undescribed genetic variants were found, and complex patterns of ancestral migration were uncovered.
"In some of these ethno-linguistic groups they don't have a word for DNA, so we've had to really think about how to make sure that we communicate the purposes of different studies to participants so that you have true informed consent," says Lombard.
"The objective," she explained, "was to try and fill some of the gaps for many of these populations for which we didn't have any whole genome sequences or any genetic variation data...because if we're thinking about the future of precision medicine, if the patient is a member of a specific group where we don't know a lot about the genomic variation that exists in that group, it makes it really difficult to start thinking about clinical interpretation of their data."
From H3Africa's conception, the consortium's goal has not only been to better represent Africa's staggering genetic diversity in genomic data sets, but also to build Africa's domestic genomics capabilities and empower a new generation of African researchers. By doing so, the hope is that Africans will be able to set their own genomics agenda, and leapfrog to new and better ways of doing the work.
"The training that has happened on the continent and the number of new scientists, new students, and fellows that have come through the process and are now enabled to start their own research groups, to grow their own research in their countries, to be a spokesperson for genomics research in their countries, and to build that political will to do these larger types of sequencing initiatives - that is really a significant outcome from H3Africa as well. Over and above all the science that's coming out," Lombard says.
"What has been created through H3Africa is just this locus of researchers and scientists and bioethicists who have the same goal at heart - to work towards adjusting the data bias and making sure that all global populations are represented in genomics."
Debates over transgender athletes rage on, with new state bans and rules for Olympians, NCAA sports
Ashley O’Connor, who was biologically male at birth but identifies as female, decided to compete in badminton as a girl during her senior year of high school in Downers Grove, Illinois. There was no team for boys, and a female friend and badminton player “practically bullied me into joining” the girls’ team. O’Connor, who is 18 and taking hormone replacement therapy for her gender transition, recalled that “it was easily one of the best decisions I have ever made.”
She believes there are many reasons why it’s important for transgender people to have the option of playing sports on the team of their choice. “It provides a sense of community,” said O’Connor, now a first-year student concentrating in psychology at the College of DuPage in Glen Ellyn, Illinois.
“It’s a great way to get a workout, which is good for physical and mental health,” she added. She also enjoyed the opportunity to be competitive, learn about her strengths and weaknesses, and just be normal. “Trans people have friends and trans people want to play sports with their friends, especially in adolescence,” she said.
However, in 18 states, many of which are politically conservative, laws prohibit transgender students from participating in sports consistent with their gender identity, according to the Movement Advancement Project, an independent, nonprofit think tank based in Boulder, Colo., that focuses on the rights of LGBTQ people. The first ban was passed in Idaho in 2020, although federal district judges have halted this legislation and a similar law in West Virginia from taking effect.
Proponents of the bans caution that transgender females would have an unfair biological advantage in competitive school sports with other girls or women as a result of being born as stronger males, potentially usurping the athletic accomplishments of other athletes.
“The future of women’s sports is at risk, and the equal rights of female athletes is being infringed,” said Penny Nance, CEO and president of Concerned Women for America, a legislative action committee in D.C. that seeks to impact culture to promote religious values.
“As the tidal wave of gender activism consumes sports from the Olympics on down, a backlash is being felt as parents are furious about the disregard for their daughters who have worked very hard to achieve success as athletes,” Nance added. “Former athletes, whose records are being shattered, are demanding answers.”
Meanwhile, opponents of the bans contend that they bar transgender athletes from playing sports with friends and learning the value of teamwork and other life lessons. These laws target transgender girls most often in kindergarten through high school but sometimes in college as well. Many local schools and state athletic associations already have their own guidelines “to both protect transgender people and ensure a level playing field for all athletes,” according to the Movement Advancement Project’s website. But statewide bans take precedence over these policies.
"It’s easy to sympathize on some level with arguments on both sides, and it’s likely going to be impossible to make everyone happy,” said Liz Joy, a past president of the American College of Sports Medicine.
In January, the National Collegiate Athletic Association (NCAA), based in Indianapolis, tried to sort out the controversy by implementing a new policy. It requires transgender students participating in female sports to prove that they’ve been taking treatments to suppress testosterone for at least one year before competition, as well as demonstrating that their testosterone level is sufficiently low, depending on the sport, through a blood test.
Then, in August, the NCAA clarified that these athletes also must take another blood test six months after their season has started that shows their testosterone levels aren’t too high. Additional guidelines will take effect next August.
Even with these requirements, “there is no plan that is going to be considered equitable and fair to all,” said Bradley Anawalt, an endocrinologist at the University of Washington School of Medicine. Biologically, he noted, there is still some evidence that a transgender female who initiates hormone therapy with estrogen and drops her testosterone to very low levels may have some advantage over other females, based on characteristics such as hand and foot size, height and perhaps strength.
Liz Joy, a past president of the American College of Sports Medicine, agrees that allowing transgender athletes to compete on teams of their self-identifying gender poses challenges. “It’s easy to sympathize on some level with arguments on both sides, and it’s likely going to be impossible to make everyone happy,” said Joy, a physician and senior medical director of wellness and nutrition at Intermountain Healthcare in Salt Lake City, Utah. While advocating for inclusion, she added that “sport was incredibly important in my life. I just want everyone to be able to benefit from it.”
One solution may be to allow transgender youth to play sports in a way that aligns with their gender identity until a certain age and before an elite level. “There are minimal or no potential financial stakes for most youth sports before age 13 or 14, and you do not have a lot of separation in athlete performance between most boys and girls until about age 13,” said Anwalt, who was a reviewer of the Endocrine Society’s national guidelines on transgender care.
Myron Genel, a professor emeritus and former chief of pediatric endocrinology at Yale School of Medicine, said it’s difficult to argue that height gives transgender females an edge because in some sports tall women already dominate over their shorter counterparts.
He added that the decision to allow transgender females to compete with other girls or women could hinge on when athletes began taking testosterone blockers. “If the process of conversion from male to female has been undertaken in the early stages of puberty, from my perspective, they have very little unique advantage,” said Genel, who advised the International Olympic Committee (IOC), based in Switzerland, on testosterone limits for transgender athletes.
Because young athletes’ bodies are still developing, “the differences in natural abilities are so massive that they would overwhelm any advantage a transgender athlete might have,” said Thomas H. Murray, president emeritus of The Hastings Center, a pioneering bioethics research institute in Garrison, New York, and author of the book “Good Sport,” which focuses on the ethics and values in the Olympics and other competitions.
“There’s no good reason to limit the participation of transgender athletes in the sports where male athletes don’t have an advantage over women,” such as sailing, archery and shooting events, Murray said. “The burden of proof rests on those who want to restrict participation by transgender athletes. They must show that in this sport, at this level of competition, transgender athletes have a conspicuous advantage.”
Last year, the IOC issued a new framework emphasizing that the Olympic rules related to transgender participation should be specific to each sport. “This is an evolving topic and there has been—as it will continue to be—new research coming out and new developments informing our approach,” and there’s currently no consensus on how testosterone affects performance across all sports, an IOC spokesperson told Leaps.org.
Many of the new laws prohibiting transgender people from competing in sports consistent with their gender identity specifically apply to transgender females. Yet, some experts say the issue also affects transgender males, nonbinary and intersex athletes.
“There has been quite a bit of attention paid to transgender females and their participation in biological female sports and almost minimal focus on transgender male competition in male sports or in any sports,” said Katherine Drabiak, associate professor of public health law and medical ethics at University of South Florida in Tampa. In fact, “transgender men, because they were born female, would be at a disadvantage of having less lean body mass, less strength and less muscular area as a general category compared to a biological male.”
While discussing transgender students’ participation in sports, it’s important to call attention to the toll that anti-transgender legislation can take on these young people’s well-being, said Jonah DeChants, a research scientist at The Trevor Project, a suicide prevention and mental health organization for LGBTQ youth. Recent polling found that 85 percent of transgender and nonbinary youth said that debates around anti-transgender laws had a negative impact on their mental health.
“The reality is simple: Most transgender girls want to play sports for the same reasons as any student—to benefit their health, to have fun, and to build connection with friends,” DeChants said. According to a new peer-reviewed qualitative study by researchers at The Trevor Project, many trans girls who participated in sports experienced harassment and stigma based on their gender identity, which can contribute to poor mental health outcomes and suicide risk.
In addition to badminton, O'Connor played other sports such as volleyball, and she plans to become an assistant coach or manager of her old high school's badminton team.
Ashley O'Connor
However, DeChants added, research also shows that young people who reported living in an accepting community, had access to LGBTQ-affirming spaces, or had social support from family and friends reported significantly lower rates of attempting suicide in the past year. “We urge coaches, educators and school administrators to seek LGBTQ-cultural competency training, implement zero tolerance policies for anti-trans bullying, and create safe, affirming environments for all transgender students on and off the field,” DeChants said.
O’Connor said her experiences on the athletic scene have been mostly positive. The politics of her community lean somewhat liberal, and she thinks it’s probably more supportive than some other areas of the country, though she noted the local library has received threats for hosting LGBTQ events. In addition to badminton, she also played baseball, lacrosse, volleyball, basketball and hockey. In the spring, she plans to become an assistant coach or manager for the girls’ badminton team at her old high school.
“When I played badminton, I never got any direct backlash from any coaches, competitors or teammates,” she said. “I had a few other teammates that identified as trans or nonbinary, [and] nearly all of the people I ever interacted with were super pleasant and treated me like any other normal person.” She added that transgender athletes “have aspirations. We have wants and needs. We have dreams. And at the end of the day, we just want to live our lives and be happy like everyone else.”
The Friday Five covers five stories in research that you may have missed this week. There are plenty of controversies and troubling ethical issues in science – and we get into many of them in our online magazine – but this news roundup focuses on scientific creativity and progress to give you a therapeutic dose of inspiration headed into the weekend.
Listen on Apple | Listen on Spotify | Listen on Stitcher | Listen on Amazon | Listen on Google
Here are the promising studies covered in this week's Friday Five:
- Research on a "smart" bandage for wounds
- A breakthrough in fighting inflammation
- The pros and cons of a new drug for Alzheimer's
- Benefits of the Mediterranean diet - with a twist
- How to recycle a plastic that was un-recyclable