Genomic Data Has a Diversity Problem, But Global Efforts Are Underway to Fix It
Genomics has begun its golden age. Just 20 years ago, sequencing a single genome cost nearly $3 billion and took over a decade. Today, the same feat can be achieved for a few hundred dollars and the better part of a day . Suddenly, the prospect of sequencing not just individuals, but whole populations, has become feasible.
The genetic differences between humans may seem meager, only around 0.1 percent of the genome on average, but this variation can have profound effects on an individual's risk of disease, responsiveness to medication, and even the dosage level that would work best.
Already, initiatives like the U.K.'s 100,000 Genomes Project - now expanding to 1 million genomes - and other similarly massive sequencing projects in Iceland and the U.S., have begun collecting population-scale data in order to capture and study this variation.
The resulting data sets are immensely valuable to researchers and drug developers working to design new 'precision' medicines and diagnostics, and to gain insights that may benefit patients. Yet, because the majority of this data comes from developed countries with well-established scientific and medical infrastructure, the data collected so far is heavily biased towards Western populations with largely European ancestry.
This presents a startling and fast-emerging problem: groups that are under-represented in these datasets are likely to benefit less from the new wave of therapeutics, diagnostics, and insights, simply because they were tailored for the genetic profiles of people with European ancestry.
We may indeed be approaching a golden age of genomics-enabled precision medicine. But if the data bias persists then there is a risk, as with most golden ages throughout history, that the benefits will not be equally accessible to all, and existing inequalities will only be exacerbated.
To remedy the situation, a number of initiatives have sprung up to sequence genomes of under-represented groups, adding them to the datasets and ensuring that they too will benefit from the rapidly unfolding genomic revolution.
Global Gene Corp
The idea behind Global Gene Corp was born eight years ago in Harvard when Sumit Jamuar, co-founder and CEO, met up with his two other co-founders, both experienced geneticists, for a coffee.
"They were discussing the limitless applications of understanding your genetic code," said Jamuar, a business executive from New Delhi.
"And so, being a technology enthusiast type, I was excited and I turned to them and said hey, this is incredible! Could you sequence me and give me some insights? And they actually just turned around and said no, because it's not going to be useful for you - there's not enough reference for what a good Sumit looks like."
What started as a curiosity-driven conversation on the power of genomics ended with a commitment to tackle one of the field's biggest roadblocks - its lack of global representation.
Jamuar set out to begin with India, which has about 20 percent of the world's population, including over 4000 different ethnicities, but contributes less than 2 percent of genomic data, he told Leaps.org.
Eight years later, Global Gene Corp's sequencing initiative is well underway, and is the largest in the history of the Indian subcontinent. The program is being carried out in collaboration with biotech giant Regeneron, with support from the Indian government, local communities, and the Indian healthcare ecosystem. In August 2020, Global Gene Corp's work was recognized through the $1 million 2020 Roddenberry award for organizations that advance the vision of 'Star Trek' creator Gene Roddenberry to better humanity.
This problem has already begun to manifest itself in, for example, much higher levels of genetic misdiagnosis among non-Europeans tested for their risk of certain diseases, such as hypertrophic cardiomyopathy - an inherited disease of the heart muscle.
Global Gene Corp also focuses on developing and implementing AI and machine learning tools to make sense of the deluge of genomic data. These tools are increasingly used by both industry and academia to guide future research by identifying particularly promising or clinically interesting genetic variants. But if the underlying data is skewed European, then the effectiveness of the computational analysis - along with the future advances and avenues of research that emerge from it - will be skewed towards Europeans too.
This problem has already begun to manifest itself in, for example, much higher levels of genetic misdiagnosis among non-Europeans tested for their risk of certain diseases, such as hypertrophic cardiomyopathy - an inherited disease of the heart muscle. Most of the genetic variants used in these tests were identified as being causal for the disease from studies of European genomes. However, many of these variants differ both in their distribution and clinical significance across populations, leading to many patients of non-European ancestry receiving false-positive test results - as their benign genetic variants were misclassified as pathogenic. Had even a small number of genomes from other ethnicities been included in the initial studies, these misdiagnoses could have been avoided.
"Unless we have a data set which is unbiased and representative, we're never going to achieve the success that we want," Jamuar says.
"When Siri was first launched, she could hardly recognize an accent which was not of a certain type, so if I was trying to speak to Siri, I would have to repeat myself multiple times and try to mimic an accent which wasn't my accent so that she could understand it.
"But over time the voice recognition technology improved tremendously because the training data was expanded to include people of very diverse backgrounds and their accents, so the algorithms were trained to be able to pick that up and it dramatically improved the technology. That's the way we have to think about it - without that good-quality diverse data, we will never be able to achieve the full potential of the computational tools."
While mapping India's rich genetic diversity has been the organization's primary focus so far, they plan, in time, to expand their work to other under-represented groups in Asia, the Middle East, Africa, and Latin America.
"As other like-minded people and partners join the mission, it just accelerates the achievement of what we have set out to do, which is to map out and organize the world's genomic diversity so that we can enable high-quality life and longevity benefits for everyone, everywhere," Jamuar says.
Empowering African Genomics
Africa is the birthplace of our species, and today still retains an inordinate amount of total human genetic diversity. Groups that left Africa and went on to populate the rest of the world, some 50 to 100,000 years ago, were likely small in number and only took a fraction of the total genetic diversity with them. This ancient bottleneck means that no other group in the world can match the level of genetic diversity seen in modern African populations.
Despite Africa's central importance in understanding the history and extent of human genetic diversity, the genomics of African populations remains wildly understudied. Addressing this disparity has become a central focus of the H3Africa Consortium, an initiative formally launched in 2012 with support from the African Academy of Sciences, the U.S. National Institutes of Health, and the UK's Wellcome Trust. Today, H3Africa supports over 50 projects across the continent, on an array of different research areas in genetics relevant to the health and heredity of Africans.
"Africa is the cradle of Humankind. So what that really means is that the populations that are currently living in Africa are among some of the oldest populations on the globe, and we know that the longer populations have had to go through evolutionary phases, the more variation there is in the genomes of people who live presently," says Zane Lombard, a principal investigator at H3Africa and Associate Professor of Human Genetics at the University of the Witwatersrand in Johannesburg, South Africa.
"So for that reason, African populations carry a huge amount of genetic variation and diversity, which is pretty much uncaptured. There's still a lot to learn as far as novel variation is concerned by looking at and studying African genomes."
A recent landmark H3Africa study, led by Lombard and published in Nature in October, sequenced the genomes of over 400 African individuals from 50 ethno-linguistic groups - many of which had never been sampled before.
Despite the relatively modest number of individuals sequenced in the study, over three million previously undescribed genetic variants were found, and complex patterns of ancestral migration were uncovered.
"In some of these ethno-linguistic groups they don't have a word for DNA, so we've had to really think about how to make sure that we communicate the purposes of different studies to participants so that you have true informed consent," says Lombard.
"The objective," she explained, "was to try and fill some of the gaps for many of these populations for which we didn't have any whole genome sequences or any genetic variation data...because if we're thinking about the future of precision medicine, if the patient is a member of a specific group where we don't know a lot about the genomic variation that exists in that group, it makes it really difficult to start thinking about clinical interpretation of their data."
From H3Africa's conception, the consortium's goal has not only been to better represent Africa's staggering genetic diversity in genomic data sets, but also to build Africa's domestic genomics capabilities and empower a new generation of African researchers. By doing so, the hope is that Africans will be able to set their own genomics agenda, and leapfrog to new and better ways of doing the work.
"The training that has happened on the continent and the number of new scientists, new students, and fellows that have come through the process and are now enabled to start their own research groups, to grow their own research in their countries, to be a spokesperson for genomics research in their countries, and to build that political will to do these larger types of sequencing initiatives - that is really a significant outcome from H3Africa as well. Over and above all the science that's coming out," Lombard says.
"What has been created through H3Africa is just this locus of researchers and scientists and bioethicists who have the same goal at heart - to work towards adjusting the data bias and making sure that all global populations are represented in genomics."
Nobel Prize goes to technology for mRNA vaccines
When Drew Weissman received a call from Katalin Karikó in the early morning hours this past Monday, he assumed his longtime research partner was calling to share a nascent, nagging idea. Weissman, a professor of medicine at the Perelman School of Medicine at the University of Pennsylvania, and Karikó, a professor at Szeged University and an adjunct professor at UPenn, both struggle with sleep disturbances. Thus, middle-of-the-night discourses between the two, often over email, has been a staple of their friendship. But this time, Karikó had something more pressing and exciting to share: They had won the 2023 Nobel Prize in Physiology or Medicine.
The work for which they garnered the illustrious award and its accompanying $1,000,000 cash windfall was completed about two decades ago, wrought through long hours in the lab over many arduous years. But humanity collectively benefited from its life-saving outcome three years ago, when both Moderna and Pfizer/BioNTech’s mRNA vaccines against COVID were found to be safe and highly effective at preventing severe disease. Billions of doses have since been given out to protect humans from the upstart viral scourge.
“I thought of going somewhere else, or doing something else,” said Katalin Karikó. “I also thought maybe I’m not good enough, not smart enough. I tried to imagine: Everything is here, and I just have to do better experiments.”
Unlocking the power of mRNA
Weissman and Karikó unlocked mRNA vaccines for the world back in the early 2000s when they made a key breakthrough. Messenger RNA molecules are essentially instructions for cells’ ribosomes to make specific proteins, so in the 1980s and 1990s, researchers started wondering if sneaking mRNA into the body could trigger cells to manufacture antibodies, enzymes, or growth agents for protecting against infection, treating disease, or repairing tissues. But there was a big problem: injecting this synthetic mRNA triggered a dangerous, inflammatory immune response resulting in the mRNA’s destruction.
While most other researchers chose not to tackle this perplexing problem to instead pursue more lucrative and publishable exploits, Karikó stuck with it. The choice sent her academic career into depressing doldrums. Nobody would fund her work, publications dried up, and after six years as an assistant professor at the University of Pennsylvania, Karikó got demoted. She was going backward.
“I thought of going somewhere else, or doing something else,” Karikó told Stat in 2020. “I also thought maybe I’m not good enough, not smart enough. I tried to imagine: Everything is here, and I just have to do better experiments.”
A tale of tenacity
Collaborating with Drew Weissman, a new professor at the University of Pennsylvania, in the late 1990s helped provide Karikó with the tenacity to continue. Weissman nurtured a goal of developing a vaccine against HIV-1, and saw mRNA as a potential way to do it.
“For the 20 years that we’ve worked together before anybody knew what RNA is, or cared, it was the two of us literally side by side at a bench working together,” Weissman said in an interview with Adam Smith of the Nobel Foundation.
In 2005, the duo made their 2023 Nobel Prize-winning breakthrough, detailing it in a relatively small journal, Immunity. (Their paper was rejected by larger journals, including Science and Nature.) They figured out that chemically modifying the nucleoside bases that make up mRNA allowed the molecule to slip past the body’s immune defenses. Karikó and Weissman followed up that finding by creating mRNA that’s more efficiently translated within cells, greatly boosting protein production. In 2020, scientists at Moderna and BioNTech (where Karikó worked from 2013 to 2022) rushed to craft vaccines against COVID, putting their methods to life-saving use.
The future of vaccines
Buoyed by the resounding success of mRNA vaccines, scientists are now hurriedly researching ways to use mRNA medicine against other infectious diseases, cancer, and genetic disorders. The now ubiquitous efforts stand in stark contrast to Karikó and Weissman’s previously unheralded struggles years ago as they doggedly worked to realize a shared dream that so many others shied away from. Katalin Karikó and Drew Weissman were brave enough to walk a scientific path that very well could have ended in a dead end, and for that, they absolutely deserve their 2023 Nobel Prize.
This article originally appeared on Big Think, home of the brightest minds and biggest ideas of all time.
Scientists turn pee into power in Uganda
At the edge of a dirt road flanked by trees and green mountains outside the town of Kisoro, Uganda, sits the concrete building that houses Sesame Girls School, where girls aged 11 to 19 can live, learn and, at least for a while, safely use a toilet. In many developing regions, toileting at night is especially dangerous for children. Without electrical power for lighting, kids may fall into the deep pits of the latrines through broken or unsteady floorboards. Girls are sometimes assaulted by men who hide in the dark.
For the Sesame School girls, though, bright LED lights, connected to tiny gadgets, chased the fears away. They got to use new, clean toilets lit by the power of their own pee. Some girls even used the light provided by the latrines to study.
Urine, whether animal or human, is more than waste. It’s a cheap and abundant resource. Each day across the globe, 8.1 billion humans make 4 billion gallons of pee. Cows, pigs, deer, elephants and other animals add more. By spending money to get rid of it, we waste a renewable resource that can serve more than one purpose. Microorganisms that feed on nutrients in urine can be used in a microbial fuel cell that generates electricity – or "pee power," as the Sesame girls called it.
Plus, urine contains water, phosphorus, potassium and nitrogen, the key ingredients plants need to grow and survive. Human urine could replace about 25 percent of current nitrogen and phosphorous fertilizers worldwide and could save water for gardens and crops. The average U.S. resident flushes a toilet bowl containing only pee and paper about six to seven times a day, which adds up to about 3,500 gallons of water down per year. Plus cows in the U.S. produce 231 gallons of the stuff each year.
Pee power
A conventional fuel cell uses chemical reactions to produce energy, as electrons move from one electrode to another to power a lightbulb or phone. Ioannis Ieropoulos, a professor and chair of Environmental Engineering at the University of Southampton in England, realized the same type of reaction could be used to make a fuel from microbes in pee.
Bacterial species like Shewanella oneidensis and Pseudomonas aeruginosa can consume carbon and other nutrients in urine and pop out electrons as a result of their digestion. In a microbial fuel cell, one electrode is covered in microbes, immersed in urine and kept away from oxygen. Another electrode is in contact with oxygen. When the microbes feed on nutrients, they produce the electrons that flow through the circuit from one electrod to another to combine with oxygen on the other side. As long as the microbes have fresh pee to chomp on, electrons keep flowing. And after the microbes are done with the pee, it can be used as fertilizer.
These microbes are easily found in wastewater treatment plants, ponds, lakes, rivers or soil. Keeping them alive is the easy part, says Ieropoulos. Once the cells start producing stable power, his group sequences the microbes and keeps using them.
Like many promising technologies, scaling these devices for mass consumption won’t be easy, says Kevin Orner, a civil engineering professor at West Virginia University. But it’s moving in the right direction. Ieropoulos’s device has shrunk from the size of about three packs of cards to a large glue stick. It looks and works much like a AAA battery and produce about the same power. By itself, the device can barely power a light bulb, but when stacked together, they can do much more—just like photovoltaic cells in solar panels. His lab has produced 1760 fuel cells stacked together, and with manufacturing support, there’s no theoretical ceiling, he says.
Although pure urine produces the most power, Ieropoulos’s devices also work with the mixed liquids of the wastewater treatment plants, so they can be retrofit into urban wastewater utilities.
This image shows how the pee-powered system works. Pee feeds bacteria in the stack of fuel cells (1), which give off electrons (2) stored in parallel cylindrical cells (3). These cells are connected to a voltage regulator (4), which smooths out the electrical signal to ensure consistent power to the LED strips lighting the toilet.
Courtesy Ioannis Ieropoulos
Key to the long-term success of any urine reclamation effort, says Orner, is avoiding what he calls “parachute engineering”—when well-meaning scientists solve a problem with novel tech and then abandon it. “The way around that is to have either the need come from the community or to have an organization in a community that is committed to seeing a project operate and maintained,” he says.
Success with urine reclamation also depends on the economy. “If energy prices are low, it may not make sense to recover energy,” says Orner. “But right now, fertilizer prices worldwide are generally pretty high, so it may make sense to recover fertilizer and nutrients.” There are obstacles, too, such as few incentives for builders to incorporate urine recycling into new construction. And any hiccups like leaks or waste seepage will cost builders money and reputation. Right now, Orner says, the risks are just too high.
Despite the challenges, Ieropoulos envisions a future in which urine is passed through microbial fuel cells at wastewater treatment plants, retrofitted septic tanks, and building basements, and is then delivered to businesses to use as agricultural fertilizers. Although pure urine produces the most power, Ieropoulos’s devices also work with the mixed liquids of the wastewater treatment plants, so they can be retrofitted into urban wastewater utilities where they can make electricity from the effluent. And unlike solar cells, which are a common target of theft in some areas, nobody wants to steal a bunch of pee.
When Ieropoulos’s team returned to wrap up their pilot project 18 months later, the school’s director begged them to leave the fuel cells in place—because they made a major difference in students’ lives. “We replaced it with a substantial photovoltaic panel,” says Ieropoulos, They couldn’t leave the units forever, he explained, because of intellectual property reasons—their funders worried about theft of both the technology and the idea. But the photovoltaic replacement could be stolen, too, leaving the girls in the dark.
The story repeated itself at another school, in Nairobi, Kenya, as well as in an informal settlement in Durban, South Africa. Each time, Ieropoulos vowed to return. Though the pandemic has delayed his promise, he is resolute about continuing his work—it is a moral and legal obligation. “We've made a commitment to ourselves and to the pupils,” he says. “That's why we need to go back.”
Urine as fertilizer
Modern day industrial systems perpetuate the broken cycle of nutrients. When plants grow, they use up nutrients the soil. We eat the plans and excrete some of the nutrients we pass them into rivers and oceans. As a result, farmers must keep fertilizing the fields while our waste keeps fertilizing the waterways, where the algae, overfertilized with nitrogen, phosphorous and other nutrients grows out of control, sucking up oxygen that other marine species need to live. Few global communities remain untouched by the related challenges this broken chain create: insufficient clean water, food, and energy, and too much human and animal waste.
The Rich Earth Institute in Vermont runs a community-wide urine nutrient recovery program, which collects urine from homes and businesses, transports it for processing, and then supplies it as fertilizer to local farms.
One solution to this broken cycle is reclaiming urine and returning it back to the land. The Rich Earth Institute in Vermont is one of several organizations around the world working to divert and save urine for agricultural use. “The urine produced by an adult in one day contains enough fertilizer to grow all the wheat in one loaf of bread,” states their website.
Notably, while urine is not entirely sterile, it tends to harbor fewer pathogens than feces. That’s largely because urine has less organic matter and therefore less food for pathogens to feed on, but also because the urinary tract and the bladder have built-in antimicrobial defenses that kill many germs. In fact, the Rich Earth Institute says it’s safe to put your own urine onto crops grown for home consumption. Nonetheless, you’ll want to dilute it first because pee usually has too much nitrogen and can cause “fertilizer burn” if applied straight without dilution. Other projects to turn urine into fertilizer are in progress in Niger, South Africa, Kenya, Ethiopia, Sweden, Switzerland, The Netherlands, Australia, and France.
Eleven years ago, the Institute started a program that collects urine from homes and businesses, transports it for processing, and then supplies it as fertilizer to local farms. By 2021, the program included 180 donors producing over 12,000 gallons of urine each year. This urine is helping to fertilize hay fields at four partnering farms. Orner, the West Virginia professor, sees it as a success story. “They've shown how you can do this right--implementing it at a community level scale."