Genomic Data Has a Diversity Problem, But Global Efforts Are Underway to Fix It
Genomics has begun its golden age. Just 20 years ago, sequencing a single genome cost nearly $3 billion and took over a decade. Today, the same feat can be achieved for a few hundred dollars and the better part of a day . Suddenly, the prospect of sequencing not just individuals, but whole populations, has become feasible.
The genetic differences between humans may seem meager, only around 0.1 percent of the genome on average, but this variation can have profound effects on an individual's risk of disease, responsiveness to medication, and even the dosage level that would work best.
Already, initiatives like the U.K.'s 100,000 Genomes Project - now expanding to 1 million genomes - and other similarly massive sequencing projects in Iceland and the U.S., have begun collecting population-scale data in order to capture and study this variation.
The resulting data sets are immensely valuable to researchers and drug developers working to design new 'precision' medicines and diagnostics, and to gain insights that may benefit patients. Yet, because the majority of this data comes from developed countries with well-established scientific and medical infrastructure, the data collected so far is heavily biased towards Western populations with largely European ancestry.
This presents a startling and fast-emerging problem: groups that are under-represented in these datasets are likely to benefit less from the new wave of therapeutics, diagnostics, and insights, simply because they were tailored for the genetic profiles of people with European ancestry.
We may indeed be approaching a golden age of genomics-enabled precision medicine. But if the data bias persists then there is a risk, as with most golden ages throughout history, that the benefits will not be equally accessible to all, and existing inequalities will only be exacerbated.
To remedy the situation, a number of initiatives have sprung up to sequence genomes of under-represented groups, adding them to the datasets and ensuring that they too will benefit from the rapidly unfolding genomic revolution.
Global Gene Corp
The idea behind Global Gene Corp was born eight years ago in Harvard when Sumit Jamuar, co-founder and CEO, met up with his two other co-founders, both experienced geneticists, for a coffee.
"They were discussing the limitless applications of understanding your genetic code," said Jamuar, a business executive from New Delhi.
"And so, being a technology enthusiast type, I was excited and I turned to them and said hey, this is incredible! Could you sequence me and give me some insights? And they actually just turned around and said no, because it's not going to be useful for you - there's not enough reference for what a good Sumit looks like."
What started as a curiosity-driven conversation on the power of genomics ended with a commitment to tackle one of the field's biggest roadblocks - its lack of global representation.
Jamuar set out to begin with India, which has about 20 percent of the world's population, including over 4000 different ethnicities, but contributes less than 2 percent of genomic data, he told Leaps.org.
Eight years later, Global Gene Corp's sequencing initiative is well underway, and is the largest in the history of the Indian subcontinent. The program is being carried out in collaboration with biotech giant Regeneron, with support from the Indian government, local communities, and the Indian healthcare ecosystem. In August 2020, Global Gene Corp's work was recognized through the $1 million 2020 Roddenberry award for organizations that advance the vision of 'Star Trek' creator Gene Roddenberry to better humanity.
This problem has already begun to manifest itself in, for example, much higher levels of genetic misdiagnosis among non-Europeans tested for their risk of certain diseases, such as hypertrophic cardiomyopathy - an inherited disease of the heart muscle.
Global Gene Corp also focuses on developing and implementing AI and machine learning tools to make sense of the deluge of genomic data. These tools are increasingly used by both industry and academia to guide future research by identifying particularly promising or clinically interesting genetic variants. But if the underlying data is skewed European, then the effectiveness of the computational analysis - along with the future advances and avenues of research that emerge from it - will be skewed towards Europeans too.
This problem has already begun to manifest itself in, for example, much higher levels of genetic misdiagnosis among non-Europeans tested for their risk of certain diseases, such as hypertrophic cardiomyopathy - an inherited disease of the heart muscle. Most of the genetic variants used in these tests were identified as being causal for the disease from studies of European genomes. However, many of these variants differ both in their distribution and clinical significance across populations, leading to many patients of non-European ancestry receiving false-positive test results - as their benign genetic variants were misclassified as pathogenic. Had even a small number of genomes from other ethnicities been included in the initial studies, these misdiagnoses could have been avoided.
"Unless we have a data set which is unbiased and representative, we're never going to achieve the success that we want," Jamuar says.
"When Siri was first launched, she could hardly recognize an accent which was not of a certain type, so if I was trying to speak to Siri, I would have to repeat myself multiple times and try to mimic an accent which wasn't my accent so that she could understand it.
"But over time the voice recognition technology improved tremendously because the training data was expanded to include people of very diverse backgrounds and their accents, so the algorithms were trained to be able to pick that up and it dramatically improved the technology. That's the way we have to think about it - without that good-quality diverse data, we will never be able to achieve the full potential of the computational tools."
While mapping India's rich genetic diversity has been the organization's primary focus so far, they plan, in time, to expand their work to other under-represented groups in Asia, the Middle East, Africa, and Latin America.
"As other like-minded people and partners join the mission, it just accelerates the achievement of what we have set out to do, which is to map out and organize the world's genomic diversity so that we can enable high-quality life and longevity benefits for everyone, everywhere," Jamuar says.
Empowering African Genomics
Africa is the birthplace of our species, and today still retains an inordinate amount of total human genetic diversity. Groups that left Africa and went on to populate the rest of the world, some 50 to 100,000 years ago, were likely small in number and only took a fraction of the total genetic diversity with them. This ancient bottleneck means that no other group in the world can match the level of genetic diversity seen in modern African populations.
Despite Africa's central importance in understanding the history and extent of human genetic diversity, the genomics of African populations remains wildly understudied. Addressing this disparity has become a central focus of the H3Africa Consortium, an initiative formally launched in 2012 with support from the African Academy of Sciences, the U.S. National Institutes of Health, and the UK's Wellcome Trust. Today, H3Africa supports over 50 projects across the continent, on an array of different research areas in genetics relevant to the health and heredity of Africans.
"Africa is the cradle of Humankind. So what that really means is that the populations that are currently living in Africa are among some of the oldest populations on the globe, and we know that the longer populations have had to go through evolutionary phases, the more variation there is in the genomes of people who live presently," says Zane Lombard, a principal investigator at H3Africa and Associate Professor of Human Genetics at the University of the Witwatersrand in Johannesburg, South Africa.
"So for that reason, African populations carry a huge amount of genetic variation and diversity, which is pretty much uncaptured. There's still a lot to learn as far as novel variation is concerned by looking at and studying African genomes."
A recent landmark H3Africa study, led by Lombard and published in Nature in October, sequenced the genomes of over 400 African individuals from 50 ethno-linguistic groups - many of which had never been sampled before.
Despite the relatively modest number of individuals sequenced in the study, over three million previously undescribed genetic variants were found, and complex patterns of ancestral migration were uncovered.
"In some of these ethno-linguistic groups they don't have a word for DNA, so we've had to really think about how to make sure that we communicate the purposes of different studies to participants so that you have true informed consent," says Lombard.
"The objective," she explained, "was to try and fill some of the gaps for many of these populations for which we didn't have any whole genome sequences or any genetic variation data...because if we're thinking about the future of precision medicine, if the patient is a member of a specific group where we don't know a lot about the genomic variation that exists in that group, it makes it really difficult to start thinking about clinical interpretation of their data."
From H3Africa's conception, the consortium's goal has not only been to better represent Africa's staggering genetic diversity in genomic data sets, but also to build Africa's domestic genomics capabilities and empower a new generation of African researchers. By doing so, the hope is that Africans will be able to set their own genomics agenda, and leapfrog to new and better ways of doing the work.
"The training that has happened on the continent and the number of new scientists, new students, and fellows that have come through the process and are now enabled to start their own research groups, to grow their own research in their countries, to be a spokesperson for genomics research in their countries, and to build that political will to do these larger types of sequencing initiatives - that is really a significant outcome from H3Africa as well. Over and above all the science that's coming out," Lombard says.
"What has been created through H3Africa is just this locus of researchers and scientists and bioethicists who have the same goal at heart - to work towards adjusting the data bias and making sure that all global populations are represented in genomics."
Niklas Anzinger is the founder of Infinita VC based in the charter city of Prospera in Honduras. Infinita focuses on a new trend of charter cities and other forms of alternative jurisdictions. Healso hosts a podcast about how to accelerate the future by unblocking “stranded technologies”.This spring he was a part of the network city experiment Zuzalu spearheaded by Ethereum founder Vitalik Buterin where a few hundred invited guests from the spheres of longevity, biotechnology, crypto, artificial intelligence and investment came together to form a two-monthlong community. It has been described as the world’s first pop-up city. Every morning Vitalians would descend on a long breakfast—the menu had been carefully designed by famed radical longevity self-experimenter Bryan Johnson—and there is where I first met Anzinger who told me about Prospera. Intrigued to say the least, I caught up with him later the same week and the following is a record of our conversation.
Q. We are sitting here in the so-called pop-up network state Zuzalu temporarily realized in the village of Lusticia Bay by the beautiful Mediterranean Sea. To me this is an entirely new concept: What is a network state?
A. A network state is a highly aligned online community that has a level of in-person civility; it crowd-funds territory, and it eventually seeks diplomatic recognition. In a way it's about starting a new country. The term was coined by the crypto influencer and former CTO of Coinbase Balaji Srinivasan in a book by the same title last year [2022]. What many people don't know is that it is a more recent addition or innovation in a space called competitive governance. The idea is that you have multiple jurisdictions competing to provide you services as a customer. When you have competition among governments or government service providers, these entities are forced to provide you with a better service instead of the often worse service at higher prices or higher taxes that we're currently getting. The idea went from seasteading, which was hardly feasible because of costs, to charter cities getting public/private partnerships with existing governments and a level of legal autonomy, to special economic zones, to now network states.
Q. How do network states compare to charter cities and similar jurisdictions?
A. Charter cities and special economic zones were legal forks from other existing states. Dubai, Shenzhen in China, to some degree Hong Kong, to some degree Singapore are some examples. There's a host of other charter cities, one of which I'm based in myself, which is Prospera located in Honduras on the island Roatán. Charter cities provide the full stack of governance; they provide new laws and regulations, business registration, tax codes and governance services, Estonia style: you log on to the government platform and you get services as a citizen.
When conceptualizing network states, Balagi Srinivasan turns the idea of a charter city a bit on its head: he doesn't want to start with this full stack because it's still very hard to get these kinds of partnerships with government. It's very expensive and requires lots of experience and lots of social capital. He is saying that network states could instead start as an online community. They could have a level of alignment where they trade with each other; they have their own economy; they meet in person in regular gatherings like we're doing here in Zuzulu for two months, and then they negotiate with existing governments or host cities to get a certain degree of legal autonomy that is centered around a moral innovation. So, his idea is: don't focus on building a completely new country or city; focus on a moral innovation.
Q. What would be an example of such a moral innovation?
A. An example would be longevity—life is good; death is bad—let's see what we can do to foster progress around that moral innovation and see how we can get legal forks from the existing system that allow us to accelerate progress in that area. There is an increasing realization in the science that there are hallmarks of aging and that aging is a cause of other diseases like cancer, ALS or Alzheimer's. But aging is not recognized as a disease by the FDA in the United States and in most countries around the world, so it's very hard to get scientific funding for biotechnology that would attack the hallmarks of aging and allow us potentially to reverse aging and extend life. This is a significant shortcoming of existing government systems that groups such as the ones that have come together here in Montenegro are now seeking alternatives too. Charter cities and now network states are such alternatives.
Q. Would it not be better to work within the current systems, and try to improve them, rather than abandon them for new experimental jurisdictions?
A. There are numerous failures of public policies. These failures are hard, if not impossible, to reverse, because as soon as you have these policies, you have entrenched interests who benefit from the regulations. The only way to disrupt incumbent industries is with start-ups, but the way the system is set up makes it excessively hard for such start-ups to become big companies. In fact, larger companies are weaponizing the legal system against small companies, because they can afford the lawyers and the fixed cost of compliance.
I don't believe that our institutions in many developed countries are beyond hope. I just think it's easier to change them if you could point at successful examples. ‘Hey, this country or this zone is already doing it very successfully’; if they can extend people’s lifespan by 10 years, if they can reduce maternal mortality, and if they have a massive medical tourism where people come back healthier, then that is just very embarrassing for the FDA.
Q. Perhaps a comparison here would be the relationship between Hong Kong and China?
A. Correct, so having Hong Kong right in front of your door … ‘Hey, this capitalism thing seems to work, why don't we try it here?’ It was due to the very bold leadership by Deng Xiaoping that they experimented with it in the development zone of Shenzhen. It worked really well and then they expanded with more special economic zones that also worked.
Próspera is a private city and special economic zone on the island of Roatán in the Central American state of Honduras.
Q. Tell us about Prospera, the charter city in Honduras, that you are intimately connected with.
A. Honduras is a very poor country. It has a lot of crime, never had a single VC investment, and has a GDP per capita of 2,000 per year. Honduras has suffered tremendously. The goal of these special economic zones is to bring in economic development. That's their sole purpose. It's a homegrown innovation from Honduras that started in 2009 with a very forward-thinking statesman, Octavio Sanchez, who was the chief of staff to the president of Honduras, and then president. He had his own ideas about making Honduras a more decentralized system, where more of the power lies in the municipalities.
Inspired by the ideas of Nobel laureate economist Paul Romer, who gave a famous Ted Talk in 2009 about charter cities, Sanchez initiated a process that lasted for years and eventually led to the creation of a special economic zone legal regime that’s anchored in the Hunduran constitution that provides the highest legal autonomy in the world to these zones. There are today three special economic zones approved by the Honduran government: Prospera, Ciudad Morazan and Orchidea.
Q. How did you become interested and then involved in Prospera?
A. I read about it first in an article by Scott Alexander, a famous rationalist blogger, who wrote a very long article about Prospera, and I thought, this is amazing! Then I came to Prospera and I found it to be one of the most if not the most exciting project in the world going on right now and that it also opened my heart to the country and its people. Most of my friends there are Honduran, they have been working on this for 10 or more years. They want to remake Honduras and put it on the map as the place in the world where this legal and governance innovation started.
Q. To what extent is Prospera autonomous relative to the Honduran government?
A. What's interesting about the Honduran model is that it's anchored within the Honduran constitution, and it has a very clear framework for what's possible and what's not possible, and what's possible ensures the highest degree of legal autonomy anywhere seen in the world. Prospera has really pushed the model furthest in creating a common law-based polycentric legal system. The idea is that you don't have a legislature, instead you have common law and it's based on the best practice common law principles that a legal scholar named Tom W. Bell created.
One of the core ideas is that as a business you're not obligated to follow one regulatory monopoly like the FDA. You have regulatory flexibility so you can choose what you're regulated under. So, you can say: ‘if I do a medical clinic, I do it under Norwegian law here’. And you even have the possibility to amend it a bit. You're still required to have liability insurance, and have to agree to binding arbitration in case there's a legal dispute. And your insurance has to approve you. So, under that model the insurance becomes the regulator and they regulate through prices. The limiting factor is criminal law; Honduran criminal law fully applies. So does immigration law. And we pay taxes.
Q. Is there also an idea of creating a kind of healthy living there, and encourage medical tourism?
A. Yes, we specifically look for legal advantages in autonomy around creating new drugs, doing clinical trials, doing self-medication and experimentation. There is a stem cell clinic here and they're doing clinical trials. The island of Roatán is very easily accessible for American tourists. It's a beautiful island, and it's for regulatory reasons hard to do stem cell therapies in the United States, so they're flying in patients from the United States. Most of them are very savvy and often have PhDs in biotech and are able to assess the risk for themselves of taking drugs and doing clinical trials. We're also going to get a wellness center, and there have been ideas around establishing a peptide clinic and a compound pharmacy and things like that. We are developing a healthcare ecosystem.
Q. This kind of experimental tourism raises some ethical issues. What happens if patients are harmed? And what are the moral implications for society of these new treatments?
A. As a moral principle we believe in medical freedom: people have rights over their bodies, even at the (informed) risk of harm to themselves if no unconsenting third-parties are harmed; this is a fundamental right currently not protected effectively.
What we do differently is not changing ethical norms around safety and efficacy, we’re just changing the institutional setup. Instead of one centralized bureaucracy, like the FDA, we have regulatory pluralism that allows different providers of safety and efficacy to compete under market rules. Like under any legal system, common law in Prospera punishes malpractice, fraud, murder etc. This system will still produce safe and effective drugs, and it will still work with common sense legal notions like informed consent and liability for harm. There are regulations for medical practice, there is liability insurance and things like that. It will just do so more efficiently than the current way of doing things (unless it won’t, in which case it will change and evolve – or fail).
A direct moral benefit ´to what we do is that we increase accessibility. Typical gene therapies on the market cost $1 million dollars in the US. The gene therapy developed in Prospera costs $25,000. As to concern about whether such treatments are problematic, we do not share this perspective. We are for advancing science responsibly and we believe that both individuals and society stand to gain from improving the resiliency of the human body through advanced biotechnology.
Q. How does Prospera relate to the local Honduran population?
A. I think it's very important that our projects deliver local benefits and that they're well anchored in local communities. Because when you go to a new place, you're seen as a foreigner, and you're seen as potentially a danger or a threat. The most important thing for Prospera and Ciudad Morazan is to show we're creating jobs; we're creating employment; we're improving people's lives on the ground. Prospera is directly and indirectly employing 1,100 people. More than 2/3 of the people who are working for Prospera are Honduran. It has a lot of local service workers from the island, and it has educated Hondurans from the mainland for whom it's an alternative to going to the United States.
Q. What makes a good Prosperian citizen?
A. People in Prospera are very entrepreneurial. They're opening companies on a small scale. For example, Vehinia, who is the cook in the kitchen at Prospera, she's from the neighboring village and she started an NGO that is now funding a school where children from the local village can go to instead of a school that's 45 minutes away. There's very much a spirit of ‘let's exchange and trade with each other’. Some people might see that as a bit too commercial, but that's something about the culture that people accept and that people see as a good thing.
Q. Five years from now, if everything goes well, what do we see in Prospera?
A. I think Prospera will have at least 10,000 residents and I think Honduras hopefully will have more zones. There could be zones with a thriving industrial sector and sort of a labor-intensive economy and some that are very strong in pharmaceuticals, there could also be other zones for synthetic biology, and other zones focused on agriculture. The zones of Prospera, Ciudad Morazan and Orchidea are already showing the results we want to see, the results that we will eventually be measured by, and I'm tremendously excited about Honduras.
How to Measure Your Stress, with Dr. Rosalind Picard
Today’s podcast guest is Rosalind Picard, a researcher, inventor named on over 100 patents, entrepreneur, author, professor and engineer. When it comes to the science related to endowing computer software with emotional intelligence, she wrote the book. It’s published by MIT Press and called Affective Computing.
Dr. Picard is founder and director of the MIT Media Lab’s Affective Computing Research Group. Her research and engineering contributions have been recognized internationally. For example, she received the 2022 International Lombardy Prize for Computer Science Research, considered by many to be the Nobel prize in computer science.
Through her research and companies, Dr. Picard has developed wearable sensors, algorithms and systems for sensing, recognizing and responding to information about human emotion. Her products are focused on using fitness trackers to advance clinical quality treatments for a range of conditions.
Meanwhile, in just the past few years, numerous fitness tracking companies have released products with their own stress sensors and systems. You may have heard about Fitbit’s Stress Management Score, or Whoop’s Stress Monitor – these features and apps measure things like your heart rhythm and a certain type of invisible sweat to identify stress. They’re designed to raise awareness about forms of stress such as anxieties and anger, and suggest strategies like meditation to relax in real time when stress occurs.
But how well do these off-the-shelf gadgets work? There’s no one more knowledgeable and experienced than Rosalind Picard to explain the science behind these stress features, what they do exactly, how they might be able to help us, and their current shortcomings.
Dr. Picard is a member of the National Academy of Engineering and a Fellow of the National Academy of Inventors, and a popular speaker who’s given over a hundred invited keynote talks and a TED talk with over 2 million views. She holds a Bachelors in Electrical Engineering from Georgia Tech, and Masters and Doctorate degrees in Electrical Engineering and Computer Science from MIT. She lives in Newton, Massachusetts with her husband, where they’ve raised three sons.
In our conversation, we discuss stress scores on fitness trackers to improve well-being. She describes the difference between commercial products that might help people become more mindful of their health and products that are FDA approved and really capable of advancing the science. We also talk about several fascinating findings and concepts discovered in Dr. Picard’s lab including the multiple arousal theory, a phenomenon you’ll want to hear about. And we explore the complexity of stress, one reason it’s so tough to measure. For example, many forms of stress are actually good for us. Can fitness trackers tell the difference between stress that’s healthy and unhealthy?
Show links:
- Dr. Picard’s book, Affective Computing
- Dr. Picard’s bio
- Dr. Picard on Twitter
- Dr. Picard’s company, Empatica - https://www.empatica.com/ - The FDA-cleared Empatica Health Monitoring Platform provides accurate, continuous health insights for researchers and clinicians, collected in the real world
- Empatica Twitter
- Dr. Picard and her team have published hundreds of peer-reviewed articles across AI, Machine Learning, Affective Computing, Digital Health, and Human-computer interaction.
- Dr. Picard’s TED talk
Rosalind Picard