Genomic Data Has a Diversity Problem, But Global Efforts Are Underway to Fix It
Genomics has begun its golden age. Just 20 years ago, sequencing a single genome cost nearly $3 billion and took over a decade. Today, the same feat can be achieved for a few hundred dollars and the better part of a day . Suddenly, the prospect of sequencing not just individuals, but whole populations, has become feasible.
The genetic differences between humans may seem meager, only around 0.1 percent of the genome on average, but this variation can have profound effects on an individual's risk of disease, responsiveness to medication, and even the dosage level that would work best.
Already, initiatives like the U.K.'s 100,000 Genomes Project - now expanding to 1 million genomes - and other similarly massive sequencing projects in Iceland and the U.S., have begun collecting population-scale data in order to capture and study this variation.
The resulting data sets are immensely valuable to researchers and drug developers working to design new 'precision' medicines and diagnostics, and to gain insights that may benefit patients. Yet, because the majority of this data comes from developed countries with well-established scientific and medical infrastructure, the data collected so far is heavily biased towards Western populations with largely European ancestry.
This presents a startling and fast-emerging problem: groups that are under-represented in these datasets are likely to benefit less from the new wave of therapeutics, diagnostics, and insights, simply because they were tailored for the genetic profiles of people with European ancestry.
We may indeed be approaching a golden age of genomics-enabled precision medicine. But if the data bias persists then there is a risk, as with most golden ages throughout history, that the benefits will not be equally accessible to all, and existing inequalities will only be exacerbated.
To remedy the situation, a number of initiatives have sprung up to sequence genomes of under-represented groups, adding them to the datasets and ensuring that they too will benefit from the rapidly unfolding genomic revolution.
Global Gene Corp
The idea behind Global Gene Corp was born eight years ago in Harvard when Sumit Jamuar, co-founder and CEO, met up with his two other co-founders, both experienced geneticists, for a coffee.
"They were discussing the limitless applications of understanding your genetic code," said Jamuar, a business executive from New Delhi.
"And so, being a technology enthusiast type, I was excited and I turned to them and said hey, this is incredible! Could you sequence me and give me some insights? And they actually just turned around and said no, because it's not going to be useful for you - there's not enough reference for what a good Sumit looks like."
What started as a curiosity-driven conversation on the power of genomics ended with a commitment to tackle one of the field's biggest roadblocks - its lack of global representation.
Jamuar set out to begin with India, which has about 20 percent of the world's population, including over 4000 different ethnicities, but contributes less than 2 percent of genomic data, he told Leaps.org.
Eight years later, Global Gene Corp's sequencing initiative is well underway, and is the largest in the history of the Indian subcontinent. The program is being carried out in collaboration with biotech giant Regeneron, with support from the Indian government, local communities, and the Indian healthcare ecosystem. In August 2020, Global Gene Corp's work was recognized through the $1 million 2020 Roddenberry award for organizations that advance the vision of 'Star Trek' creator Gene Roddenberry to better humanity.
This problem has already begun to manifest itself in, for example, much higher levels of genetic misdiagnosis among non-Europeans tested for their risk of certain diseases, such as hypertrophic cardiomyopathy - an inherited disease of the heart muscle.
Global Gene Corp also focuses on developing and implementing AI and machine learning tools to make sense of the deluge of genomic data. These tools are increasingly used by both industry and academia to guide future research by identifying particularly promising or clinically interesting genetic variants. But if the underlying data is skewed European, then the effectiveness of the computational analysis - along with the future advances and avenues of research that emerge from it - will be skewed towards Europeans too.
This problem has already begun to manifest itself in, for example, much higher levels of genetic misdiagnosis among non-Europeans tested for their risk of certain diseases, such as hypertrophic cardiomyopathy - an inherited disease of the heart muscle. Most of the genetic variants used in these tests were identified as being causal for the disease from studies of European genomes. However, many of these variants differ both in their distribution and clinical significance across populations, leading to many patients of non-European ancestry receiving false-positive test results - as their benign genetic variants were misclassified as pathogenic. Had even a small number of genomes from other ethnicities been included in the initial studies, these misdiagnoses could have been avoided.
"Unless we have a data set which is unbiased and representative, we're never going to achieve the success that we want," Jamuar says.
"When Siri was first launched, she could hardly recognize an accent which was not of a certain type, so if I was trying to speak to Siri, I would have to repeat myself multiple times and try to mimic an accent which wasn't my accent so that she could understand it.
"But over time the voice recognition technology improved tremendously because the training data was expanded to include people of very diverse backgrounds and their accents, so the algorithms were trained to be able to pick that up and it dramatically improved the technology. That's the way we have to think about it - without that good-quality diverse data, we will never be able to achieve the full potential of the computational tools."
While mapping India's rich genetic diversity has been the organization's primary focus so far, they plan, in time, to expand their work to other under-represented groups in Asia, the Middle East, Africa, and Latin America.
"As other like-minded people and partners join the mission, it just accelerates the achievement of what we have set out to do, which is to map out and organize the world's genomic diversity so that we can enable high-quality life and longevity benefits for everyone, everywhere," Jamuar says.
Empowering African Genomics
Africa is the birthplace of our species, and today still retains an inordinate amount of total human genetic diversity. Groups that left Africa and went on to populate the rest of the world, some 50 to 100,000 years ago, were likely small in number and only took a fraction of the total genetic diversity with them. This ancient bottleneck means that no other group in the world can match the level of genetic diversity seen in modern African populations.
Despite Africa's central importance in understanding the history and extent of human genetic diversity, the genomics of African populations remains wildly understudied. Addressing this disparity has become a central focus of the H3Africa Consortium, an initiative formally launched in 2012 with support from the African Academy of Sciences, the U.S. National Institutes of Health, and the UK's Wellcome Trust. Today, H3Africa supports over 50 projects across the continent, on an array of different research areas in genetics relevant to the health and heredity of Africans.
"Africa is the cradle of Humankind. So what that really means is that the populations that are currently living in Africa are among some of the oldest populations on the globe, and we know that the longer populations have had to go through evolutionary phases, the more variation there is in the genomes of people who live presently," says Zane Lombard, a principal investigator at H3Africa and Associate Professor of Human Genetics at the University of the Witwatersrand in Johannesburg, South Africa.
"So for that reason, African populations carry a huge amount of genetic variation and diversity, which is pretty much uncaptured. There's still a lot to learn as far as novel variation is concerned by looking at and studying African genomes."
A recent landmark H3Africa study, led by Lombard and published in Nature in October, sequenced the genomes of over 400 African individuals from 50 ethno-linguistic groups - many of which had never been sampled before.
Despite the relatively modest number of individuals sequenced in the study, over three million previously undescribed genetic variants were found, and complex patterns of ancestral migration were uncovered.
"In some of these ethno-linguistic groups they don't have a word for DNA, so we've had to really think about how to make sure that we communicate the purposes of different studies to participants so that you have true informed consent," says Lombard.
"The objective," she explained, "was to try and fill some of the gaps for many of these populations for which we didn't have any whole genome sequences or any genetic variation data...because if we're thinking about the future of precision medicine, if the patient is a member of a specific group where we don't know a lot about the genomic variation that exists in that group, it makes it really difficult to start thinking about clinical interpretation of their data."
From H3Africa's conception, the consortium's goal has not only been to better represent Africa's staggering genetic diversity in genomic data sets, but also to build Africa's domestic genomics capabilities and empower a new generation of African researchers. By doing so, the hope is that Africans will be able to set their own genomics agenda, and leapfrog to new and better ways of doing the work.
"The training that has happened on the continent and the number of new scientists, new students, and fellows that have come through the process and are now enabled to start their own research groups, to grow their own research in their countries, to be a spokesperson for genomics research in their countries, and to build that political will to do these larger types of sequencing initiatives - that is really a significant outcome from H3Africa as well. Over and above all the science that's coming out," Lombard says.
"What has been created through H3Africa is just this locus of researchers and scientists and bioethicists who have the same goal at heart - to work towards adjusting the data bias and making sure that all global populations are represented in genomics."
In the 1966 movie "Fantastic Voyage," actress Raquel Welch and her submarine were shrunk to the size of a cell in order to eliminate a blood clot in a scientist's brain. Now, 55 years later, the scenario is becoming closer to reality.
California-based startup Bionaut Labs has developed a nanobot about the size of a grain of rice that's designed to transport medication to the exact location in the body where it's needed. If you think about it, the conventional way to deliver medicine makes little sense: A painkiller affects the entire body instead of just the arm that's hurting, and chemotherapy is flushed through all the veins instead of precisely targeting the tumor.
"Chemotherapy is delivered systemically," Bionaut-founder and CEO Michael Shpigelmacher says. "Often only a small percentage arrives at the location where it is actually needed."
But what if it was possible to send a tiny robot through the body to attack a tumor or deliver a drug at exactly the right location?
Several startups and academic institutes worldwide are working to develop such a solution but Bionaut Labs seems the furthest along in advancing its invention. "You can think of the Bionaut as a tiny screw that moves through the veins as if steered by an invisible screwdriver until it arrives at the tumor," Shpigelmacher explains. Via Zoom, he shares the screen of an X-ray machine in his Culver City lab to demonstrate how the half-transparent, yellowish device winds its way along the spine in the body. The nanobot contains a tiny but powerful magnet. The "invisible screwdriver" is an external magnetic field that rotates that magnet inside the device and gets it to move and change directions.
The current model has a diameter of less than a millimeter. Shpigelmacher's engineers could build the miniature vehicle even smaller but the current size has the advantage of being big enough to see with bare eyes. It can also deliver more medicine than a tinier version. In the Zoom demonstration, the micorobot is injected into the spine, not unlike an epidural, and pulled along the spine through an outside magnet until the Bionaut reaches the brainstem. Depending which organ it needs to reach, it could be inserted elsewhere, for instance through a catheter.
"The hope is that we can develop a vehicle to transport medication deep into the body," says Max Planck scientist Tian Qiu.
Imagine moving a screw through a steak with a magnet — that's essentially how the device works. But of course, the Bionaut is considerably different from an ordinary screw: "At the right location, we give a magnetic signal, and it unloads its medicine package," Shpigelmacher says.
To start, Bionaut Labs wants to use its device to treat Parkinson's disease and brain stem gliomas, a type of cancer that largely affects children and teenagers. About 300 to 400 young people a year are diagnosed with this type of tumor. Radiation and brain surgery risk damaging sensitive brain tissue, and chemotherapy often doesn't work. Most children with these tumors live less than 18 months. A nanobot delivering targeted chemotherapy could be a gamechanger. "These patients really don't have any other hope," Shpigelmacher says.
Of course, the main challenge of the developing such a device is guaranteeing that it's safe. Because tissue is so sensitive, any mistake could risk disastrous results. In recent years, Bionaut has tested its technology in dozens of healthy sheep and pigs with no major adverse effects. Sheep make a good stand-in for humans because their brains and spines are similar to ours.
The Bionaut device is about the size of a grain of rice.
Bionaut Labs
"As the Bionaut moves through brain tissue, it creates a transient track that heals within a few weeks," Shpigelmacher says. The company is hoping to be the first to test a nanobot in humans. In December 2022, it announced that a recent round of funding drew $43.2 million, for a total of 63.2 million, enabling more research and, if all goes smoothly, human clinical trials by early next year.
Once the technique has been perfected, further applications could include addressing other kinds of brain disorders that are considered incurable now, such as Alzheimer's or Huntington's disease. "Microrobots could serve as a bridgehead, opening the gateway to the brain and facilitating precise access of deep brain structure – either to deliver medication, take cell samples or stimulate specific brain regions," Shpigelmacher says.
Robot-assisted hybrid surgery with artificial intelligence is already used in state-of-the-art surgery centers, and many medical experts believe that nanorobotics will be the instrument of the future. In 2016, three scientists were awarded the Nobel Prize in Chemistry for their development of "the world's smallest machines," nano "elevators" and minuscule motors. Since then, the scientific experiments have progressed to the point where applicable devices are moving closer to actually being implemented.
Bionaut's technology was initially developed by a research team lead by Peer Fischer, head of the independent Micro Nano and Molecular Systems Lab at the Max Planck Institute for Intelligent Systems in Stuttgart, Germany. Fischer is considered a pioneer in the research of nano systems, which he began at Harvard University more than a decade ago. He and his team are advising Bionaut Labs and have licensed their technology to the company.
"The hope is that we can develop a vehicle to transport medication deep into the body," says Max Planck scientist Tian Qiu, who leads the cooperation with Bionaut Labs. He agrees with Shpigelmacher that the Bionaut's size is perfect for transporting medication loads and is researching potential applications for even smaller nanorobots, especially in the eye, where the tissue is extremely sensitive. "Nanorobots can sneak through very fine tissue without causing damage."
In "Fantastic Voyage," Raquel Welch's adventures inside the body of a dissident scientist let her swim through his veins into his brain, but her shrunken miniature submarine is attacked by antibodies; she has to flee through the nerves into the scientist's eye where she escapes into freedom on a tear drop. In reality, the exit in the lab is much more mundane. The Bionaut simply leaves the body through the same port where it entered. But apart from the dramatization, the "Fantastic Voyage" was almost prophetic, or, as Shpigelmacher says, "Science fiction becomes science reality."
This article was first published by Leaps.org on April 12, 2021.
How the Human Brain Project Built a Mind of its Own
In 2009, neuroscientist Henry Markram gave an ambitious TED talk. “Our mission is to build a detailed, realistic computer model of the human brain,” he said, naming three reasons for this unmatched feat of engineering. One was because understanding the human brain was essential to get along in society. Another was because experimenting on animal brains could only get scientists so far in understanding the human ones. Third, medicines for mental disorders weren’t good enough. “There are two billion people on the planet that are affected by mental disorders, and the drugs that are used today are largely empirical,” Markram said. “I think that we can come up with very concrete solutions on how to treat disorders.”
Markram's arguments were very persuasive. In 2013, the European Commission launched the Human Brain Project, or HBP, as part of its Future and Emerging Technologies program. Viewed as Europe’s chance to try to win the “brain race” between the U.S., China, Japan, and other countries, the project received about a billion euros in funding with the goal to simulate the entire human brain on a supercomputer, or in silico, by 2023.
Now, after 10 years of dedicated neuroscience research, the HBP is coming to an end. As its many critics warned, it did not manage to build an entire human brain in silico. Instead, it achieved a multifaceted array of different goals, some of them unexpected.
Scholars have found that the project did help advance neuroscience more than some detractors initially expected, specifically in the area of brain simulations and virtual models. Using an interdisciplinary approach of combining technology, such as AI and digital simulations, with neuroscience, the HBP worked to gain a deeper understanding of the human brain’s complicated structure and functions, which in some cases led to novel treatments for brain disorders. Lastly, through online platforms, the HBP spearheaded a previously unmatched level of global neuroscience collaborations.
Simulating a human brain stirs up controversy
Right from the start, the project was plagued with controversy and condemnation. One of its prominent critics was Yves Fregnac, a professor in cognitive science at the Polytechnic Institute of Paris and research director at the French National Centre for Scientific Research. Fregnac argued in numerous articles that the HBP was overfunded based on proposals with unrealistic goals. “This new way of over-selling scientific targets, deeply aligned with what modern society expects from mega-sciences in the broad sense (big investment, big return), has been observed on several occasions in different scientific sub-fields,” he wrote in one of his articles, “before invading the field of brain sciences and neuromarketing.”
"A human brain model can simulate an experiment a million times for many different conditions, but the actual human experiment can be performed only once or a few times," said Viktor Jirsa, a professor at Aix-Marseille University.
Responding to such critiques, the HBP worked to restructure the effort in its early days with new leadership, organization, and goals that were more flexible and attainable. “The HBP got a more versatile, pluralistic approach,” said Viktor Jirsa, a professor at Aix-Marseille University and one of the HBP lead scientists. He believes that these changes fixed at least some of HBP’s issues. “The project has been on a very productive and scientifically fruitful course since then.”
After restructuring, the HBP became a European hub on brain research, with hundreds of scientists joining its growing network. The HBP created projects focused on various brain topics, from consciousness to neurodegenerative diseases. HBP scientists worked on complex subjects, such as mapping out the brain, combining neuroscience and robotics, and experimenting with neuromorphic computing, a computational technique inspired by the human brain structure and function—to name just a few.
Simulations advance knowledge and treatment options
In 2013, it seemed that bringing neuroscience into a digital age would be farfetched, but research within the HBP has made this achievable. The virtual maps and simulations various HBP teams create through brain imaging data make it easier for neuroscientists to understand brain developments and functions. The teams publish these models on the HBP’s EBRAINS online platform—one of the first to offer access to such data to neuroscientists worldwide via an open-source online site. “This digital infrastructure is backed by high-performance computers, with large datasets and various computational tools,” said Lucy Xiaolu Wang, an assistant professor in the Resource Economics Department at the University of Massachusetts Amherst, who studies the economics of the HBP. That means it can be used in place of many different types of human experimentation.
Jirsa’s team is one of many within the project that works on virtual brain models and brain simulations. Compiling patient data, Jirsa and his team can create digital simulations of different brain activities—and repeat these experiments many times, which isn’t often possible in surgeries on real brains. “A human brain model can simulate an experiment a million times for many different conditions,” Jirsa explained, “but the actual human experiment can be performed only once or a few times.” Using simulations also saves scientists and doctors time and money when looking at ways to diagnose and treat patients with brain disorders.
Compiling patient data, scientists can create digital simulations of different brain activities—and repeat these experiments many times.
The Human Brain Project
Simulations can help scientists get a full picture that otherwise is unattainable. “Another benefit is data completion,” added Jirsa, “in which incomplete data can be complemented by the model. In clinical settings, we can often measure only certain brain areas, but when linked to the brain model, we can enlarge the range of accessible brain regions and make better diagnostic predictions.”
With time, Jirsa’s team was able to move into patient-specific simulations. “We advanced from generic brain models to the ability to use a specific patient’s brain data, from measurements like MRI and others, to create individualized predictive models and simulations,” Jirsa explained. He and his team are working on this personalization technique to treat patients with epilepsy. According to the World Health Organization, about 50 million people worldwide suffer from epilepsy, a disorder that causes recurring seizures. While some epilepsy causes are known others remain an enigma, and many are hard to treat. For some patients whose epilepsy doesn’t respond to medications, removing part of the brain where seizures occur may be the only option. Understanding where in the patients’ brains seizures arise can give scientists a better idea of how to treat them and whether to use surgery versus medications.
“We apply such personalized models…to precisely identify where in a patient’s brain seizures emerge,” Jirsa explained. “This guides individual surgery decisions for patients for which surgery is the only treatment option.” He credits the HBP for the opportunity to develop this novel approach. “The personalization of our epilepsy models was only made possible by the Human Brain Project, in which all the necessary tools have been developed. Without the HBP, the technology would not be in clinical trials today.”
Personalized simulations can significantly advance treatments, predict the outcome of specific medical procedures and optimize them before actually treating patients. Jirsa is watching this happen firsthand in his ongoing research. “Our technology for creating personalized brain models is now used in a large clinical trial for epilepsy, funded by the French state, where we collaborate with clinicians in hospitals,” he explained. “We have also founded a spinoff company called VB Tech (Virtual Brain Technologies) to commercialize our personalized brain model technology and make it available to all patients.”
The Human Brain Project created a level of interconnectedness within the neuroscience research community that never existed before—a network not unlike the brain’s own.
Other experts believe it’s too soon to tell whether brain simulations could change epilepsy treatments. “The life cycle of developing treatments applicable to patients often runs over a decade,” Wang stated. “It is still too early to draw a clear link between HBP’s various project areas with patient care.” However, she admits that some studies built on the HBP-collected knowledge are already showing promise. “Researchers have used neuroscientific atlases and computational tools to develop activity-specific stimulation programs that enabled paraplegic patients to move again in a small-size clinical trial,” Wang said. Another intriguing study looked at simulations of Alzheimer’s in the brain to understand how it evolves over time.
Some challenges remain hard to overcome even with computer simulations. “The major challenge has always been the parameter explosion, which means that many different model parameters can lead to the same result,” Jirsa explained. An example of this parameter explosion could be two different types of neurodegenerative conditions, such as Parkinson’s and Huntington’s diseases. Both afflict the same area of the brain, the basal ganglia, which can affect movement, but are caused by two different underlying mechanisms. “We face the same situation in the living brain, in which a large range of diverse mechanisms can produce the same behavior,” Jirsa said. The simulations still have to overcome the same challenge.
Understanding where in the patients’ brains seizures arise can give scientists a better idea of how to treat them and whether to use surgery versus medications.
The Human Brain Project
A network not unlike the brain’s own
Though the HBP will be closing this year, its legacy continues in various studies, spin-off companies, and its online platform, EBRAINS. “The HBP is one of the earliest brain initiatives in the world, and the 10-year long-term goal has united many researchers to collaborate on brain sciences with advanced computational tools,” Wang said. “Beyond the many research articles and projects collaborated on during the HBP, the online neuroscience research infrastructure EBRAINS will be left as a legacy even after the project ends.”
Those who worked within the HBP see the end of this project as the next step in neuroscience research. “Neuroscience has come closer to very meaningful applications through the systematic link with new digital technologies and collaborative work,” Jirsa stated. “In that way, the project really had a pioneering role.” It also created a level of interconnectedness within the neuroscience research community that never existed before—a network not unlike the brain’s own. “Interconnectedness is an important advance and prerequisite for progress,” Jirsa said. “The neuroscience community has in the past been rather fragmented and this has dramatically changed in recent years thanks to the Human Brain Project.”
According to its website, by 2023 HBP’s network counted over 500 scientists from over 123 institutions and 16 different countries, creating one of the largest multi-national research groups in the world. Even though the project hasn’t produced the in-silico brain as Markram envisioned it, the HBP created a communal mind with immense potential. “It has challenged us to think beyond the boundaries of our own laboratories,” Jirsa said, “and enabled us to go much further together than we could have ever conceived going by ourselves.”