Bad Actors Getting Your Health Data Is the FBI’s Latest Worry
In February 2015, the health insurer Anthem revealed that criminal hackers had gained access to the company's servers, exposing the personal information of nearly 79 million patients. It's the largest known healthcare breach in history.
FBI agents worry that the vast amounts of healthcare data being generated for precision medicine efforts could leave the U.S. vulnerable to cyber and biological attacks.
That year, the data of millions more would be compromised in one cyberattack after another on American insurers and other healthcare organizations. In fact, for the past several years, the number of reported data breaches has increased each year, from 199 in 2010 to 344 in 2017, according to a September 2018 analysis in the Journal of the American Medical Association.
The FBI's Edward You sees this as a worrying trend. He says hackers aren't just interested in your social security or credit card number. They're increasingly interested in stealing your medical information. Hackers can currently use this information to make fake identities, file fraudulent insurance claims, and order and sell expensive drugs and medical equipment. But beyond that, a new kind of cybersecurity threat is around the corner.
Mr. You and others worry that the vast amounts of healthcare data being generated for precision medicine efforts could leave the U.S. vulnerable to cyber and biological attacks. In the wrong hands, this data could be used to exploit or extort an individual, discriminate against certain groups of people, make targeted bioweapons, or give another country an economic advantage.
Precision medicine, of course, is the idea that medical treatments can be tailored to individuals based on their genetics, environment, lifestyle or other traits. But to do that requires collecting and analyzing huge quantities of health data from diverse populations. One research effort, called All of Us, launched by the U.S. National Institutes of Health last year, aims to collect genomic and other healthcare data from one million participants with the goal of advancing personalized medical care.
Other initiatives are underway by academic institutions and healthcare organizations. Electronic medical records, genetic tests, wearable health trackers, mobile apps, and social media are all sources of valuable healthcare data that a bad actor could potentially use to learn more about an individual or group of people.
"When you aggregate all of that data together, that becomes a very powerful profile of who you are," Mr. You says.
A supervisory special agent in the biological countermeasures unit within the FBI's weapons of mass destruction directorate, it's Mr. You's job to imagine worst-case bioterror scenarios and figure out how to prevent and prepare for them.
That used to mean focusing on threats like anthrax, Ebola, and smallpox—pathogens that could be used to intentionally infect people—"basically the dangerous bugs," as he puts it. In recent years, advances in gene editing and synthetic biology have given rise to fears that rogue, or even well-intentioned, scientists could create a virulent virus that's intentionally, or unintentionally, released outside the lab.
"If a foreign source, especially a criminal one, has your biological information, then they might have some particular insights into what your future medical needs might be and exploit that."
While Mr. You is still tracking those threats, he's been traveling around the country talking to scientists, lawyers, software engineers, cyber security professionals, government officials and CEOs about new security threats—those posed by genetic and other biological data.
Emerging threats
Mr. You says one possible situation he can imagine is the potential for nefarious actors to use an individual's sensitive medical information to extort or blackmail that person.
"If a foreign source, especially a criminal one, has your biological information, then they might have some particular insights into what your future medical needs might be and exploit that," he says. For instance, "what happens if you have a singular medical condition and an outside entity says they have a treatment for your condition?" You could get talked into paying a huge sum of money for a treatment that ends up being bogus.
Or what if hackers got a hold of a politician or high-profile CEO's health records? Say that person had a disease-causing genetic mutation that could affect their ability to carry out their job in the future and hackers threatened to expose that information. These scenarios may seem far-fetched, but Mr. You thinks they're becoming increasingly plausible.
On a wider scale, Kavita Berger, a scientist at Gryphon Scientific, a Washington, D.C.-area life sciences consulting firm, worries that data from different populations could be used to discriminate against certain groups of people, like minorities and immigrants.
For instance, the advocacy group Human Rights Watch in 2017 flagged a concerning trend in China's Xinjiang territory, a region with a history of government repression. Police there had purchased 12 DNA sequencers and were collecting and cataloging DNA samples from people to build a national database.
"The concern is that this particular province has a huge population of the Muslim minority in China," Ms. Berger says. "Now they have a really huge database of genetic sequences. You have to ask, why does a police station need 12 next-generation sequencers?"
Also alarming is the potential that large amounts of data from different groups of people could lead to customized bioweapons if that data ends up in the wrong hands.
Eleonore Pauwels, a research fellow on emerging cybertechnologies at United Nations University's Centre for Policy Research, says new insights gained from genomic and other data will give scientists a better understanding of how diseases occur and why certain people are more susceptible to certain diseases.
"As you get more and more knowledge about the genomic picture and how the microbiome and the immune system of different populations function, you could get a much deeper understanding about how you could target different populations for treatment but also how you could eventually target them with different forms of bioagents," Ms. Pauwels says.
Economic competitiveness
Another reason hackers might want to gain access to large genomic and other healthcare datasets is to give their country a leg up economically. Many large cyber-attacks on U.S. healthcare organizations have been tied to Chinese hacking groups.
"This is a biological space race and we just haven't woken up to the fact that we're in this race."
"It's becoming clear that China is increasingly interested in getting access to massive data sets that come from different countries," Ms. Pauwels says.
A year after U.S. President Barack Obama conceived of the Precision Medicine Initiative in 2015—later renamed All of Us—China followed suit, announcing the launch of a 15-year, $9 billion precision health effort aimed at turning China into a global leader in genomics.
Chinese genomics companies, too, are expanding their reach outside of Asia. One company, WuXi NextCODE, which has offices in Shanghai, Reykjavik, and Cambridge, Massachusetts, has built an extensive library of genomes from the U.S., China and Iceland, and is now setting its sights on Ireland.
Another Chinese company, BGI, has partnered with Children's Hospital of Philadelphia and Sinai Health System in Toronto, and also formed a collaboration with the Smithsonian Institute to sequence all species on the planet. BGI has built its own advanced genomic sequencing machines to compete with U.S.-based Illumina.
Mr. You says having access to all this data could lead to major breakthroughs in healthcare, such as new blockbuster drugs. "Whoever has the largest, most diverse dataset is truly going to win the day and come up with something very profitable," he says.
Some direct-to-consumer genetic testing companies with offices in the U.S., like Dante Labs, also use BGI to process customers' DNA.
Experts worry that China could race ahead the U.S. in precision medicine because of Chinese laws governing data sharing. Currently, China prohibits the exportation of genetic data without explicit permission from the government. Mr. You says this creates an asymmetry in data sharing between the U.S. and China.
"This is a biological space race and we just haven't woken up to the fact that we're in this race," he said in January at an American Society for Microbiology conference in Washington, D.C. "We don't have access to their data. There is absolutely no reciprocity."
Protecting your data
While Mr. You has been stressing the importance of data security to anyone who will listen, the National Academies of Sciences, Engineering, and Medicine, which makes scientific and policy recommendations on issues of national importance, has commissioned a study on "safeguarding the bioeconomy."
In the meantime, Ms. Berger says organizations that deal with people's health data should assess their security risks and identify potential vulnerabilities in their systems.
As for what individuals can do to protect themselves, she urges people to think about the different ways they're sharing healthcare data—such as via mobile health apps and wearables.
"Ask yourself, what's the benefit of sharing this? What are the potential consequences of sharing this?" she says.
Mr. You also cautions people to think twice before taking consumer DNA tests. They may seem harmless, he says, but at the end of the day, most people don't know where their genetic information is going. "If your genetic sequence is taken, once it's gone, it's gone. There's nothing you can do about it."
Have you felt a bit like an armchair epidemiologist lately? Maybe you've been poring over coronavirus statistics on your county health department's website or on the pages of your local newspaper.
If the percentage of positive tests steadily stays under 8 percent, that's generally a good sign.
You're likely to find numbers and charts but little guidance about how to interpret them, let alone use them to make day-to-day decisions about pandemic safety precautions.
Enter the gurus. We asked several experts to provide guidance for laypeople about how to navigate the numbers. Here's a look at several common COVID-19 statistics along with tips about how to understand them.
Case Counts: Consider the Context
The number of confirmed COVID-19 cases in American counties is widely available. Local and state health departments should provide them online, or you can easily look them up at The New York Times' coronavirus database. However, you need to be cautious about interpreting them.
"Case counts are the obvious numbers to look at. But they're probably the hardest thing to sort out," said Dr. Jeff Martin, an epidemiologist at the University of California at San Francisco.
That's because case counts by themselves aren't a good window into how the coronavirus is affecting your community since they rely on testing. And testing itself varies widely from day to day and community to community.
"The more testing that's done, the more infections you'll pick up," explained Dr. F. Perry Wilson, a physician at Yale University. The numbers can also be thrown off when tests are limited to certain groups of people.
"If the tests are being mostly given to people with a high probability of having been infected -- for example, they have had symptoms or work in a high-risk setting -- then we expect lots of the tests to be positive. But that doesn't tell us what proportion of the general public is likely to have been infected," said Eleanor Murray, an epidemiologist at Boston University.
These Stats Are More Meaningful
According to Dr. Wilson, it's more useful to keep two other statistics in mind: the number of COVID tests that are being performed in your community and the percentage that turn up positive, showing that people have the disease. (These numbers may or may not be available locally. Check the websites of your community's health department and local news media outlets.)
If the number of people being tested is going up, but the percentage of positive tests is going down, Dr. Wilson said, that's a good sign. But if both numbers are going up – the number of people tested and the percentage of positive results – then "that's a sign that there are more infections burning in the community."
It's especially worrisome if the percentage of positive cases is growing compared to previous days or weeks, he said. According to him, that's a warning of a "high-risk situation."
Dr. George Rutherford, an epidemiologist at University of California at San Francisco, offered this tip: If the percentage of positive tests steadily stays under 8 percent, that's generally a good sign.
There's one more caveat about case counts. It takes an average of a week for someone to be infected with COVID-19, develop symptoms, and get tested, Dr. Rutherford said. It can take an additional several days for those test results to be reported to the county health department. This means that case numbers don't represent infections happening right now, but instead are a picture of the state of the pandemic more than a week ago.
Hospitalizations: Focus on Current Statistics
You should be able to find numbers about how many people in your community are currently hospitalized – or have been hospitalized – with diagnoses of COVID-19. But experts say these numbers aren't especially revealing unless you're able to see the number of new hospitalizations over time and track whether they're rising or falling. This number often isn't publicly available, however.
If new hospitalizations are increasing, "you may want to react by being more careful yourself."
And there's an important caveat: "The problem with hospitalizations is that they do lag," UC San Francisco's Dr. Martin said, since it takes time for someone to become ill enough to need to be hospitalized. "They tell you how much virus was being transmitted in your community 2 or 2.5 weeks ago."
Also, he said, people should be cautious about comparing new hospitalization rates between communities unless they're adjusted to account for the number of more-vulnerable older people.
Still, if new hospitalizations are increasing, he said, "you may want to react by being more careful yourself."
Deaths: They're an Even More Delayed Headline
Cable news networks obsessively track the number of coronavirus deaths nationwide, and death counts for every county in the country are available online. Local health departments and media websites may provide charts tracking the growth in deaths over time in your community.
But while death rates offer insight into the disease's horrific toll, they're not useful as an instant snapshot of the pandemic in your community because severely ill patients are typically sick for weeks. Instead, think of them as a delayed headline.
"These numbers don't tell you what's happening today. They tell you how much virus was being transmitted 3-4 weeks ago," Dr. Martin said.
'Reproduction Value': It May Be Revealing
You're not likely to find an available "reproduction value" for your community, but it is available for your state and may be useful.
A reproduction value, also known as R0 or R-naught, "tells us how many people on average we expect will be infected from a single case if we don't take any measures to intervene and if no one has been infected before," said Boston University's Murray.
As The New York Times explained, "R0 is messier than it might look. It is built on hard science, forensic investigation, complex mathematical models — and often a good deal of guesswork. It can vary radically from place to place and day to day, pushed up or down by local conditions and human behavior."
It may be impossible to find the R0 for your community. However, a website created by data specialists is providing updated estimates of a related number -- effective reproduction number, or Rt – for each state. (The R0 refers to how infectious the disease is in general and if precautions aren't taken. The Rt measures its infectiousness at a specific time – the "t" in Rt.) The site is at rt.live.
"The main thing to look at is whether the number is bigger than 1, meaning the outbreak is currently growing in your area, or smaller than 1, meaning the outbreak is currently decreasing in your area," Murray said. "It's also important to remember that this number depends on the prevention measures your community is taking. If the Rt is estimated to be 0.9 in your area and you are currently under lockdown, then to keep it below 1 you may need to remain under lockdown. Relaxing the lockdown could mean that Rt increases above 1 again."
"Whether they're on the upswing or downswing, no state is safe enough to ignore the precautions about mask wearing and social distancing."
Keep in mind that you can still become infected even if an outbreak in your community appears to be slowing. Low risk doesn't mean no risk.
Putting It All Together: Why the Numbers Matter
So you've reviewed COVID-19 statistics in your community. Now what?
Dr. Wilson suggests using the data to remind yourself that the coronavirus pandemic "is still out there. You need to take it seriously and continue precautions," he said. "Whether they're on the upswing or downswing, no state is safe enough to ignore the precautions about mask wearing and social distancing. 'My state is doing well, no one I know is sick, is it time to have a dinner party?' No."
He also recommends that laypeople avoid tracking COVID-19 statistics every day. "Check in once a week or twice a month to see how things are going," he suggested. "Don't stress too much. Just let it remind you to put that mask on before you get out of your car [and are around others]."
GOOD10: The Pandemic Issue explores big-picture ways that science innovation and communication can usher in a more equitable, more progress-oriented, and safer world.
This issue is a collaboration among GOOD, leapsmag, and the Aspen Institute Science & Society Program.
The GOOD10 format explores fundamental issues facing humanity through the lenses of ten forces pushing the needle toward progress: Places, Philanthropists, Celebrities, Whistleblowers, Companies, Media, Products, Politicians, Scientists, and Actions. Across these categories, we seek to present unexpected and encouraging paradigms emerging from this historic crisis.
This special issue is available as an e-reader version for both desktop and mobile. It is also available as a free downloadable PDF.
TABLE OF CONTENTS:
- PLACES:
55 Lessons Learned About Science Communication Around the World; Quarantining Our Way Into Outer Space - PLACES:
Quarantining Our Way Into Outer Space - PHILANTHROPISTS:
An Exclusive Interview with Wendy Schmidt about Science in the Pandemic Era - CELEBRITIES:
Neil deGrasse Tyson Wants Celebrities to Promote Scientists - WHISTLEBLOWERS:
The Science Sleuths Holding Fraudulent Research Accountable - COMPANIES:
The Biggest Challenge for a COVID-19 Vaccine: Making It Accessible and Affordable - MEDIA:
Isaac Asimov on the History of Infectious Disease—And How Humanity Learned To Fight Back - PRODUCTS:
Will COVID-19 Pave the Way For DIY Precision Medicine? - POLITICIANS:
Will the Pandemic Propel STEM Experts to Political Power? - SCIENTISTS:
Would a Broad-Spectrum Antiviral Drug Stop the Pandemic? - ACTIONS:
Pseudoscience is Rampant: How Not to Fall For It - ACTIONS:
How COVID-19 Could Usher In a New Age of Collective Drug Discovery
THE EVENT:
"The Pandemic Science Summit" focused on how science innovation is key to society's future stability as we emerge from the pandemic, featuring:
Christopher Bailey – Arts and Health Lead, World Health Organization
Elisabeth Bik, Ph.D. – Microbiologist and scientific integrity consultant
Margaret Hamburg, M.D. – Foreign Secretary, National Academy of Medicine; former Commissioner, U.S. Food and Drug Administration
Peggy Oti-Boateng, Ph.D. – Director, Division of Science Policy and Capacity- Building, UNESCO
George Yancopoulos, M.D., Ph.D. – President and Chief Scientific Officer, Regeneron Pharmaceuticals
Kira Peikoff was the editor-in-chief of Leaps.org from 2017 to 2021. As a journalist, her work has appeared in The New York Times, Newsweek, Nautilus, Popular Mechanics, The New York Academy of Sciences, and other outlets. She is also the author of four suspense novels that explore controversial issues arising from scientific innovation: Living Proof, No Time to Die, Die Again Tomorrow, and Mother Knows Best. Peikoff holds a B.A. in Journalism from New York University and an M.S. in Bioethics from Columbia University. She lives in New Jersey with her husband and two young sons. Follow her on Twitter @KiraPeikoff.