Big Data Probably Knows More About You Than Your Friends Do
Data is the new oil. It is highly valuable, and it is everywhere, even if you're not aware of it. For example, it's there when you use social media. Sharing pictures on Facebook lets its facial recognition software peg you and your friends. Thanks to that software, now anywhere you visit that has installed cameras, your face can be identified and your actions recorded.
The big data revolution is advancing much faster than the ones before, and it carries both promises and perils for humanity.
It's there when you log into Twitter, posting one of the 230 million tweets per day, which up until last month were all archived by the Library of Congress and will be made public for research. These social media data can be used to predict your political affiliations, ethnicity, race, age, how close you are with your family and friends, your mental health, even when you are most likely to be grumpy or go to the gym. These data can also predict when you are apt to get sick and track how diseases are spreading.
In fact, tracking isn't limited to what you decide to share or public spaces anymore. Lab experiments show Comcast and other cable companies may soon be able to record and monitor movements in your house. They may also be able to read your lips and identify your visitors simply by assessing how Wi-Fi waves bounce off bodies and other objects in houses. In one study, MIT researchers used routers and sensors to monitor breathing and heart rates with 99% accuracy. Routers could soon be used for seemingly good things, like monitoring infant breathing and whether an older adult is about to take a big tumble. However, it may also enable unwanted and unparalleled levels of surveillance.
Some call the first digital pill a snitch pill, medication with a tattletale, and big brother in your belly.
Big data is there every time you pick up your smartphone, which can track your daily steps, where you go via geolocation, what time you wake up and go to bed, your punctuality, and even your overall health depending on which features you have enabled. Are you close with your mom; are you a sedentary couch potato; did you commit a murder (iPhone data was recently used in a German murder trial)? Smartphone-generated data can be used to label you---and not just you, your future and past generations too.
Smartphones are not the only "things" gathering data on you. Anything with an on and off switch can be connected to the internet and generate data. The new rule seems to be, if it can be, it will be, connected. Washing machines, coffee makers, medical appliances, cars, and even your luggage (yes, someone created a self-driving suitcase) can and are often generating data. "Smart" refrigerators can monitor your food levels and automatically create shopping lists and order food for you—while recording your alcohol consumption and whether you tend to be a healthy or junk food eater.
Even medicines can monitor behaviors. The first digital pill was just approved by the FDA last November to track whether patients take their medicines. It has a sensor that sends signals to a patient's smartphone, and others, when it encounters stomach acid. Some call it a snitch pill, medication with a tattletale, and big brother in your belly. Others see it as a major breakthrough to help patients remember to take their medications and to save payers millions of dollars.
Big data is there when you go shopping. Credit card and retail data can show whether you pay for a gym, if you are pregnant, have children, and your credit-worthiness. Uber and Lyft transactional data reveal what time you usually go to and leave work and who you regularly visit (Uber data has been used to catch cheating spouses).
Amazon now sells a bedroom camera to see your fashion choices and offer advice. It is marketing a more fashionable you, but it probably also wants the video feed showing your body measurements—they're "a newly prized currency," according to the Washington Post. They help retailers create more customized and better fitting clothes. Amazon also just partnered with Berkshire Hathaway and JPMorgan Chase, the largest bank in the United States by assets, to create an independent health-care company for their employees--raising privacy concerns as Amazon already owns so much data about us, from drones, devices, the AI of Alexa, and our viewing, eating, and other purchasing habits on Amazon Prime.
Data generation and storage can also be used to make the world better, safer and fairer.
Big data is arguably a new phenomenon; almost all the world's data (90%) were produced within the last 2 years or so. It is a result of the fusion of physical, digital, and biological technologies that together constitute the fourth industrial revolution, according to the World Economic Forum. Unlike the last three revolutions, involving the discoveries of steam power, electrical energy, and computers—this revolution is advancing much faster than the ones before and it carries both promises and perils for humanity.
Some people may want to opt out of all this tracking, reduce their digital footprint and stay "off the grid." However, it is worth noting that data generation and storage can be used for great things --- things that make the world better, safer and fairer. For example, sharing electronic health records and social media data can help scientists better track and understand diseases, develop new cures and therapies, and understand the safety and efficacy profiles of medicines and vaccines.
While full of promise, big data is not without its pitfalls. Data are often not interoperable or easily integrated. You can use your credit card practically anywhere in the world, but you cannot easily port your electronic health record to the doctor or hospital across the street, for example.
Data quality can also be poor. It is dependent on the person entering it. My electronic health record at one point said I was male, and I was pregnant at the time. No doctors or nurses seemed to notice. The problem is worse on a global level. For example, causes of death can be coded differently by country and village. Take HIV patients: they often develop secondary infections, like TB. Do you record the cause of death as TB or HIV? There isn't global consistency, and political pressure from patient groups can exert itself on death records. Often, each group wants to say they have the most deaths so they can fundraise more money.
Data can be biased. More than 80 percent of genomic data comes from Caucasians. Only 14 percent is from Asians and 3.5 percent is from African and Hispanic populations. Thus, when scientists use genomic data to develop drugs or lab tests, they may create biased products that work for only some demographics. Take type 2 diabetes blood tests; some do not work well for African Americans. One study estimates that 650,000 African Americans may have undiagnosed diabetes, because a common blood test doesn't work for them. Using biased data in medicine can be a matter of life and death. Moreover, if genomic medicine benefits only "a privileged few," the practice raises concerns about unequal access.
Large companies are selling data that originated from you and you are not sharing in the wealth.
We need to think carefully and be transparent about the values embedded in our data, data analytics (algorithms), and data applications. Numbers are never neutral. Algorithms are always embedded with subjective normative values--sometimes purposely, sometimes not. To address this problem, we need ethicists who can audit databanks and algorithms to identify embedded norms, values and biases and help ensure they are addressed or at least transparently disclosed. Additionally, we need to determine how to let people opt out of certain types of data collection and uses—and not just at the beginning of a system, but also at any point in their lifetimes. There is a right to be forgotten, which hasn't been adequately operationalized in today's data sphere.
What do you think happens to all of these data collected about us? The short answer is the public doesn't really know. A lot of it looks like what is in a medical record—i.e. height, weight, pregnancy status, age, mental health, pulse, blood pressure, and illness symptoms--- yet, it isn't protected by HIPPA, like your medical record information.
And it is being consolidated into the hands of fewer and fewer big players. Large companies are selling data that originated from you and you are not sharing in the wealth.
A possible solution is to create an app, managed by a nonprofit or public benefit corporation, through which you could download and manage all the data collected about you. For example, you could download your credit card statements with all your purchasing habits, your Uber rides showing transit patterns, medical records, electric bills, every digital record you have and would like to download--into one application. You would then have the power to license pieces or the collection of your data to users for a small fee for one year at a time. Uses and users could be monitored and audited leveraging blockchain capabilities. After the year is up, you can withdraw access.
You could be your own data landlord. We could democratize big data and empower people to better control and manage the wealth of information collected about us. Why should only the big companies like Amazon and Apple profit off the new oil? Let's create an app so we can all manage our data wealth and maybe even become data barons—an app created by the people for the people.
DNA- and RNA-based electronic implants may revolutionize healthcare
Implantable electronic devices can significantly improve patients’ quality of life. A pacemaker can encourage the heart to beat more regularly. A neural implant, usually placed at the back of the skull, can help brain function and encourage higher neural activity. Current research on neural implants finds them helpful to patients with Parkinson’s disease, vision loss, hearing loss, and other nerve damage problems. Several of these implants, such as Elon Musk’s Neuralink, have already been approved by the FDA for human use.
Yet, pacemakers, neural implants, and other such electronic devices are not without problems. They require constant electricity, limited through batteries that need replacements. They also cause scarring. “The problem with doing this with electronics is that scar tissue forms,” explains Kate Adamala, an assistant professor of cell biology at the University of Minnesota Twin Cities. “Anytime you have something hard interacting with something soft [like muscle, skin, or tissue], the soft thing will scar. That's why there are no long-term neural implants right now.” To overcome these challenges, scientists are turning to biocomputing processes that use organic materials like DNA and RNA. Other promised benefits include “diagnostics and possibly therapeutic action, operating as nanorobots in living organisms,” writes Evgeny Katz, a professor of bioelectronics at Clarkson University, in his book DNA- And RNA-Based Computing Systems.
While a computer gives these inputs in binary code or "bits," such as a 0 or 1, biocomputing uses DNA strands as inputs, whether double or single-stranded, and often uses fluorescent RNA as an output.
Adamala’s research focuses on developing such biocomputing systems using DNA, RNA, proteins, and lipids. Using these molecules in the biocomputing systems allows the latter to be biocompatible with the human body, resulting in a natural healing process. In a recent Nature Communications study, Adamala and her team created a new biocomputing platform called TRUMPET (Transcriptional RNA Universal Multi-Purpose GatE PlaTform) which acts like a DNA-powered computer chip. “These biological systems can heal if you design them correctly,” adds Adamala. “So you can imagine a computer that will eventually heal itself.”
The basics of biocomputing
Biocomputing and regular computing have many similarities. Like regular computing, biocomputing works by running information through a series of gates, usually logic gates. A logic gate works as a fork in the road for an electronic circuit. The input will travel one way or another, giving two different outputs. An example logic gate is the AND gate, which has two inputs (A and B) and two different results. If both A and B are 1, the AND gate output will be 1. If only A is 1 and B is 0, the output will be 0 and vice versa. If both A and B are 0, the result will be 0. While a computer gives these inputs in binary code or "bits," such as a 0 or 1, biocomputing uses DNA strands as inputs, whether double or single-stranded, and often uses fluorescent RNA as an output. In this case, the DNA enters the logic gate as a single or double strand.
If the DNA is double-stranded, the system “digests” the DNA or destroys it, which results in non-fluorescence or “0” output. Conversely, if the DNA is single-stranded, it won’t be digested and instead will be copied by several enzymes in the biocomputing system, resulting in fluorescent RNA or a “1” output. And the output for this type of binary system can be expanded beyond fluorescence or not. For example, a “1” output might be the production of the enzyme insulin, while a “0” may be that no insulin is produced. “This kind of synergy between biology and computation is the essence of biocomputing,” says Stephanie Forrest, a professor and the director of the Biodesign Center for Biocomputing, Security and Society at Arizona State University.
Biocomputing circles are made of DNA, RNA, proteins and even bacteria.
Evgeny Katz
The TRUMPET’s promise
Depending on whether the biocomputing system is placed directly inside a cell within the human body, or run in a test-tube, different environmental factors play a role. When an output is produced inside a cell, the cell's natural processes can amplify this output (for example, a specific protein or DNA strand), creating a solid signal. However, these cells can also be very leaky. “You want the cells to do the thing you ask them to do before they finish whatever their businesses, which is to grow, replicate, metabolize,” Adamala explains. “However, often the gate may be triggered without the right inputs, creating a false positive signal. So that's why natural logic gates are often leaky." While biocomputing outside a cell in a test tube can allow for tighter control over the logic gates, the outputs or signals cannot be amplified by a cell and are less potent.
TRUMPET, which is smaller than a cell, taps into both cellular and non-cellular biocomputing benefits. “At its core, it is a nonliving logic gate system,” Adamala states, “It's a DNA-based logic gate system. But because we use enzymes, and the readout is enzymatic [where an enzyme replicates the fluorescent RNA], we end up with signal amplification." This readout means that the output from the TRUMPET system, a fluorescent RNA strand, can be replicated by nearby enzymes in the platform, making the light signal stronger. "So it combines the best of both worlds,” Adamala adds.
These organic-based systems could detect cancer cells or low insulin levels inside a patient’s body.
The TRUMPET biocomputing process is relatively straightforward. “If the DNA [input] shows up as single-stranded, it will not be digested [by the logic gate], and you get this nice fluorescent output as the RNA is made from the single-stranded DNA, and that's a 1,” Adamala explains. "And if the DNA input is double-stranded, it gets digested by the enzymes in the logic gate, and there is no RNA created from the DNA, so there is no fluorescence, and the output is 0." On the story's leading image above, if the tube is "lit" with a purple color, that is a binary 1 signal for computing. If it's "off" it is a 0.
While still in research, TRUMPET and other biocomputing systems promise significant benefits to personalized healthcare and medicine. These organic-based systems could detect cancer cells or low insulin levels inside a patient’s body. The study’s lead author and graduate student Judee Sharon is already beginning to research TRUMPET's ability for earlier cancer diagnoses. Because the inputs for TRUMPET are single or double-stranded DNA, any mutated or cancerous DNA could theoretically be detected from the platform through the biocomputing process. Theoretically, devices like TRUMPET could be used to detect cancer and other diseases earlier.
Adamala sees TRUMPET not only as a detection system but also as a potential cancer drug delivery system. “Ideally, you would like the drug only to turn on when it senses the presence of a cancer cell. And that's how we use the logic gates, which work in response to inputs like cancerous DNA. Then the output can be the production of a small molecule or the release of a small molecule that can then go and kill what needs killing, in this case, a cancer cell. So we would like to develop applications that use this technology to control the logic gate response of a drug’s delivery to a cell.”
Although platforms like TRUMPET are making progress, a lot more work must be done before they can be used commercially. “The process of translating mechanisms and architecture from biology to computing and vice versa is still an art rather than a science,” says Forrest. “It requires deep computer science and biology knowledge,” she adds. “Some people have compared interdisciplinary science to fusion restaurants—not all combinations are successful, but when they are, the results are remarkable.”
In today’s podcast episode, Leaps.org Deputy Editor Lina Zeldovich speaks about the health and ecological benefits of farming crickets for human consumption with Bicky Nguyen, who joins Lina from Vietnam. Bicky and her business partner Nam Dang operate an insect farm named CricketOne. Motivated by the idea of sustainable and healthy protein production, they started their unconventional endeavor a few years ago, despite numerous naysayers who didn’t believe that humans would ever consider munching on bugs.
Yet, making creepy crawlers part of our diet offers many health and planetary advantages. Food production needs to match the rise in global population, estimated to reach 10 billion by 2050. One challenge is that some of our current practices are inefficient, polluting and wasteful. According to nonprofit EarthSave.org, it takes 2,500 gallons of water, 12 pounds of grain, 35 pounds of topsoil and the energy equivalent of one gallon of gasoline to produce one pound of feedlot beef, although exact statistics vary between sources.
Meanwhile, insects are easy to grow, high on protein and low on fat. When roasted with salt, they make crunchy snacks. When chopped up, they transform into delicious pâtes, says Bicky, who invents her own cricket recipes and serves them at industry and public events. Maybe that’s why some research predicts that edible insects market may grow to almost $10 billion by 2030. Tune in for a delectable chat on this alternative and sustainable protein.
Listen on Apple | Listen on Spotify | Listen on Stitcher | Listen on Amazon | Listen on Google
Further reading:
More info on Bicky Nguyen
https://yseali.fulbright.edu.vn/en/faculty/bicky-n...
The environmental footprint of beef production
https://www.earthsave.org/environment.htm
https://www.watercalculator.org/news/articles/beef-king-big-water-footprints/
https://www.frontiersin.org/articles/10.3389/fsufs.2019.00005/full
https://ourworldindata.org/carbon-footprint-food-methane
Insect farming as a source of sustainable protein
https://www.insectgourmet.com/insect-farming-growing-bugs-for-protein/
https://www.sciencedirect.com/topics/agricultural-and-biological-sciences/insect-farming
Cricket flour is taking the world by storm
https://www.cricketflours.com/
https://talk-commerce.com/blog/what-brands-use-cricket-flour-and-why/
Lina Zeldovich has written about science, medicine and technology for Popular Science, Smithsonian, National Geographic, Scientific American, Reader’s Digest, the New York Times and other major national and international publications. A Columbia J-School alumna, she has won several awards for her stories, including the ASJA Crisis Coverage Award for Covid reporting, and has been a contributing editor at Nautilus Magazine. In 2021, Zeldovich released her first book, The Other Dark Matter, published by the University of Chicago Press, about the science and business of turning waste into wealth and health. You can find her on http://linazeldovich.com/ and @linazeldovich.