Can You Trust Your Gut for Food Advice?
Kira Peikoff was the editor-in-chief of Leaps.org from 2017 to 2021. As a journalist, her work has appeared in The New York Times, Newsweek, Nautilus, Popular Mechanics, The New York Academy of Sciences, and other outlets. She is also the author of four suspense novels that explore controversial issues arising from scientific innovation: Living Proof, No Time to Die, Die Again Tomorrow, and Mother Knows Best. Peikoff holds a B.A. in Journalism from New York University and an M.S. in Bioethics from Columbia University. She lives in New Jersey with her husband and two young sons. Follow her on Twitter @KiraPeikoff.
I recently got on the scale to weigh myself, thinking I've got to eat better. With so many trendy diets today claiming to improve health, from Keto to Paleo to Whole30, it can be confusing to figure out what we should and shouldn't eat for optimal nutrition.
My next thought was: I've got to lose a few pounds.
Consider a weird factoid: In addition to my fat, skin, bone and muscle, I'm carrying around two or three pounds of straight-up bacteria. Like you, I am the host to trillions of micro-organisms that live in my gut and are collectively known as my microbiome. An explosion of research has occurred in the last decade to try to understand exactly how these microbial populations, which are unique to each of us, may influence our overall health and potentially even our brains and behavior.
Lots of mysteries still remain, but it is established that these "bugs" are crucial to keeping our body running smoothly, performing functions like stimulating the immune system, synthesizing important vitamins, and aiding digestion. The field of microbiome science is evolving rapidly, and a number of companies are now selling the concept of "personalized" nutrition based on the genetic makeup of your individual gut bugs. The two leading players are Viome and DayTwo, but the landscape includes the newly launched startup Onegevity Health and others like Thryve, which offers customized probiotic supplements in addition to dietary recommendations.
The idea has immediate appeal – if science could tell you exactly what to make for lunch and what to avoid, you could forget about the fad diets and go with your own bespoke food pyramid. Wondering if the promise might be too good to be true, I decided to perform my own experiment.
Last fall, I sent the identical fecal sample to both Viome (I paid $425, but the price has since dropped to $299) and DayTwo ($349). A couple of months later, both reports finally arrived, and I eagerly opened each app to compare their recommendations.
First, I examined my results from Viome, which was founded in 2016 in Cupertino, Calif., and declares without irony on its website that "conflicting food advice is now obsolete."
I learned I have "average" metabolic fitness and "average" inflammatory activity in my gut, which are scores that the company defines based on a proprietary algorithm. But I have "low" microbial richness, with only 62 active species of bacteria identified in my sample, compared with the mean of 157 in their test population. I also received a list of the specific species in my gut, with names like Lactococcus and Romboutsia.
But none of it meant anything to me without actionable food advice, so I clicked through to the Recommendations page and found a list of My Superfoods (cranberry, garlic, kale, salmon, turmeric, watermelon, and bone broth) and My Foods to Avoid (chickpeas, kombucha, lentils, and rice noodles). There was also a searchable database of many foods that had been categorized for me, like "bell pepper; minimize" and "beef; enjoy."
Next, I looked at my results from DayTwo, which was founded in 2015 from research out of the Weizmann Institute of Science in Israel, and whose pitch to consumers is, "Blood sugar made easy. The algorithm diet personalized to you."
This app had some notable differences. There was no result about my metabolic fitness, microbial richness, or list of the species in my sample. There was also no list of superfoods or foods to avoid. Instead, the app encouraged me to build a meal by searching for foods in their database and combining them in beneficial ways for my blood sugar. Two slices of whole wheat bread received a score of 2.7 out of 10 ("Avoid"), but if combined with one cup of large curd cottage cheese, the score improved to 6.8 ("Limit"), and if I added two hard-boiled eggs, the score went up to 7.5 ("Good").
Perusing my list of foods with "Excellent" scores, I noticed some troubling conflicts with the other app. Lentils, which had been a no-no according to Viome, received high marks from DayTwo. Ditto for Kombucha. My purported superfood of cranberry received low marks. Almonds got an almost perfect score (9.7) while Viome told me to minimize them. I found similarly contradictory advice for foods I regularly eat, including navel oranges, peanuts, pork, and beets.
Contradictory dietary guidance that Kira Peikoff received from Viome (left) and DayTwo from an identical sample.
To be sure, there was some overlap. Both apps agreed on rice noodles (bad), chickpeas (bad), honey (bad), carrots (good), and avocado (good), among other foods.
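The side-by-side comparison I did by eye can also be written down mechanically. The following is a minimal Python sketch, not either company's method: it reduces each app's advice for the foods named above to a coarse "good" or "bad" label and lists where the two agree and where they conflict. The binary encoding is an assumption made for illustration; both apps actually use finer-grained categories and scores.

```python
# Coarse labels transcribed from the foods named in the article.
# Collapsing each app's categories to "good"/"bad" is an illustrative assumption.
VIOME = {
    "lentils": "bad", "kombucha": "bad", "cranberry": "good", "almonds": "bad",
    "rice noodles": "bad", "chickpeas": "bad", "honey": "bad",
    "carrots": "good", "avocado": "good",
}
DAYTWO = {
    "lentils": "good", "kombucha": "good", "cranberry": "bad", "almonds": "good",
    "rice noodles": "bad", "chickpeas": "bad", "honey": "bad",
    "carrots": "good", "avocado": "good",
}


def compare(a, b):
    """Return (agreements, conflicts) for the foods rated by both sources."""
    shared = sorted(a.keys() & b.keys())
    agreements = [food for food in shared if a[food] == b[food]]
    conflicts = [food for food in shared if a[food] != b[food]]
    return agreements, conflicts


agreements, conflicts = compare(VIOME, DAYTWO)
print("agree:", agreements)
print("conflict:", conflicts)
```

Only the handful of foods mentioned here are included, so the split says nothing about how often the two apps disagree across their full databases.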
But still, I was left scratching my head. Which set of recommendations should I trust, if either? And what did my results mean for the accuracy of this nascent field?
I called a couple of experts to find out.
"I have worked on the microbiome and nutrition for the last 20 years and I would be absolutely incapable of finding you evidence in the scientific literature that lentils have a detrimental effect based on the microbiome," said Dr. Jens Walter, an Associate Professor and chair for Nutrition, Microbes, and Gastrointestinal Health at the University of Alberta. "I just don't think sufficient data is yet available to make reliable personalized dietary recommendations based on one's microbiome. And even if they would have proprietary algorithms, at least one of them is not doing it right."
There is definite potential for personalized nutrition based on the microbiome, he said, but first, predictive models must be built and standardized, then linked to clinical endpoints, and tested in a large sample of healthy volunteers in order to enable extrapolations for the general population.
"It is mindboggling what you would need to do to make this work," he observed. "There are probably hundreds of relevant dietary compounds, then the microbiome has at least a hundred relevant species with a hundred or more relevant genes each, then you'd have to put all this together with relevant clinical outcomes. And there's a hundred-fold variation in that information between individuals."
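To get a feel for the scale Walter describes, here is a back-of-the-envelope sketch in Python. The figures are rough readings of his quote rather than measured values: "hundreds" of dietary compounds is taken as 200, and the species and gene counts use his stated minimums. It counts only pairings of one compound with one microbial gene, before any link to clinical outcomes or person-to-person variation.

```python
# Rough figures read off Walter's quote; 200 for "hundreds" is an assumption.
DIETARY_COMPOUNDS = 200
RELEVANT_SPECIES = 100      # "at least a hundred relevant species"
GENES_PER_SPECIES = 100     # "a hundred or more relevant genes each"

# Total microbial genes that could matter, then every compound-gene pairing.
microbial_genes = RELEVANT_SPECIES * GENES_PER_SPECIES
compound_gene_pairs = DIETARY_COMPOUNDS * microbial_genes

print(f"{microbial_genes:,} microbial genes")
print(f"{compound_gene_pairs:,} compound-gene pairs to characterize")
```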
However, Walter did acknowledge that the companies might be basing their algorithms on proprietary data that could potentially connect all the dots. I reached out to them to find out.
Amir Golan, the Chief Commercial Officer of DayTwo, told me, "It's important to emphasize this is a prediction, as the microbiome field is in a very early stage of research." But he added, "I believe we are the only company that has very solid science published in top journals and we can bring very actionable evidence and benefit to our users."
He was referring to pioneering work out of the Weizmann Institute that was published in 2015 in the journal Cell, which logged the glycemic responses of 800 people in response to nearly 50,000 meals; adding information about the subjects' microbiomes enabled more accurate glycemic response predictions. Since then, Golan said, additional trials have been conducted, most recently with the Mayo Clinic, to duplicate the results, and other studies are ongoing whose results have not yet been published.
He also pointed out that the microbiome was merely one component that goes into building a client's profile, in addition to medical records, including blood glucose levels. (I provided my HbA1c levels, a measure of average blood sugar over the previous several months.)
"We are not saying we want to improve your gut microbiome. We provide a dynamic tool to help guide what you should eat to control your blood sugar and think about combinations," he said. "If you eat one thing, or with another, it will affect you in a different way."
Viome acknowledged that the two companies are taking very different approaches.
"DayTwo is primarily focused on the glycemic response," Naveen Jain, the CEO, told me. "If you can only eat butter for rest of your life, you will have no glycemic response but will probably die of a heart attack." He laughed. "Whereas we came from very different angle – what is happening inside the gut at a microbial level? When you eat food like spinach, how will that be metabolized in the gut? Will it produce the nutrients you need or cause inflammation?"
He said his team studied 1000 people who were on continuous glucose monitoring and fed them 45,000 meals, then built a proprietary data prediction model, looking at which microbes existed and how they actively broke down the food.
Jain pointed out that DayTwo sequences the DNA of the microbes, while Viome sequences the RNA – the active expression of DNA. That difference, in his opinion, is key to making accurate predictions.
"DNA is extremely stable, so when you eat any food and measure the DNA [in a fecal sample], you get all these false positives--you get DNA from plant food and meat, and you have no idea if those organisms are dead and simply transient, or actually exist. With RNA, you see what is actually alive in the gut."
More contradictory food advice from Viome (left) and DayTwo.
Note that controversy exists over how it is possible with a fecal sample to effectively measure RNA, which degrades within minutes, though Jain said that his company has the technology to keep RNA stable for fourteen days.
Viome's approach, Jain maintains, is 90 percent accurate, based on as-yet unpublished data; a patent was filed just last week. DayTwo's approach is 66 percent accurate according to the latest published research.
Natasha Haskey, a registered dietician and doctoral student conducting research in the field of microbiome science and nutrition, is skeptical of both companies. "We can make broad statements, like eat more fruits and vegetables and fiber, but when it comes to specific foods, the science is just not there yet," she said. "I think there is a future, and we will be doing that someday, but not yet. Maybe we will be closer in ten years."
Professor Walter wholeheartedly agrees with Haskey, and suggested that if people want to eat a gut-healthy diet, they should focus on beneficial oils, fruits and vegetables, fish, a variety of whole grains, poultry and beans, and limit red meat and cheese, as well as avoid processed meats.
"These services are far over the tips of their science skis," Arthur Caplan, the founding head of New York University's Division of Medical Ethics, said in an email. "We simply don't know enough about the gut microbiome, its fluctuations and variability from person to person to support general [direct-to-consumer] testing. This is simply premature. We need standards for accuracy, specificity, and sensitivity, plus mandatory competent counseling for all such testing. They don't exist. Neither should DTC testing—yet."
Meanwhile, it's time for lunch. I close out my Viome and DayTwo apps and head to the kitchen to prepare a peanut butter sandwich. My gut tells me I'll be just fine.
Questions remain about new drug for hot flashes
Vasomotor symptoms (VMS) is the medical term for hot flashes associated with menopause. You are going to hear a lot more about it because a company has a new drug to sell. Here is what you need to know.
Menopause marks the end of a woman’s reproductive capacity. Normal hormonal production associated with that monthly cycle becomes erratic and finally ceases. For some women the transition can be relatively brief with only modest symptoms, while for others the body's “thermostat” in the brain is disrupted and they experience hot flashes and other symptoms that can disrupt daily activity. Lifestyle modification and drugs such as hormone therapy can provide some relief, but women at risk for cancer are advised not to use them and other women choose not to do so.
Fezolinetant, sold by Astellas Pharma Inc. under the product name Veozah™, was approved by the Food and Drug Administration (FDA) on May 12 to treat hot flashes associated with menopause. It is the first in a new class of drugs called neurokinin 3 receptor antagonists, which block specific neurons in the brain “thermostat” that trigger VMS. It does not appear to affect other symptoms of menopause. As with many drugs targeting a brain cell receptor, it must be taken continuously for a few days to build up a good therapeutic response, rather than working as a rescue product such as an asthma inhaler to immediately treat that condition.
Hot flashes vary greatly and naturally get better or resolve completely with time. That contributes to a placebo effect and makes it more difficult to judge the outcome of any intervention. Early this year, a meta-analysis of 17 drug trials for hot flashes found an unusually large placebo response in those types of studies; the placebo groups had an average of 5.44 fewer hot flashes and a 36 percent reduction in their severity.
In studies of fezolinetant, the drug recently approved by the FDA, the placebo benefit was strong and persistent. The drug group bested the placebo response to a statistically significant degree but, “If people have gone from 11 hot flashes a day to eight or seven in the placebo group and down to a couple fewer ones in the drug groups, how meaningful is that? Having six hot flashes a day is still pretty unpleasant,” says Diana Zuckerman, president of the National Center for Health Research (NCHR), a health-oriented think tank.
“Is a reduction compared to placebo of 2-3 hot flashes per day, in a population of women experiencing 10-11 moderate to severe hot flashes daily, enough relief to be clinically meaningful?” Andrea LaCroix asked in a commentary published in Nature Medicine. She is an epidemiologist at the University of California San Diego and a leader of the MsFlash network that has conducted a handful of NIH-funded studies on menopause.
Questions Remain
LaCroix and others have raised questions about how Astellas, the company that makes the new drug, handled missing data from patients who dropped out of the clinical trials. “The lack of detailed information about important parameters such as adherence and missing data raises concerns that the reported benefits of fezolinetant very likely overestimate those that will be observed in clinical practice," LaCroix wrote.
In response to this concern, Anna Criddle, director of global portfolio communications at Astellas, wrote in an email to Leaps.org: “…a full analysis of data, including adherence data and any impact of missing data, was submitted for assessment by [the FDA].”
The company ran the studies at more than 300 sites around the world. Curiously, none appear to have been at academic medical centers, which are known for higher quality research. Zuckerman says, "When somebody is paid to do a study, if they want to get paid to do another study by the same company, they will try to make sure that the results are the results that the company wants.”
Criddle said that Astellas picked the sites “that would allow us to reach a diverse population of women, including race and ethnicity.”
A trial of a lower dose of the drug was conducted in Asia. In March 2022, Astellas issued a press release saying it had failed to prove effectiveness. No further data has been released. Astellas still plans to submit the data, according to Criddle. Results from clinical trials funded by the U.S. government must be reported on clinicaltrials.gov within one year of the study's completion - a deadline that, in this case, has expired.
The measurement scale for hot flashes used in the studies, mild-moderate-severe, also came in for criticism. “It is really not good scale, there probably isn’t a broad enough range of things going on or descriptors,” says David Rind. He is chief medical officer of the Institute for Clinical and Economic Review (ICER), a nonprofit authority on new drugs. It conducted a thorough review and analysis of fezolinetant using then-existing data gathered from conference abstracts, posters and presentations, and held a public stakeholder meeting in December. A 252-page report was published in January, finding “considerable uncertainty about the comparative net health benefits of fezolinetant” versus hormone therapy.
Questions surrounding some of these issues might have been answered if the FDA had chosen to hold a public advisory committee meeting on fezolinetant, which it regularly does for first-in-class medicines. But the agency decided such a meeting was unnecessary.
Cost
There was little surprise when Astellas announced a list price for fezolinetant of $550 a month (about $6,600 annually) and a program of patient assistance to ease out-of-pocket expenses. The company had already incurred large expenses.
In 2017 Astellas purchased the company that originally developed fezolinetant for $534 million plus several hundred million in potential royalties. The drug company ran a “disease awareness” ad, Heat on the Street, that aired during the Super Bowl in February, where 30-second ads cost about $7 million. Industry analysts have projected sales to be $1.9 billion by 2028.
ICER’s pre-approval evaluation said fezolinetant might "be considered cost-effective if priced around $2,000 annually. ... [It] will depend upon its price and whether it is considered an alternative to MHT [menopause hormone treatment] for all women or whether it will primarily be used by women who cannot or will not take MHT."
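The gap between the list price and ICER's benchmark is simple arithmetic, sketched below in Python. It assumes twelve months of use at the $550 monthly list price and treats ICER's "around $2,000 annually" as exactly $2,000; it ignores insurance coverage, rebates, and the patient assistance program, none of which are quantified here.

```python
# Figures from the article; treating "around $2,000" as exactly 2,000 is an assumption.
MONTHLY_LIST_PRICE = 550          # dollars per month, Astellas list price
ICER_BENCHMARK_ANNUAL = 2_000     # dollars per year, ICER's cost-effective price point

# Annualize the list price, then compare it with the benchmark.
annual_list_price = MONTHLY_LIST_PRICE * 12
ratio_to_benchmark = annual_list_price / ICER_BENCHMARK_ANNUAL

print(f"Annual list price: ${annual_list_price:,}")
print(f"Multiple of ICER benchmark: {ratio_to_benchmark:.1f}x")
```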
Criddle wrote that Astellas set the price based on the novelty of the science, the quality of evidence for the drug and its uniqueness compared to the rest of the market. She noted that an individual’s payment will depend on how much their insurance company decides to cover. “[W]e expect insurance coverage to increase over the course of the year and to achieve widespread coverage in the U.S. over time.”
Leaps.org wrote to and followed up with nine of the largest health insurers/providers asking basic questions about their coverage of fezolinetant. Only two responded. Jennifer Martin, the deputy chief consultant for pharmacy benefits management at the Department of Veterans Affairs, said the agency “covers all drugs from the date that they are launched.” Decisions on whether it will be included in the drug formulary and what if any copays might be required are under review.
“[Fezolinetant] will go through our standard P&T Committee [pharmacy and therapeutics] review process in the next few months, including a review of available efficacy data, safety data, clinical practice guidelines, and comparison with other agents used for vasomotor symptoms of menopause," said Phil Blando, executive director of corporate communications for CVS Health.
Other insurers likely are going through a similar process to decide issues such as limiting coverage to women who are advised not to use hormones, how much copay will be required, and whether women will be required to first try other options or obtain approvals before getting a prescription.
Rind wants to see a few years of use before he prescribes fezolinetant broadly, and believes most doctors share his view. Nor will they be eager to fill out the additional paperwork required for women to participate in the Astellas patient assistance program, he added.
Safety
Astellas is marketing its drug by pointing out risks of hormone therapy, such as a recent paper in The BMJ, which noted that women who took hormones for even a short period of time had a 24 percent increased risk of dementia. While the percentage was scary, the combined number of women both on and off hormones who developed dementia was small. And it is unclear whether hormones are causing dementia or if more severe hot flashes are a marker for higher risk of developing dementia. This information is emerging only after 80 years of hundreds of millions of women using hormones.
In contrast, the label for fezolinetant prohibits “concomitant use with CYP1A2 inhibitors” and requires testing for liver and kidney function prior to initiating the drug and every three months thereafter. There is no human or animal data on use in a geriatric population, defined as 65 or older, a group that is likely to use the drug. Only a few thousand women have ever taken fezolinetant and most have used it for just a few months.
Options
A woman seeking relief from symptoms of menopause would like to see how fezolinetant compares with other available treatment options. But Astellas did not conduct such a study and Andrea LaCroix says it is unlikely that anyone ever will.
ICER has come the closest with a side-by-side analysis of evidence-based treatments, which found that fezolinetant, like the others, provided modest relief from hot flashes, and to a similar degree. Some treatments also help with other symptoms of menopause, which fezolinetant does not.
There are many coping strategies that women can adopt to deal with hot flashes; one of the most common is dressing in layers (such as a sleeveless blouse with a sweater) that can be added or subtracted as conditions require. Avoiding caffeine, hot liquids, and spicy foods is another common strategy. “I stopped drinking hot caffeinated drinks…for several years, and you get out of the habit of drinking them,” says Zuckerman.
LaCroix curates those options at My Meno Plan, which includes a search function where you can enter your symptoms and identify which treatments might work best for you. It also links to published research papers. She says the goal is to empower women with information to make informed decisions about menopause.
Every year, around two million people worldwide die of liver disease. While some people inherit the disease, it’s most commonly caused by hepatitis, obesity and alcoholism. These underlying conditions kill liver cells, causing scar tissue to form until eventually the liver cannot function properly. Since 1979, deaths due to liver disease have increased by 400 percent.
The sooner the disease is detected, the more effective treatment can be. But once symptoms appear, the liver is already damaged. Around 50 percent of cases are diagnosed only after the disease has reached the final stages, when treatment is largely ineffective.
To address this problem, Owlstone Medical, a biotech company in England, has developed a breath test that can detect liver disease earlier than conventional approaches. Human breath contains volatile organic compounds (VOCs) that change in the first stages of liver disease. Owlstone’s breath test can reliably collect, store and detect VOCs, while picking out the specific compounds that reveal liver disease.
“There’s a need to screen more broadly for people with early-stage liver disease,” says Owlstone’s CEO Billy Boyle. “Equally important is having a test that's non-invasive, cost effective and can be deployed in a primary care setting.”
The standard tool for detection is a biopsy. It is invasive and expensive, making it impractical to use for people who aren't yet symptomatic. Meanwhile, blood tests are less invasive, but they can be inaccurate and can’t discriminate between different stages of the disease.
In the past, breath tests have not been widely used because of the difficulties of reliably collecting and storing breath. But Owlstone’s technology could help change that.
The team is testing patients in the early stages of advanced liver disease, or cirrhosis, to identify and detect these biomarkers. In an initial study, Owlstone’s breathalyzer was able to pick out patients who had early cirrhosis with 83 percent sensitivity.
Boyle’s work is personally motivated. His wife died of colorectal cancer after she was diagnosed with a progressed form of the disease. “That was a big impetus for me to see if this technology could work in early detection,” he says. “As a company, Owlstone is interested in early detection across a range of diseases because we think that's a way to save lives and a way to save costs.”
How it works
Study participants breathe into a mouthpiece attached to a breath sampler developed by Owlstone. It has cartridges designed and optimized to collect gases. The sampler specifically targets VOCs, extracting them from atmospheric gases in breath, to ensure that even low levels of these compounds are captured.
The sampler can store compounds stably before they are assessed through a method called mass spectrometry, in which compounds are converted into charged particles (ions) before electromagnetic fields filter and identify even the tiniest amounts of those ions according to their mass and charge.
The top four compounds in our breath
In an initial study, Owlstone captured VOCs in breath to see which ones could help them tell the difference between people with and without liver disease. They tested the breath of 46 patients with liver disease - most of them in the earlier stages of cirrhosis - and 42 healthy people. Using this data, they were able to create a diagnostic model. Individually, compounds like 2-pentanone and limonene performed well as markers for liver disease. Owlstone achieved even better performance by examining the levels of the top four compounds together, distinguishing between liver disease cases and controls with 95 percent accuracy.
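Figures like "95 percent accuracy" and "83 percent sensitivity" come from counting correct and incorrect calls in each group. The Python sketch below shows the standard definitions. The group sizes (46 patients, 42 healthy people) are from the study as described here, but the split of correct and incorrect calls is hypothetical, chosen only so the totals land near 95 percent; the article does not report the actual breakdown or say exactly how "accuracy" was defined.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Standard test-performance measures from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)                 # share of patients correctly flagged
    specificity = tn / (tn + fp)                 # share of healthy people correctly cleared
    accuracy = (tp + tn) / (tp + fn + tn + fp)   # share of all calls that were correct
    return sensitivity, specificity, accuracy


# HYPOTHETICAL split: only the group sizes (46 and 42) come from the article.
tp, fn = 44, 2    # 46 patients with liver disease
tn, fp = 40, 2    # 42 healthy controls

sensitivity, specificity, accuracy = diagnostic_metrics(tp, fn, tn, fp)
print(f"sensitivity: {sensitivity:.1%}")
print(f"specificity: {specificity:.1%}")
print(f"accuracy:    {accuracy:.1%}")
```

The distinction matters for screening: sensitivity describes how many sick people a test catches, while accuracy blends both groups and can look high even when one of them is handled poorly.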
“It was a good proof of principle since it looks like there are breath biomarkers that can discriminate between diseases,” Boyle says. “That was a bit of a stepping stone for us to say, taking those identified, let’s try and dose with specific concentrations of probes. It's part of building the evidence and steering the clinical trials to get to liver disease sensitivity.”
Sabine Szunerits, a professor of chemistry at the Institute of Electronics at the University of Lille, sees the potential of Owlstone’s technology.
“Breath analysis is showing real promise as a clinical diagnostic tool,” says Szunerits, who has no ties with the company. “Owlstone Medical’s technology is extremely effective in collecting small volatile organic biomarkers in the breath. In combination with pattern recognition it can give an answer on liver disease severity. I see it as a very promising way to give patients novel chances to be cured.”
Improving the breath sampling process
Challenges remain. With more than one thousand VOCs found in the breath, it can be difficult to identify markers for liver disease that are consistent across many patients.
Julian Gardner is a professor of electrical engineering at Warwick University who researches electronic sensing devices. “Everyone’s breath has different levels of VOCs and different ones according to gender, diet, age etc,” Gardner says. “It is indeed very challenging to selectively detect the biomarkers in the breath for liver disease.”
So Owlstone is putting chemicals in the body that they know interact differently with patients with liver disease, and then using the breath sampler to measure these specific VOCs. The chemicals they administer are called Exogenous Volatile Organic Compound (EVOC) probes.
Most recently, they used limonene as an EVOC probe, testing 29 patients with early cirrhosis and 29 controls. They gave the limonene to subjects at specific doses to measure how its concentrations change in breath. The aim was to try and see what was happening in their livers.
“They are proposing to use drugs to enhance the signal as they are concerned about the sensitivity and selectivity of their method,” Gardner says. “The approach of EVOC probes is probably necessary as you can then eliminate the person-to-person variation that will be considerable in the soup of VOCs in our breath.”
Through these probes, Owlstone could identify patients with liver disease with 83 percent sensitivity. By targeting what they knew was a disease mechanism, they were able to amplify the signal. The company is starting a larger clinical trial, and the plan is to eventually use a panel of EVOC probes to make sure they can see diverging VOCs more clearly.
“I think the approach of using probes to amplify the VOC signal will ultimately increase the specificity of any VOC breath tests, and improve their practical usability,” says Roger Yazbek, who leads the South Australian Breath Analysis Research (SABAR) laboratory at Flinders University. “Whilst the findings are interesting, it still is only a small cohort of patients in one location.”
The future of breath diagnosis
Owlstone wants to partner with pharmaceutical companies looking to learn if their drugs have an effect on liver disease. They’ve also developed a microchip, a miniaturized version of mass spectrometry instruments, that can be used with the breathalyzer. It is less sensitive but will enable faster detection.
Boyle says the company's mission is for their tests to save 100,000 lives. "There are lots of risks and lots of challenges. I think there's an opportunity to really establish breath as a new diagnostic class.”