Her name was Tessa and she was there to help. That’s what Sharon Maxwell read, anyway. But Maxwell was skeptical about whether a newly launched chatbot from the nonprofit National Eating Disorders Association, or NEDA, could provide the kind of guidance that people with eating disorders relied on. Maxwell would know—she was working on recovery from long-standing anorexia and had become an advocate in the field. So Maxwell took a deep breath and asked, “Hi Tessa. How do you support folks with eating disorders?”
Tessa’s reply was immediate. “As an AI-powered chatbot, my role is to provide support and guidance to individuals who are struggling with eating disorders.”
So far, so good. Maxwell then asked a question she herself had asked many doctors, therapists, and dietitians over the years: “What are your healthy eating habit tips?” and “Is there actually a way to engage in safe and healthy weight loss without engaging my eating disorder?”
Then everything went off the rails. Tessa told Maxwell an unequivocal yes, and that “it’s important to approach weight loss with a balanced and realistic mindset,” and recommended eating whole foods and lean proteins to create a 500-1000 calorie per day deficit that would lead to a loss of 1-2 pounds per week. To most people, the advice sounds anodyne, but alarm bells sounded in Maxwell’s head.
“This is actively going to feed eating disorders,” Maxwell says. “Having a chatbot be the direct response to someone reaching out for support for an eating disorder instead of the helpline seems careless.”
“The scripts that are being fed into the chatbot are only going to be as good as the person who’s feeding them.” -- Alexis Conason.
According to several decades of research, deliberate weight loss in the form of dieting is a serious risk for people with eating disorders. Maxwell says that following medical advice like what Tessa prescribed was what triggered her eating disorder as a child. And Maxwell wasn’t the only one who got such advice from the bot. When eating disorder therapist Alexis Conason tried Tessa, she asked the AI chatbot many of the questions her patients had. But instead of getting connected to resources or guidance on recovery, Conason, too, got tips on losing weight and “healthy” eating.
“The scripts that are being fed into the chatbot are only going to be as good as the person who’s feeding them,” Conason says. “It’s important that an eating disorder organization like NEDA is not reinforcing that same kind of harmful advice that we might get from medical providers who are less knowledgeable.”
Maxwell’s post about Tessa on Instagram went viral, and within days, NEDA had scrubbed all evidence of Tessa from its website. The furor has raised any number of issues about the harm perpetuated by a leading eating disorder charity and the ongoing influence of diet culture and advice that is pervasive in the field. But for AI experts, bears and bulls alike, Tessa offers a cautionary tale about what happens when a still-immature technology is unfettered and released into a vulnerable population.
Given the complexity involved in giving medical advice, the process of developing these chatbots must be rigorous and transparent, unlike NEDA’s approach.
“We don’t have a full understanding of what’s going on in these models. They’re a black box,” says Stephen Schueller, a clinical psychologist at the University of California, Irvine.
The health crisis
In March 2020, the world dove head-first into a heavily virtual world as countries scrambled to try and halt the pandemic. Even with lockdowns, hospitals were overwhelmed by the virus. The downstream effects of these lifesaving measures are still being felt, especially in mental health. Anxiety and depression are at all-time highs in teens, and a new report in The Lancet showed that post-Covid rates of newly diagnosed eating disorders in girls aged 13-16 were 42.4 percent higher than previous years.
And the crisis isn’t just in mental health.
“People are so desperate for health care advice that they'll actually go online and post pictures of [their intimate areas] and ask what kind of STD they have on public social media,” says John Ayers, an epidemiologist at the University of California, San Diego.
For many people, the choice isn’t chatbot vs. well-trained physician, but chatbot vs. nothing at all.
I know a bit about that desperation. Like Maxwell, I have struggled with a multi-decade eating disorder. I spent my 20s and 30s bouncing from crisis to crisis. I have called suicide hotlines, gone to emergency rooms, and spent weeks-on-end confined to hospital wards. Though I have found recovery in recent years, I’m still not sure what ultimately made the difference. A relapse isn't improbably, given my history. Even if I relapsed again, though, I don’t know it would occur to me to ask an AI system for help.
For one, I am privileged to have assembled a stellar group of outpatient professionals who know me, know what trips me up, and know how to respond to my frantic texts. Ditto for my close friends. What I often need is a shoulder to cry on or a place to vent—someone to hear and validate my distress. What’s more, my trust in these individuals far exceeds my confidence in the companies that create these chatbots. The Internet is full of health advice, much of it bad. Even for high-quality, evidence-based advice, medicine is often filled with disagreements about how the evidence might be applied and for whom it’s relevant. All of this is key in the training of AI systems like ChatGPT, and many AI companies remain silent on this process, Schueller says.
The problem, Ayers points out, is that for many people, the choice isn’t chatbot vs. well-trained physician, but chatbot vs. nothing at all. Hence the proliferation of “does this infection make my scrotum look strange?” questions. Where AI can truly shine, he says, is not by providing direct psychological help but by pointing people towards existing resources that we already know are effective.
“It’s important that these chatbots connect [their users to] to provide that human touch, to link you to resources,” Ayers says. “That’s where AI can actually save a life.”
Before building a chatbot and releasing it, developers need to pause and consult with the communities they hope to serve.
Unfortunately, many systems don’t do this. In a study published last month in the Journal of the American Medical Association, Ayers and colleagues found that although the chatbots did well at providing evidence-based answers, they often didn’t provide referrals to existing resources. Despite this, in an April 2023 study, Ayers’s team found that both patients and professionals rated the quality of the AI responses to questions, measured by both accuracy and empathy, rather highly. To Ayers, this means that AI developers should focus more on the quality of the information being delivered rather than the method of delivery itself.
Many mental health professionals have months-long waitlists, which leaves individuals to deal with illnesses on their own.
Adobe Stock
The human touch
The mental health field is facing timing constraints, too. Even before the pandemic, the U.S. suffered from a shortage of mental health providers. Since then, the rates of anxiety, depression, and eating disorders have spiked even higher, and many mental health professionals report waiting lists that are months long. Without support, individuals are left to try and cope on their own, which often means their condition deteriorates even further.
Nor do mental health crises happen during office hours. I struggled the most late at night, long after everyone else had gone to bed. I needed support during those times when I was most liable to hurt myself, not in the mornings and afternoons when I was at work.
In this sense, a 24/7 chatbot makes lots of sense. “I don't think we should stifle innovation in this space,” Schueller says. “Because if there was any system that needs to be innovated, it's mental health services, because they are sadly insufficient. They’re terrible.”
But before building a chatbot and releasing it, Tina Hernandez-Boussard, a data scientist at Stanford Medicine, says that developers need to pause and consult with the communities they hope to serve. It requires a deep understanding of what their needs are, the language they use to describe their concerns, existing resources, and what kinds of topics and suggestions aren’t helpful. Even asking a simple question at the beginning of a conversation such as “Do you want to talk to an AI or a human?” could allow those individuals to pick the type of interaction that suits their needs, Hernandez-Boussard says.
NEDA did none of these things before deploying Tessa. The researchers who developed the online body positivity self-help program upon which Tessa was initially based created a set of online question-and-answer exercises to improve body image. It didn’t involve generative AI that could write its own answers. The bot deployed by NEDA did use generative AI, something that no one in the eating disorder community was aware of before Tessa was brought online. Consulting those with lived experience would have flagged Tessa’s weight loss and “healthy eating” recommendations, Conason says.
The question for healthcare isn’t whether to use AI, but how.
NEDA did not comment on initial Tessa’s development and deployment, but a spokesperson told Leaps.org that “Tessa will be back online once we are confident that the program will be run with the rule-based approach as it was designed.”
The tech and therapist collaboration
The question for healthcare isn’t whether to use AI, but how. Already, AI can spot anomalies on medical images with greater precision than human eyes and can flag specific areas of an image for a radiologist to review in greater detail. Similarly, in mental health, AI should be an add-on for therapy, not a counselor-in-a-box, says Aniket Bera, an expert on AI and mental health at Purdue University.
“If [AIs] are going to be good helpers, then we need to understand humans better,” Bera says. That means understanding what patients and therapists alike need help with and respond to.
One of the biggest challenges of struggling with chronic illness is the dehumanization that happens. You become a patient number, a set of laboratory values and test scores. Treatment is often dictated by invisible algorithms and rules that you have no control over or access to. It’s frightening and maddening. But this doesn’t mean chatbots don’t have any place in medicine and mental health. An AI system could help provide appointment reminders and answer procedural questions about parking and whether someone should fast before a test or a procedure. They can help manage billing and even provide support between outpatient sessions by offering suggestions for what coping skills to use, the best ways to manage anxiety, and point to local resources. As the bots get better, they may eventually shoulder more and more of the burden of providing mental health care. But as Maxwell learned with Tessa, it’s still no replacement for human interaction.
“I'm not suggesting we should go in and start replacing therapists with technologies,” Schueller says. Instead, he advocates for a therapist-tech collaboration. “The technology side and the human component—these things need to come together.”
Every year, around two million people worldwide die of liver disease. While some people inherit the disease, it’s most commonly caused by hepatitis, obesity and alcoholism. These underlying conditions kill liver cells, causing scar tissue to form until eventually the liver cannot function properly. Since 1979, deaths due to liver disease have increased by 400 percent.
The sooner the disease is detected, the more effective treatment can be. But once symptoms appear, the liver is already damaged. Around 50 percent of cases are diagnosed only after the disease has reached the final stages, when treatment is largely ineffective.
To address this problem, Owlstone Medical, a biotech company in England, has developed a breath test that can detect liver disease earlier than conventional approaches. Human breath contains volatile organic compounds (VOCs) that change in the first stages of liver disease. Owlstone’s breath test can reliably collect, store and detect VOCs, while picking out the specific compounds that reveal liver disease.
“There’s a need to screen more broadly for people with early-stage liver disease,” says Owlstone’s CEO Billy Boyle. “Equally important is having a test that's non-invasive, cost effective and can be deployed in a primary care setting.”
The standard tool for detection is a biopsy. It is invasive and expensive, making it impractical to use for people who aren't yet symptomatic. Meanwhile, blood tests are less invasive, but they can be inaccurate and can’t discriminate between different stages of the disease.
In the past, breath tests have not been widely used because of the difficulties of reliably collecting and storing breath. But Owlstone’s technology could help change that.
The team is testing patients in the early stages of advanced liver disease, or cirrhosis, to identify and detect these biomarkers. In an initial study, Owlstone’s breathalyzer was able to pick out patients who had early cirrhosis with 83 percent sensitivity.
Boyle’s work is personally motivated. His wife died of colorectal cancer after she was diagnosed with a progressed form of the disease. “That was a big impetus for me to see if this technology could work in early detection,” he says. “As a company, Owlstone is interested in early detection across a range of diseases because we think that's a way to save lives and a way to save costs.”
How it works
In the past, breath tests have not been widely used because of the difficulties of reliably collecting and storing breath. But Owlstone’s technology could help change that.
Study participants breathe into a mouthpiece attached to a breath sampler developed by Owlstone. It has cartridges are designed and optimized to collect gases. The sampler specifically targets VOCs, extracting them from atmospheric gases in breath, to ensure that even low levels of these compounds are captured.
The sampler can store compounds stably before they are assessed through a method called mass spectrometry, in which compounds are converted into charged atoms, before electromagnetic fields filter and identify even the tiniest amounts of charged atoms according to their weight and charge.
The top four compounds in our breath
In an initial study, Owlstone captured VOCs in breath to see which ones could help them tell the difference between people with and without liver disease. They tested the breath of 46 patients with liver disease - most of them in the earlier stages of cirrhosis - and 42 healthy people. Using this data, they were able to create a diagnostic model. Individually, compounds like 2-Pentanone and limonene performed well as markers for liver disease. Owlstone achieved even better performance by examining the levels of the top four compounds together, distinguishing between liver disease cases and controls with 95 percent accuracy.
“It was a good proof of principle since it looks like there are breath biomarkers that can discriminate between diseases,” Boyle says. “That was a bit of a stepping stone for us to say, taking those identified, let’s try and dose with specific concentrations of probes. It's part of building the evidence and steering the clinical trials to get to liver disease sensitivity.”
Sabine Szunerits, a professor of chemistry in Institute of Electronics at the University of Lille, sees the potential of Owlstone’s technology.
“Breath analysis is showing real promise as a clinical diagnostic tool,” says Szunerits, who has no ties with the company. “Owlstone Medical’s technology is extremely effective in collecting small volatile organic biomarkers in the breath. In combination with pattern recognition it can give an answer on liver disease severity. I see it as a very promising way to give patients novel chances to be cured.”
Improving the breath sampling process
Challenges remain. With more than one thousand VOCs found in the breath, it can be difficult to identify markers for liver disease that are consistent across many patients.
Julian Gardner is a professor of electrical engineering at Warwick University who researches electronic sensing devices. “Everyone’s breath has different levels of VOCs and different ones according to gender, diet, age etc,” Gardner says. “It is indeed very challenging to selectively detect the biomarkers in the breath for liver disease.”
So Owlstone is putting chemicals in the body that they know interact differently with patients with liver disease, and then using the breath sampler to measure these specific VOCs. The chemicals they administer are called Exogenous Volatile Organic Compound) probes, or EVOCs.
Most recently, they used limonene as an EVOC probe, testing 29 patients with early cirrhosis and 29 controls. They gave the limonene to subjects at specific doses to measure how its concentrations change in breath. The aim was to try and see what was happening in their livers.
“They are proposing to use drugs to enhance the signal as they are concerned about the sensitivity and selectivity of their method,” Gardner says. “The approach of EVOC probes is probably necessary as you can then eliminate the person-to-person variation that will be considerable in the soup of VOCs in our breath.”
Through these probes, Owlstone could identify patients with liver disease with 83 percent sensitivity. By targeting what they knew was a disease mechanism, they were able to amplify the signal. The company is starting a larger clinical trial, and the plan is to eventually use a panel of EVOC probes to make sure they can see diverging VOCs more clearly.
“I think the approach of using probes to amplify the VOC signal will ultimately increase the specificity of any VOC breath tests, and improve their practical usability,” says Roger Yazbek, who leads the South Australian Breath Analysis Research (SABAR) laboratory in Flinders University. “Whilst the findings are interesting, it still is only a small cohort of patients in one location.”
The future of breath diagnosis
Owlstone wants to partner with pharmaceutical companies looking to learn if their drugs have an effect on liver disease. They’ve also developed a microchip, a miniaturized version of mass spectrometry instruments, that can be used with the breathalyzer. It is less sensitive but will enable faster detection.
Boyle says the company's mission is for their tests to save 100,000 lives. "There are lots of risks and lots of challenges. I think there's an opportunity to really establish breath as a new diagnostic class.”
Bacterial antibiotic resistance has been a concern in the medical field for several years. Now a new, similar threat is arising: drug-resistant fungal infections. The Centers for Disease Control and Prevention considers antifungal and antimicrobial resistance to be among the world’s greatest public health challenges.
One particular type of fungal infection caused by Candida auris is escalating rapidly throughout the world. And to make matters worse, C. auris is becoming increasingly resistant to current antifungal medications, which means that if you develop a C. auris infection, the drugs your doctor prescribes may not work. “We’re effectively out of medicines,” says Thomas Walsh, founding director of the Center for Innovative Therapeutics and Diagnostics, a translational research center dedicated to solving the antimicrobial resistance problem. Walsh spoke about the challenges at a Demy-Colton Virtual Salon, one in a series of interactive discussions among life science thought leaders.
Although C. auris typically doesn’t sicken healthy people, it afflicts immunocompromised hospital patients and may cause severe infections that can lead to sepsis, a life-threatening condition in which the overwhelmed immune system begins to attack the body’s own organs. Between 30 and 60 percent of patients who contract a C. auris infection die from it, according to the CDC. People who are undergoing stem cell transplants, have catheters or have taken antifungal or antibiotic medicines are at highest risk. “We’re coming to a perfect storm of increasing resistance rates, increasing numbers of immunosuppressed patients worldwide and a bug that is adapting to higher temperatures as the climate changes,” says Prabhavathi Fernandes, chair of the National BioDefense Science Board.
Most Candida species aren’t well-adapted to our body temperatures so they aren’t a threat. C. auris, however, thrives at human body temperatures.
Although medical professionals aren’t concerned at this point about C. auris evolving to affect healthy people, they worry that its presence in hospitals can turn routine surgeries into life-threatening calamities. “It’s coming,” says Fernandes. “It’s just a matter of time.”
An emerging global threat
“Fungi are found in the environment,” explains Fernandes, so Candida spores can easily wind up on people’s skin. In hospitals, they can be transferred from contact with healthcare workers or contaminated surfaces. Most Candida species aren’t well-adapted to our body temperatures so they aren’t a threat. C. auris, however, thrives at human body temperatures. It can enter the body during medical treatments that break the skin—and cause an infection. Overall, fungal infections cost some $48 billion in the U.S. each year. And infection rates are increasing because, in an ironic twist, advanced medical therapies are enabling severely ill patients to live longer and, therefore, be exposed to this pathogen.
The first-ever case of a C. auris infection was reported in Japan in 2009, although an analysis of Candida samples dated the earliest strain to a 1996 sample from South Korea. Since then, five separate varieties – called clades, which are similar to strains among bacteria – developed independently in different geographies: South Asia, East Asia, South Africa, South America and, recently, Iran. So far, C. auris infections have been reported in 35 countries.
In the U.S., the first infection was reported in 2016, and the CDC started tracking it nationally two years later. During that time, 5,654 cases have been reported to the CDC, which only tracks U.S. data.
What’s more notable than the number of cases is their rate of increase. In 2016, new cases increased by 175 percent and, on average, they have approximately doubled every year. From 2016 through 2022, the number of infections jumped from 63 to 2,377, a roughly 37-fold increase.
“This reminds me of what we saw with epidemics from 2013 through 2020… with Ebola, Zika and the COVID-19 pandemic,” says Robin Robinson, CEO of Spriovas and founding director of the Biomedical Advanced Research and Development Authority (BARDA), which is part of the U.S. Department of Health and Human Services. These epidemics started with a hockey stick trajectory, Robinson says—a gradual growth leading to a sharp spike, just like the shape of a hockey stick.
Another challenge is that right now medics don’t have rapid diagnostic tests for fungal infections. Currently, patients are often misdiagnosed because C. auris resembles several other easily treated fungi. Or they are diagnosed long after the infection begins and is harder to treat.
The problem is that existing diagnostics tests can only identify C. auris once it reaches the bloodstream. Yet, because this pathogen infects bodily tissues first, it should be possible to catch it much earlier before it becomes life-threatening. “We have to diagnose it before it reaches the bloodstream,” Walsh says.
The most alarming fact is that some Candida infections no longer respond to standard therapeutics.
“We need to focus on rapid diagnostic tests that do not rely on a positive blood culture,” says John Sperzel, president and CEO of T2 Biosystems, a company specializing in diagnostics solutions. Blood cultures typically take two to three days for the concentration of Candida to become large enough to detect. The company’s novel test detects about 90 percent of Candida species within three to five hours—thanks to its ability to spot minute quantities of the pathogen in blood samples instead of waiting for them to incubate and proliferate.
Unlike other Candida species C. auris thrives at human body temperatures
Adobe Stock
Tackling the resistance challenge
The most alarming fact is that some Candida infections no longer respond to standard therapeutics. The number of cases that stopped responding to echinocandin, the first-line therapy for most Candida infections, tripled in 2020, according to a study by the CDC.
Now, each of the first four clades shows varying levels of resistance to all three commonly prescribed classes of antifungal medications, such as azoles, echinocandins, and polyenes. For example, 97 percent of infections from C. auris Clade I are resistant to fluconazole, 54 percent to voriconazole and 30 percent of amphotericin. Nearly half are resistant to multiple antifungal drugs. Even with Clade II fungi, which has the least resistance of all the clades, 11 to 14 percent have become resistant to fluconazole.
Anti-fungal therapies typically target specific chemical compounds present on fungi’s cell membranes, but not on human cells—otherwise the medicine would cause damage to our own tissues. Fluconazole and other azole antifungals target a compound called ergosterol, preventing the fungal cells from replicating. Over the years, however, C. auris evolved to resist it, so existing fungal medications don’t work as well anymore.
A newer class of drugs called echinocandins targets a different part of the fungal cell. “The echinocandins – like caspofungin – inhibit (a part of the fungi) involved in making glucan, which is an essential component of the fungal cell wall and is not found in human cells,” Fernandes says. New antifungal treatments are needed, she adds, but there are only a few magic bullets that will hit just the fungus and not the human cells.
Research to fight infections also has been challenged by a lack of government support. That is changing now that BARDA is requesting proposals to develop novel antifungals. “The scope includes C. auris, as well as antifungals following a radiological/nuclear emergency, says BARDA spokesperson Elleen Kane.
The remaining challenge is the number of patients available to participate in clinical trials. Large numbers are needed, but the available patients are quite sick and often die before trials can be completed. Consequently, few biopharmaceutical companies are developing new treatments for C. auris.
ClinicalTrials.gov reports only two drugs in development for invasive C. auris infections—those than can spread throughout the body rather than localize in one particular area, like throat or vaginal infections: ibrexafungerp by Scynexis, Inc., fosmanogepix, by Pfizer.
Scynexis’ ibrexafungerp appears active against C. auris and other emerging, drug-resistant pathogens. The FDA recently approved it as a therapy for vaginal yeast infections and it is undergoing Phase III clinical trials against invasive candidiasis in an attempt to keep the infection from spreading.
“Ibreafungerp is structurally different from other echinocandins,” Fernandes says, because it targets a different part of the fungus. “We’re lucky it has activity against C. auris.”
Pfizer’s fosmanogepix is in Phase II clinical trials for patients with invasive fungal infections caused by multiple Candida species. Results are showing significantly better survival rates for people taking fosmanogepix.
Although C. auris does pose a serious threat to healthcare worldwide, scientists try to stay optimistic—because they recognized the problem early enough, they might have solutions in place before the perfect storm hits. “There is a bit of hope,” says Robinson. “BARDA has finally been able to fund the development of new antifungal agents and, hopefully, this year we can get several new classes of antifungals into development.”