The Science Sleuth Holding Fraudulent Research Accountable
Kira Peikoff was the editor-in-chief of Leaps.org from 2017 to 2021. As a journalist, her work has appeared in The New York Times, Newsweek, Nautilus, Popular Mechanics, The New York Academy of Sciences, and other outlets. She is also the author of four suspense novels that explore controversial issues arising from scientific innovation: Living Proof, No Time to Die, Die Again Tomorrow, and Mother Knows Best. Peikoff holds a B.A. in Journalism from New York University and an M.S. in Bioethics from Columbia University. She lives in New Jersey with her husband and two young sons. Follow her on Twitter @KiraPeikoff.
Introduction by Mary Inman, Whistleblower Attorney
For most people, when they see the word "whistleblower," the image that leaps to mind is a lone individual bravely stepping forward to shine a light on misconduct she has witnessed first-hand. Meryl Streep as Karen Silkwood exposing safety violations observed while working the line at the Kerr-McGee plutonium plant. Matt Damon as Mark Whitacre in The Informant!, capturing on his pocket recorder clandestine meetings between his employer and its competitors to fix the price of lysine. However, a new breed of whistleblower is emerging who isn't at the scene of the crime but instead figures it out after the fact through laborious review of publicly available information and expert analysis. Elisabeth Bik belongs to this new class of whistleblower.
"There's this delicate balance where on one hand we want to spread results really fast as scientists, but on the other hand, we know it's incomplete, it's rushed and it's not great."
Using her expertise as a microbiologist and her trained eye, Bik studies publicly available scientific papers to sniff out potential irregularities in the images that suggest research fraud, later seeking retraction of the offending paper from the journal's publisher. There's no smoking gun, no first-hand account of any kind. Just countless hours spent reviewing scores of scientific papers and Bik's skills and dedication as a science fraud sleuth.
While Bik's story may not as readily lend itself to the big screen, her work is nonetheless equally heroic. By tirelessly combing scientific papers to expose research fraud, Bik is playing a vital role in holding the scientific publishing process accountable and ensuring that misleading information does not spread unchecked. This is important work in any age, but particularly so in the time of COVID, where we can ill afford the setbacks and delays of scientists building on false science. In the present climate, where science is politicized and scientific principles are under attack, strong voices like Bik's must rise above the din to ensure the scientific information we receive, and our governments act upon, is accurate. Our health and wellbeing depend on it.
Whistleblower outsiders like Bik are challenging the traditional concept of what it means to be a whistleblower. Fortunately for us, the whistleblower community is a broad church. As with most ecosystems, we all benefit from a diversity of voices —whistleblower insiders and outsiders alike. What follows is an illuminating conversation between Bik, and Ivan Oransky, the co-founder of Retraction Watch, an influential blog that reports on retractions of scientific papers and related topics. (Conversation facilitated by LeapsMag Editor-in-Chief Kira Peikoff)
Elisabeth Bik and Ivan Oransky.
(Photo credits Michel & Co Photography, San Jose, CA and Elizabeth Solaka)
Ivan
I'd like to hear your thoughts, Elisabeth, on an L.A. Times story, which was picking up a preprint about mutations and the novel coronavirus, alleging that the virus is mutating to become more infectious – even though this conclusion wasn't actually warranted.
Elisabeth
A lot of the news around it is picking up on one particular side of the story that is maybe not that much exaggerated by the scientists. I don't think this paper really showed that the mutations were causing the virus to be more virulent. Some of these viruses continuously mutate and mutate and mutate, and that doesn't necessarily make a strain more virulent. I think in many cases, a lot of people want to read something in a paper that is not actually there.
Ivan
The tone level, everything that's being published now, it's problematic. It's being rushed, here it wasn't even peer-reviewed. But even when they are peer-reviewed, they're being peer-reviewed by people who often aren't really an expert in that particular area.
Elisabeth
That's right.
Ivan
To me, it's all problematic. At the same time, it's all really good that it's all getting out there. I think that five or 10 years ago, or if we weren't in a pandemic, maybe that paper wouldn't have appeared at all. It would have maybe been submitted to a top-ranked journal and not have been accepted, or maybe it would have been improved during peer review and bounced down the ladder a bit to a lower-level journal.
Yet, now, because it's about coronavirus, it's in a major newspaper and, in fact, it's getting critiqued immediately.
Maybe it's too Pollyanna-ish, but I actually think that quick uploading is a good thing. The fear people have about preprint servers is based on this idea that the peer-reviewed literature is perfect. Once it is in a peer-reviewed journal, they think it must have gone through this incredible process. You're laughing because-
Elisabeth
I am laughing.
Ivan
You know it's not true.
Elisabeth
Yes, we both know that. I agree and I think in this particular situation, a pandemic that is unlike something our generation has seen before, there is a great, great need for fast dissemination of science.
If you have new findings, it is great that there is a thing called a preprint server where scientists can quickly share their results, with, of course, the caveat that it's not peer-reviewed yet.
It's unlike the traditional way of publishing papers, which can take months or years. Preprint publishing is a very fast way of spreading your results in a good way so that is what the world needs right now.
On the other hand, of course, there's the caveat that these are brand new results and a good scientist usually thinks about their results to really interpret it well. You have to look at it from all sides and I think with the rushed publication of preprint papers, there is no such thing as carefully thinking about what results might mean.
So there's this delicate balance where on one hand we want to spread results really fast as scientists, but on the other hand, we know it's incomplete, it's rushed and it's not great. This might be hard for the general audience to understand.
Ivan
I still think the benefits of that dissemination are more positive than negative.
Elisabeth
Right. But there's also so many papers that come out now on preprint servers and most of them are not that great, but there are some really good studies in there. It's hard to find those nuggets of really great papers. There's just a lot of papers that come out now.
Ivan
Well, you've made more than a habit of finding problems in papers. These are mostly, of course, until now published papers that you examined, but what is this time like for you? How is it different?
Elisabeth
It's different because in the beginning I looked at several COVID-19-related papers that came out and wrote some critiques about it. I did experience a lot of backlash because of that. So I felt I had to take a break from social media and from writing about COVID-19.
I focused a little bit more on other work because I just felt that a lot of these papers on COVID-19 became so politically divisive that if you tried to be a scientist and think critically about a paper, you were actually assigned to a particular political party or to be against other political parties. It's hard for me to be sucked into the political discussion and to the way that our society now is so completely divided into two camps that seem to be not listening to each other.
Ivan
I was curious about that because I've followed your work for a number of years, as you know, and certainly you have had critics before. I'm thinking of the case in China that you uncovered, the leading figure in the Chinese Academy who was really a powerful political figure in addition to being a scientist.
Elisabeth
So that was a case in which I found a couple of papers at first from a particular group in China, and I was just posting on a website called PubPeer, where you can post comments, concerns about papers. And in this case, these were image duplication issues, which is my specialty.
I did not realize that the group I was looking at at that moment was led by one of the highest ranked scientists in China. If I had known that, I would probably not have posted that under my full name, but under a pseudonym. Since I had already posted, some people were starting to send me direct messages on Twitter like, "OMG, the guy you're posting about now is the top scientist in China so you're going to have a lot of backlash."
Then I decided I'll just continue doing this. I found a total of around 50 papers from this group and posted all of them on PubPeer. That story quickly became a very popular story in China: number two on Sina Weibo, a social media site in China.
I was surprised it wasn't suppressed by the Chinese government, it was actually allowed by journalists that were writing about it, and I didn't experience a lot of backlash because of that.
Actually the Chinese doctor wrote me an email saying that he appreciated my feedback and that he would look into these cases. He sent a very polite email so I sent him back that I appreciated that he would look into these cases and left it there.
Ivan
There are certain subjects that I know when we write about them in Retraction Watch, they have tended in the past to really draw a lot of ire. I'm thinking anything about vaccines and autism, anything about climate change, stem cell research.
For a while that last subject has sort of died down. But now it's become a highly politically charged atmosphere. Do you feel that this pandemic has raised the profile of people such as yourself who we refer to as scientific sleuths, people who look critically and analytically at new research?
Elisabeth
Yeah, some people. But I'm also worried that some people who are great scientists and have shown a lot of critical thinking are being attacked because of that. If you just look at what happened to Dr. Fauci, I think that's a prime example. Where somebody who actually is very knowledgeable and very cautious of new science has not been widely accepted as a great leader, in our country at least. It's sad to see that. I'm just worried how long he will be at his position, to be honest.
Ivan
We noticed a big uptick in our traffic in the last few days to Retraction Watch and it turns out it was because someone we wrote about a number of years ago has really hopped on the bandwagon to try and discredit and even try to have Dr. Fauci fired.
It's one of these reminders that the way people think about scientists has, in many cases, far more to do with their own history or their own perspective going in than with any reality or anything about the science. It's pretty disturbing, but it's not a new thing. This has been happening for a while.
You can go back and read sociologists of science from 50-60 years ago and see the same thing, but I just don't think that it's in the same way that it is now, maybe in part because of social media.
Elisabeth
I've been personally very critical about several studies, but this is the first time I've experienced being attacked by trolls and having some nasty websites written about me. It is very disturbing to read.
"I don't think that something that's been peer-reviewed is perfect and something that hasn't been peer reviewed, you should never bother reading it."
Ivan
It is. Yet you have been a fearless and vocal critic of some very high-profile papers, like the infamous French study about hydroxychloroquine.
Elisabeth
Right, the paper that came out was immediately tweeted by the President of the United States. At first I thought it was great that our President tweeted about science! I thought that was a major breakthrough. I took a look at this paper.
It had just come out that day, I believe. The first thing I noticed is that it was accepted within 24 hours of being submitted to the journal. It was actually published in a journal where one of the authors is the editor-in-chief, which is a huge conflict of interest, but it happens.
But in this particular case, there were also a lot of flaws with the study and that, I think, should have been caught during peer review. The paper was first published on a preprint server and then within 24 hours or so it was published in that paper, supposedly after peer review.
There were very few changes between the preprint version and the peer review paper. There were just a couple of extra lines, extra sentences added here and there, but it wasn't really, I think, critically looked at. Because there were a lot of things that I thought were flaws.
Just to go over a couple of them. This paper showed supposedly that people who were treated with hydroxychloroquine and azithromycin were doing much better by clearing their virus much faster than people who were not treated with these drugs.
But if you look carefully at the paper there were a couple of people who were left out of the study. So they were treated with hydroxychloroquine, but they were not shown in the end results of the paper. All six people who were treated with the drug combination were clearing the virus within six days, but there were a couple of others who were left out of the study. They also started the drug combination, but they stopped taking the drugs for several reasons and three of them were admitted to the intensive care, one died, one had some side effects and one apparently walked out of the hospital.
They were left out of the study but they were actually not doing very well with the drug combination. It's not very good science if you leave out people who don't do very well with your drug combination in your study. That was one of my biggest critiques of the paper.
Ivan
What struck us about that case was, in addition to what you, of course, mentioned, the fact that Trump tweeted it and was talking about hydroxychloroquine, was that it seemed to be a perfect example of, "well, it was in a peer review journal." Yeah, it was a preprint first, but, well, it's a peer review journal. And yet, as you point out, when you look at the history of the paper, it was accepted in 24 hours.
If you talk to most scientists, the actual act of a peer review, once you sit down to do it and can concentrate, a good one takes, again, these are averages, but four hours, a half a day is not unreasonable. So you had to find three people who could suddenly review this paper. As you pointed out, it was in a journal where one of the authors was editor.
Then some strange things also happened, right? The society that actually publishes the journal, they came out with a statement saying this wasn't up to our standards, which is odd. Then Elsevier came in, they're the ones who are actually contracted to publish the journal for the society. They said, basically, "Oh, we're going to look into this now too."
It just makes you wonder what happened before the paper was actually published. All the people who were supposed to have been involved in doing the peer review or checking on it are clearly very distraught about what actually happened. It's that scene from Casablanca, "I'm shocked, shocked there's gambling going on here." And then, "Your winnings, sir."
Elisabeth
Yes.
Ivan
And I don't actually blame the public, I don't blame reporters for getting a bit confused about what it all means and what they should trust. I don't think trust is a binary any more than anything else is a binary. I don't think that something that's been peer-reviewed is perfect and something that hasn't been peer reviewed, you should never bother reading it. I think everything is much more gray.
Yet we've turned things into a binary. Even if you go back before coronavirus, coffee is good for you, coffee is bad for you, red wine, chocolate, all the rest of it. A lot of that is because of this sort of binary construct of the world for journalists, frankly, for scientists that need to get their next grants. And certainly for the general public, they want answers.
On the one hand, if I had to choose what group of experts, or what field of human endeavor would I trust with finding the answer to a pandemic like this, or to any crisis, it would absolutely be scientists. Hands down. This is coming from someone who writes about scientific fraud.
But on the other hand, that means that if scientists aren't clear about what they don't know and about the nuances and about what the scientific method actually allows us to do and learn, that just sets them up for failure. It sets people like Dr. Fauci up for failure.
Elisabeth
Right.
Ivan
It sets up any public health official who has a discussion about models. There's a famous saying: "All models are wrong, but some are useful."
Just because the projections change, it's not proof of wrongness, it's not proof that the model is fatally flawed. In fact, I'd be really concerned if the projections didn't change based on new information. I would love it if this whole episode did lead to a better understanding of the scientific process and how scientific publishing fits into that — and doesn't fit into it.
Elisabeth
Yes, I'm with you. I'm very worried that the general audience's perspective is based on maybe watching too many movies where the scientist comes up with a conclusion one hour into the movie when everything is about to fail. Like that scene in Contagion where somebody injects, I think, eight monkeys, and one of the monkeys survives and boom we have the vaccine. That's not really how science works. Everything takes many, many years and many, many applications where usually your first ideas and your first hypothesis turn out to be completely wrong.
Then you go back to the drawing board, you develop another hypothesis and this is a very reiterative process that usually takes years. Most of the people who watch the movie might have a very wrong idea and wrong expectations about how science works. We're living in the movie Contagion and by September, we'll all be vaccinated and we can go on and live our lives. But that's not what is going to happen. It's going to take much, much longer and we're going to have to change the models every time and change our expectations. Just because we don't know all the numbers and all the facts yet.
Ivan
Generally it takes a fairly long time to change medical practice. A lot of times people see that as a bad thing. What I think that ignores, or at least doesn't take into as much account as I would, is that you don't want doctors and other health care professionals to turn on a dime and suddenly switch. Unless, of course, it turns out there was no evidence for what you were looking at.
It's a complicated situation.
Everybody wants scientists to be engineers, right?
Elisabeth
Right.
Ivan
I'm not saying engineering isn't scientific, nor am I saying that science is just completely whimsical, but there's a different process. It's a different way of looking at things and you can't just throw all the data into a big supercomputer, which is what I think a lot of people seem to want us to do, and then the obvious answer will come out on the other side.
Elisabeth
No. It's true and a lot of engineers suddenly feel their inherent need to solve this as a problem. They're not scientists and it's not building a bridge over a big river. But we're dealing with something that is very hard to solve because we don't understand the problem yet. I think scientists are usually first analyzing the problem and trying to understand what the problem actually is before you can even think about a solution.
Ivan
I think we're still at the understanding the problem phase.
Elisabeth
Exactly. And going back to the French group paper, that promised such a result and that was interpreted as such by a lot of people including presidents, but it's a very rare thing to find a medication that will have a 100% curation rate. That's something that I wish the people would understand better. We all want that to happen, but it's very unlikely and very unprecedented in the best of times.
Ivan
I would second that and also say that the world needs to better value the work that people like Elisabeth and others are doing. Because we're not going to get to a better answer if we're not rigorous about scrutinizing the literature and scrutinizing the methodology and scrutinizing the results.
"I quit my job to be able to do this work."
It's a relatively new phenomenon that you're able to do this at any scale at all, and even now it's at a very small scale. Elisabeth mentioned PubPeer and I'm a big fan — also full disclosure, I'm on their board of directors as a volunteer — it's a very powerful engine for readers and journal editors and other scientists to discuss issues.
And Elisabeth has used it really, really well. I think we need to start giving credit to people like that. And, also creating incentives for that kind of work in a way that science hasn't yet.
Elisabeth
Yeah. I quit my job to be able to do this work. It's really hard to combine it with a job either in academia or industry because we're looking for or criticizing papers and it's hard when you are still employed to do that.
I try to make it about the papers and do it in a polite way, but still it's a very hard job to do if you have a daytime job and a position and a career to worry about. Because if you're critical of other academics, that could actually mean the end of your career and that's sad. They should be more open to polite criticism.
Ivan
And for the general public, if you're reading a newspaper story or something online about a single study and it doesn't mention any other studies that have said the same thing or similar, or frankly, if it doesn't say anything about any studies that contradicted it, that's probably also telling you something.
Say you're looking at a huge painting of a shoreline, a beach, and a forest. Any single study is just a one-centimeter-by-one-centimeter square of any part of that canvas. If you just look at that, you would either think it was a painting of the sea, of a beach, or of the forest. It's actually all three of those things.
We just need to be patient, and that's very challenging to us as human beings, but we need to take the time to look at the whole picture.
DISCLAIMER: Neither Elisabeth Bik nor Ivan Oransky was compensated for participation in The Pandemic Issue. While the magazine's editors suggested broad topics for discussion, consistent with Bik's and Oransky's work, neither they nor the magazine's underwriters had any influence on their conversation.
[Editor's Note: This article was originally published on June 8th, 2020 as part of a standalone magazine called GOOD10: The Pandemic Issue. Produced as a partnership among LeapsMag, The Aspen Institute, and GOOD, the magazine is available for free online.]
Kira Peikoff was the editor-in-chief of Leaps.org from 2017 to 2021. As a journalist, her work has appeared in The New York Times, Newsweek, Nautilus, Popular Mechanics, The New York Academy of Sciences, and other outlets. She is also the author of four suspense novels that explore controversial issues arising from scientific innovation: Living Proof, No Time to Die, Die Again Tomorrow, and Mother Knows Best. Peikoff holds a B.A. in Journalism from New York University and an M.S. in Bioethics from Columbia University. She lives in New Jersey with her husband and two young sons. Follow her on Twitter @KiraPeikoff.
Scientists find enzymes in nature that could replace toxic chemicals
Some 900 miles off the coast of Portugal, nine major islands rise from the mid-Atlantic. Verdant and volcanic, the Azores archipelago hosts a wealth of biodiversity that keeps field research scientist, Marlon Clark, returning for more. “You’ve got this really interesting biogeography out there,” says Clark. “There’s real separation between the continents, but there’s this inter-island dispersal of plants and seeds and animals.”
It’s a visual paradise by any standard, but on a microscopic level, there’s even more to see. The Azores’ nutrient-rich volcanic rock — and its network of lagoons, cave systems, and thermal springs — is home to a vast array of microorganisms found in a variety of microclimates with different elevations and temperatures.
Clark works for Basecamp Research, a biotech company headquartered in London, and his job is to collect samples from ecosystems around the world. By extracting DNA from soil, water, plants, microbes and other organisms, Basecamp is building an extensive database of the Earth’s proteins. While DNA itself isn’t a protein, the information stored in DNA is used to create proteins, so extracting, sequencing, and annotating DNA allows for the discovery of unique protein sequences.
Using what they’re finding in the middle of the Atlantic and beyond, Basecamp’s detailed database is constantly growing. The outputs could be essential for cleaning up the damage done by toxic chemicals and finding alternatives to these chemicals.
Catalysts for change
Proteins provide structure and function in all living organisms. Some of these functional proteins are enzymes, which quite literally make things happen.
“Industrial chemistry is heavily polluting, especially the chemistry done in pharmaceutical drug development. Biocatalysis is providing advantages, both to make more complex drugs and to be more sustainable, reducing the pollution and toxicity of conventional chemistry," says Ahir Pushpanath, who heads partnerships for Basecamp.
“Enzymes are perfectly evolved catalysts,” says Ahir Pushpanath, a partnerships lead at Basecamp. ”Enzymes are essentially just a polymer, and polymers are made up of amino acids, which are nature’s building blocks.” He suggests thinking about it like Legos — if you have a bunch of Lego pieces and use them to build a structure that performs a function, “that’s basically how an enzyme works. In nature, these monuments have evolved to do life’s chemistry. If we didn’t have enzymes, we wouldn’t be alive.”
In our own bodies, enzymes catalyze everything from vision to digesting food to regrowing muscles, and these same types of enzymes are necessary in the pharmaceutical, agrochemical and fine chemical industries. But industrial conditions differ from those inside our bodies. So, when scientists need certain chemical reactions to create a particular product or substance, they make their own catalysts in their labs — generally through the use of petroleum and heavy metals.
These petrochemicals are effective and cost-efficient, but they’re wasteful and often hazardous. With growing concerns around sustainability and long-term public health, it's essential to find alternative solutions to toxic chemicals. “Industrial chemistry is heavily polluting, especially the chemistry done in pharmaceutical drug development,” Pushpanath says.
Basecamp is trying to replace lab-created catalysts with enzymes found in the wild. This concept is called biocatalysis, and in theory, all scientists have to do is find the right enzymes for their specific need. Yet, historically, researchers have struggled to find enzymes to replace petrochemicals. When they can’t identify a suitable match, they turn to what Pushpanath describes as “long, iterative, resource-intensive, directed evolution” in the laboratory to coax a protein into industrial adaptation. But the latest scientific advances have enabled these discoveries in nature instead.
Marlon Clark, a research scientist at Basecamp Research, looks for novel biochemistries in the Azores.
Glen Gowers
Enzyme hunters
Whether it’s Clark and a colleague setting off on an expedition, or a local, on-the-ground partner gathering and processing samples, there’s a lot to be learned from each collection. “Microbial genomes contain complete sets of information that define an organism — much like how letters are a code allowing us to form words, sentences, pages, and books that contain complex but digestible knowledge,” Clark says. He thinks of the environmental samples as biological libraries, filled with thousands of species, strains, and sequence variants. “It’s our job to glean genetic information from these samples.”
“We can actually dream up new proteins using generative AI," Pushpanath says.
Basecamp researchers manage this feat by sequencing the DNA and then assembling the information into a comprehensible structure. “We’re building the ‘stories’ of the biota,” Clark says. The more varied the samples, the more valuable insights his team gains into the characteristics of different organisms and their interactions with the environment. Sequencing allows scientists to examine the order of nucleotides — the organic molecules that form DNA — to identify genetic makeups and find changes within genomes. The process used to be too expensive, but the cost of sequencing has dropped from $10,000 a decade ago to as low as $100. Notably, biocatalysis isn’t a new concept — there have been waves of interest in using natural enzymes in catalysis for over a century, Pushpanath says. “But the technology just wasn’t there to make it cost effective,” he explains. “Sequencing has been the biggest boon.”
AI is probably the second biggest boon.
“We can actually dream up new proteins using generative AI,” Pushpanath says, which means that biocataylsis now has real potential to scale.
Glen Gowers, the co-founder of Basecamp, compares the company’s AI approach to that of social networks and streaming services. Consider how these platforms suggest connecting with the friends of your friends, or how watching one comedy film from the 1990s leads to a suggestion of three more.
“They’re thinking about data as networks of relationships as opposed to lists of items,” says Gowers. “By doing the same, we’re able to link the metadata of the proteins — by their relationships to each other, the environments in which they’re found, the way those proteins might look similar in sequence and structure, their surrounding genome context — really, this just comes down to creating a searchable network of proteins.”
On an Azores island, this volcanic opening may harbor organisms that can help scientists identify enzymes for biocatalysis to replace toxic chemicals.
Emma Bolton
Uwe Bornscheuer, professor at the Institute of Biochemistry at the University of Greifswald, and co-founder of Enzymicals, another biocatalysis company, says that the development of machine learning is a critical component of this work. “It’s a very hot topic, because the challenge in protein engineering is to predict which mutation at which position in the protein will make an enzyme suitable for certain applications,” Bornscheuer explains. These predictions are difficult for humans to make at all, let alone quickly. “It is clear that machine learning is a key technology.”
Benefiting from nature’s bounty
Biodiversity commonly refers to plants and animals, but the term extends to all life, including microbial life, and some regions of the world are more biodiverse than others. Building relationships with global partners is another key element to Basecamp’s success. Doing so in accordance with the access and benefit sharing principles set forth by the Nagoya Protocol — an international agreement that seeks to ensure the benefits of using genetic resources are distributed in a fair and equitable way — is part of the company's ethos. “There's a lot of potential for us, and there’s a lot of potential for our partners to have exactly the same impact in building and discovering commercially relevant proteins and biochemistries from nature,” Clark says.
Bornscheuer points out that Basecamp is not the first company of its kind. A former San Diego company called Diversa went public in 2000 with similar work. “At that time, the Nagoya Protocol was not around, but Diversa also wanted to ensure that if a certain enzyme or microorganism from Costa Rica, for example, were used in an industrial process, then people in Costa Rica would somehow profit from this.”
An eventual merger turned Diversa into Verenium Corporation, which is now a part of the chemical producer BASF, but it laid important groundwork for modern companies like Basecamp to continue to scale with today’s technologies.
“To collect natural diversity is the key to identifying new catalysts for use in new applications,” Bornscheuer says. “Natural diversity is immense, and over the past 20 years we have gained the advantages that sequencing is no longer a cost or time factor.”
This has allowed Basecamp to rapidly grow its database, outperforming Universal Protein Resource or UniProt, which is the public repository of protein sequences most commonly used by researchers. Basecamp’s database is three times larger, totaling about 900 million sequences. (UniProt isn’t compliant with the Nagoya Protocol, because, as a public database, it doesn’t provide traceability of protein sequences. Some scientists, however, argue that Nagoya compliance hinders progress.)
“Eventually, this work will reduce chemical processes. We’ll have cleaner processes, more sustainable processes," says Uwe Bornscheuer, a professor at the University of Greifswald.
With so much information available, Basecamp’s AI has been trained on “the true dictionary of protein sequence life,” Pushpanath says, which makes it possible to design sequences for particular applications. “Through deep learning approaches, we’re able to find protein sequences directly from our database, without the need for further laboratory-directed evolution.”
Recently, a major chemical company was searching for a specific transaminase — an enzyme that catalyzes a transfer of amino groups. “They had already spent a year-and-a-half and nearly two million dollars to evolve a public-database enzyme, and still had not reached their goal,” Pushpanath says. “We used our AI approaches on our novel database to yield 10 candidates within a week, which, when validated by the client, achieved the desired target even better than their best-evolved candidate.”
Basecamp’s other huge potential is in bioremediation, where natural enzymes can help to undo the damage caused by toxic chemicals. “Biocatalysis impacts both sides,” says Gowers. “It reduces the usage of chemicals to make products, and at the same time, where contamination sites do exist from chemical spills, enzymes are also there to kind of mop those up.”
So far, Basecamp's round-the-world sampling has covered 50 percent of the 14 major biomes, or regions of the planet that can be distinguished by their flora, fauna, and climate, as defined by the World Wildlife Fund. The other half remains to be catalogued — a key milestone for understanding our planet’s protein diversity, Pushpanath notes.
There’s still a long road ahead to fully replace petrochemicals with natural enzymes, but biocatalysis is on an upward trajectory. "Eventually, this work will reduce chemical processes,” Bornscheuer says. “We’ll have cleaner processes, more sustainable processes.”
Small changes in how a person talks could reveal Alzheimer’s earlier
Dave Arnold retired in his 60s and began spending time volunteering in local schools. But then he started misplacing items, forgetting appointments and losing his sense of direction. Eventually he was diagnosed with early stage Alzheimer’s.
“Hearing the diagnosis made me very emotional and tearful,” he said. “I immediately thought of all my mom had experienced.” His mother suffered with the condition for years before passing away. Over the last year, Arnold has worked for the Alzheimer’s Association as one of its early stage advisors, sharing his insights to help others in the initial stages of the disease.
Arnold was diagnosed sooner than many others. It's important to find out early, when interventions can make the most difference. One promising avenue is looking at how people talk. Research has shown that Alzheimer’s affects a part of the brain that controls speech, resulting in small changes before people show other signs of the disease.
Now, Canary Speech, a company based in Utah, is using AI to examine elements like the pitch of a person’s voice and their pauses. In an initial study, Canary analyzed speech recordings with AI and identified early stage Alzheimer’s with 96 percent accuracy.
Developing the AI model
Canary Speech’s CEO, Henry O’Connell, met cofounder Jeff Adams about 40 years before they started the company. Back when they first crossed paths, they were both living in Bethesda, Maryland; O’Connell was a research fellow at the National Institutes of Health studying rare neurological diseases, while Adams was working to decode spy messages. Later on, Adams would specialize in building mathematical models to analyze speech and sound as a team leader in developing Amazon's Alexa.
It wasn't until 2015 that they decided to make use of the fit between their backgrounds. ““We established Canary Speech in 2017 to build a product that could be used in multiple languages in clinical environments,” O'Connell says.
The need is growing. About 55 million people worldwide currently live with Alzheimer’s, a number that is expected to double by 2050. Some scientists think the disease results from a buildup of plaque in the brain. It causes mild memory loss at first and, over time, this issue get worse while other symptoms, such as disorientation and hallucinations, can develop. Treatment to manage the disease is more effective in the earlier stages, but detection is difficult since mild symptoms are often attributed to the normal aging process.
O’Connell and Adams specialize in the complex ways that Alzheimer’s effects how people speak. Using AI, their mathematical model analyzes 15 million data points every minute, focusing on certain features of speech such as pitch, pauses and elongation of words. It also pays attention to how the vibrations of vocal cords change in different stages of the disease.
To create their model, the team used a type of machine learning called deep neural nets, which looks at multiple layers of data - in this case, the multiple features of a person’s speech patterns.
“Deep neural nets allow us to look at much, much larger data sets built out of millions of elements,” O’Connell explained. “Through machine learning and AI, we’ve identified features that are very sensitive to an Alzheimer’s patient versus [people without the disease] and also very sensitive to mild cognitive impairment, early stage and moderate Alzheimer's.” Based on their learnings, Canary is able to classify the disease stage very quickly, O’Connell said.
“When we’re listening to sublanguage elements, we’re really analyzing the direct result of changes in the brain in the physical body,” O’Connell said. “The brain controls your vocal cords: how fast they vibrate, the expansion of them, the contraction.” These factors, along with where people put their tongues when talking, function subconsciously and result in subtle changes in the sounds of speech.
Further testing is needed
In an initial trial, Canary analyzed speech recordings from phone calls to a large U.S. health insurer. They looked at the audio recordings of 651 policyholders who had early stage Alzheimer’s and 1018 who did not have the condition, aiming for a representative sample of age, gender and race. They used this data to create their first diagnostic model and found that it was 96 percent accurate in identifying Alzheimer’s.
Christian Herff, an assistant professor of neuroscience at Maastricht University in the Netherlands, praised this approach while adding that further testing is needed to assess its effectiveness.
“I think the general idea of identifying increased risk for cognitive impairment based on speech characteristics is very feasible, particularly when change in a user’s voice is monitored, for example, by recording speech every year,” Herff said. He noted that this can only be a first indication, not a full diagnosis. The accuracy still needs to be validated in studies that follows individuals over a period of time, he said.
Toby Walsh, a professor of artificial intelligence at the University of New South Wales, also thinks Canary’s tool has potential but highlights that Canary could diagnose some people who don’t really have the disease. “This is an interesting and promising application of AI,” he said, “but these tools need to be used carefully. Imagine the anxiety of being misdiagnosed with Alzheimer’s.”
As with many other AI tools, privacy and bias are additional issues to monitor closely, Walsh said.
Other languages
A related issue is that not everyone is fluent in English. Mahnaz Arvaneh, a senior lecturer in automatic control and systems engineering at the University of Sheffield, said this could be a blind spot.
“The system may not be very accurate for those who have English as their second language as their speaking patterns would be different, and any issue might be because of language deficiency rather than cognitive issues,” Arvaneh said.
The team is expanding to multiple languages starting with Japanese and Spanish. The elements of the model that make up the algorithm are very similar, but they need to be validated and retrained in a different language, which will require access to more data.
Recently, Canary analyzed the phone calls of 233 Japanese patients who had mild cognitive impairment and 704 healthy people. Using an English model they were able to identify the Japanese patients who had mild cognitive impairment with 78 percent accuracy. They also developed a model in Japanese that was 45 percent accurate, and they’re continuing to train it with more data.
The future
Canary is using their model to look at other diseases like Huntington’s and Parkinson’s. They’re also collaborating with pharmaceuticals to validate potential therapies for Alzheimer’s. By looking at speech patterns over time, Canary can get an indication of how well these drugs are working.
Dave Arnold and his wife dance at his nephew’s wedding in Rochester, New York, ten years ago, before his Alzheimer's diagnosis.
Dave Arnold
Ultimately, they want to integrate their tool into everyday life. “We want it to be used in a smartphone, or a teleconference call so that individuals could be examined in their home,” O’Connell said. “We could follow them over time and work with clinical teams and hospitals to improve the evaluation of patients and contribute towards an accurate diagnosis.”
Arnold, the patient with early stage Alzheimer’s, sees great promise. “The process of getting a diagnosis is already filled with so much anxiety,” he said. “Anything that can be done to make it easier and less stressful would be a good thing, as long as it’s proven accurate.”