New AI Approach Optimizes Antibody Drugs

Proteins have evolved to excel at everything from contracting muscles to digesting food to recognizing viruses. To engineer better proteins, including antibodies, scientists often iteratively mutate the amino acids - the units that are arranged in a sequence to make up proteins - in different positions until the resulting protein has an improved function, like eliciting a stronger immune response or capturing carbon dioxide from the atmosphere more efficiently.

But there are more possible amino acid sequences than there are grains of sand in the world. And finding the best protein and, therefore, the best potential drug, is often expensive or impossible.

Stanford scientists have developed a new machine learning-based method to more quickly and accurately predict the molecular changes that will lead to better antibody drugs. Published in Science on July 4, the approach combines the 3D structure of the protein backbone with large language models based on amino acid sequence, and allows researchers to find, in minutes, rare and desirable mutations that would otherwise only be found with exhaustive experiments.

Led by Peter S. Kim, professor of biochemistry and institute scholar at Sarafan ChEM-H, and Brian Hie, assistant professor of chemical engineering, the team showed that they could improve a once FDA-approved SARS-CoV-2 antibody that had been discontinued due to its ineffectiveness against a new strain in November 2022. Their approach resulted in a 25-fold improvement against the virus.

"A lot of effort in AI and drug development is centered around amassing tons of data about how well a certain molecule performs a certain task so that a computer can learn enough to design a better version," said Kim. "What's remarkable is that we've shown that structure can be used in lieu of a lot of that data, and the computer will still learn."

"Now, more antibodies actually get a shot at being optimized," said Hie, who is also an innovation investigator at the Arc Institute.

When faced with the challenge of finding the best amino acid sequence, scientists will often make millions and test them in miniaturized, simplified versions of biological systems. They hope that the best drug in a dish will also be the best drug in humans.

"It's a lot of guess and check," said Hie. "The goal of a lot of intelligent algorithms is to remove the guesswork from this."

To speed up the process, scientists have developed ChatGPT-like machine learning algorithms that are trained on the amino acid sequences of millions of proteins to predict desirable mutations.

These models, however, often point scientists toward sequences that, once produced in the lab, are unstable or worse than where they started.

This is partially because protein function depends not only on the sequence of amino acids but also on the 3D structure of that sequence. For example, to trigger an immune response, antibodies must be the right shape to bind to molecules that sit atop the surface of viruses.

The key, the team thought, to developing a better prediction algorithm was structure. So, they constrained the long list of possible beneficial mutations - as determined by the sequence-based large language model - to only those that would preserve the 3D shape of the starting protein.

In December 2022, the team put it to the test on a recently discontinued SARS-CoV-2 antibody therapy.

"The prevailing theory was that trying to improve this antibody would fail," said Varun Shanker, a medical student, graduate student in biophysics, and lead author on the study. "The virus was too smart. It evolved as it spread through millions of people to know exactly how to mutate to avoid these antibodies."

Using purely sequence-based models to optimize the protein resulted in a modest twofold increase in effectiveness. But with their structure-guided approach, the team saw a 25-fold increase.

"We were finally catching up to the virus," said Shanker, who is also a fellow in the Chemistry/Biology Interface Training Program at Sarafan ChEM-H.

Most efforts in using AI to build better drugs rely on "training" or "supervising" the model, which involves generating huge amounts of data about the function and performance of unique protein sequences. This approach takes a lot of time, and results in a model tailored to specific protein performing a specific task.

This model does not require any input about what the protein does, how well it does it, or any lab experiments. Because structure is so closely tied to function, the protein's coordinates become a proxy for performance. For the COVID antibody work, they constrained the structure not just to the antibody itself, but to the antibody when it is bound to the virus. From there, their model “learned” some rules of antibody binding without ever needing to be taught.

Early experiments show that the approach is generalizable to other kinds of proteins, like enzymes, which help catalyze chemical reactions in our bodies. So far, the researchers have found that the model points scientists to tens of proteins, and, on average, half are better than the starting point.

This tool could be useful to quickly respond to emerging or evolving diseases. It also lowers the barrier to making more effective medicines. Stronger medicines mean lower doses are necessary, which means that a given quantity could benefit more patients. For infectious diseases like HIV, where studies have shown that large but infrequent doses of an antibody can protect patients from infection, this could be transformational.

The team is making their model and code freely available to anyone.

"This is an exciting example of the power of deep learning to democratize the process of building better proteins," said Shanker. "This not only allows people to develop new medicines, but also opens up new areas of scientific exploration that had been inaccessible."

Shanker VR, Bruun TUJ, Hie BL, Kim PS.
Unsupervised evolution of protein and antibody complexes with a structure-informed language model.
Science. 2024 Jul 5;385(6704):46-53. doi: 10.1126/science.adk8946

Most Popular Now

Clanwilliam Brings Epic Care to the UK

Care homes looking to digitise their administration and care procedures have a new option with the launch of Epic Care in the UK. Epic Care is a modular, scalable system developed...

AI Language Models Write Good Doctor…

Generative AI should be able to write usable doctor's letters and thus potentially speed up medical documentation, according to a study by the University Medical Center Freiburg. Around 93% of...

West Yorkshire and Harrogate Hospitals S…

Clinicians working at five of the six trusts in the West Yorkshire Association of Acute Trusts (WYAAT) can access test results from across their pathology network, following a summer roll-out...

ChatGPT Shows Human-Level Assessment of …

As artificial intelligence advances, its uses and capabilities in real-world applications continue to reach new heights that may even surpass human expertise. In the field of radiology, where a correct...

When Detecting Depression, the Eyes have…

It has been estimated that nearly 300 million people, or about 4% of the global population, are afflicted by some form of depression. But detecting it can be difficult, particularly...

When it comes to Emergency Care, ChatGPT…

If ChatGPT were cut loose in the Emergency Department, it might suggest unneeded x-rays and antibiotics for some patients and admit others who didn't require hospital treatment, a new study...

HWL 2024 Brings Together a Record Number…

1 - 2 October 2024, Luxembourg. The second edition of Healthcare Week Luxembourg on 1 and 2 October 2024, organised by the Federation of Luxembourg Hospitals (FHL), in partnership with the...

AI Drives Development of Cancer Fighting…

University of Houston researchers and their students are developing a new software technology, based on artificial intelligence, for advancing cell-based immunotherapy to treat cancer and other diseases. CellChorus...

MEDICA 2024 + COMPAMED 2024: Adapted Hal…

11 - 14 November 2024, Düsseldorf, Germany. The final preparations for MEDICA 2024 and COMPAMED 2024 in Düsseldorf have begun. A total of more than 5,500 exhibitors from approximately 70 countries...

Revolutionizing Cardiovascular Risk Asse…

A recent position paper in the Asia-Pacific Journal of Ophthalmology explores the transformative potential of artificial intelligence (AI) in ophthalmology. Led by Lama Al-Aswad, Professor of Ophthalmology and Irene Heinz...

AI does Not Necessarily Lead to more Eff…

The use of artificial intelligence (AI) in hospitals and patient care is steadily increasing. Especially in specialist areas with a high proportion of imaging, such as radiology, AI has long...

Why the NHS Needs a Transparency Revolut…

Opinion Article by Dr Mark Ratnarajah, NHS paediatrician and UK Managing Director for C2-Ai. Wes Streeting wanted 'no stone left unturned' when he asked Lord Darzi to examine the current state...