AI Matches Protein Interaction Partners

Proteins are the building blocks of life, involved in virtually every biological process. Understanding how proteins interact with each other is crucial for deciphering the complexities of cellular functions, and has significant implications for drug development and the treatment of diseases.

However, predicting which proteins bind together has been a challenging aspect of computational biology, primarily due to the vast diversity and complexity of protein structures. But a new study from the group of Ann-Florence Bitbol at EPFL might now change all that.

The team of scientists, including Umberto Lupo, Damiano Sgarbossa and Bitbol, has developed DiffPALM (Differentiable Pairing using Alignment-based Language Models), an AI-based approach that can significantly advance the prediction of interacting protein sequences. The study is published in PNAS.

DiffPALM leverages the power of protein language models, an advanced machine learning concept borrowed from natural language processing, to analyze and predict protein interactions among the members of two protein families with unprecedented accuracy. It uses these machine learning techniques to predict interacting protein pairs. This leads to a significant improvement over other methods that often require large, diverse datasets, and struggle with the complexity of eukaryotic protein complexes.

Another advantage of DiffPALM is its versatility, as it can work even with smaller sequence datasets and thus address rare proteins that have few homologs – proteins of different species that share common evolutionary ancestry. It relies on protein language models trained on multiple sequence alignments (MSAs), such as the MSA Transformer and AlphaFold's EvoFormer module, which allows it to understand and predict the complex interactions between proteins with a high degree of accuracy. Even more, using DiffPALM shows high promise when it comes to predicting the structure of protein complexes, which are intricate structures formed by the binding of multiple proteins, and are essential for many of the cell’s processes.

In the study, the team compared DiffPALM with traditional coevolution-based pairing methods, which study how protein sequences evolve together over time when they interact closely – changes in one protein can lead to changes in its interacting partner. This is an extremely important aspect of molecular and cell biology, which is well-captured by protein language models trained on MSAs. DiffPALM is shown to outperform traditional methods Top of Formon challenging benchmarks, demonstrating its robustness and efficiency.

The application of DiffPALM is obvious in the field of basic protein biology, but extends beyond it, as it has the potential to become a powerful tool in medical research and drug development. For instance, accurately predicting protein interactions can help understand disease mechanisms and develop targeted therapies.

The researchers have made DiffPALM freely available, hoping that the scientific community adopts it widely to further advancements in computational biology and enable researchers to explore the complexities of protein interactions.

By combining advanced machine learning techniques and efficient handling of complex biological data, DiffPALM marks a significant leap forward in computational biology. It not only enhances our understanding of protein interactions but also opens up new avenues in medical research, potentially leading to breakthroughs in disease treatment and drug development.

Umberto Lupo, Damiano Sgarbossa, Anne-Florence Bitbol.
Pairing interacting protein sequences using masked language modeling.
PNAS 24 June 2024. doi: 10.1073/pnas.2311887121

Most Popular Now

Bayer and Samsung Take Action Against Sl…

Bayer AG today announced a strategic collaboration with Samsung Electronics America, Inc. to address data gaps on sleep disturbances associated with menopause (SDM). The companies will co-develop an observational study...

New AI Algorithm Detects Rare Epileptic …

More than 3.4 million people in the US and 65 million people worldwide have epilepsy, a neurological disorder that affects the nervous system and causes seizures. One in 26 people...

AI detects more breast cancers with fewe…

Using artificial intelligence (AI), breast radiologists in Denmark have improved breast cancer screening performance and reduced the rate of false-positive findings. Results of the study were published today in Radiology...

Transforming Drug Discovery with AI

A new AI-powered program will allow researchers to level up their drug discovery efforts. The program, called TopoFormer, was developed by an interdisciplinary team led by Guowei Wei, a Michigan...

We may Soon be Able to Detect Cancer wit…

A new paper in Biology Methods & Protocols, published by Oxford University Press, indicates that it may soon be possible for doctors to use artificial intelligence (AI) to detect and...

Maternity Tech Launched to Help NHS Meas…

Health tech provider C2-Ai has formally launched a new 'observatory' system to help hospitals gain a better understanding of risks, outcomes and safety within maternity and neonatal services. Announced at the...

With New Omega Tool, Scientists can Rapi…

In a new research article, scientists at Chan Zuckerberg Biohub San Francisco (CZ Biohub SF) describe Omega, an open-source software tool that significantly advances the field of bioimage analysis. Omega...

Large Language Models Illuminate a Progr…

This study is led by Prof. Bin Dong (Beijing International Center for Mathematical Research, Peking University) and Prof. Lin Shen (Department of Gastrointestinal Oncology, Key Laboratory of Carcinogenesis and Translational...

An AI-Powered Wearable System Tracks the…

Scientists at the University of Southern California have developed an artificial intelligence (AI)-powered system to track tiny devices that monitor markers of disease in the gut. Devices using the novel...

Health Innovation East Partners with Cog…

Health Innovation East, the innovation arm of the NHS in the East of England and Cogniss, a no-code ecosystem for digital health solutions, have announced a strategic partnership to launch...

"Self-Taught" AI Tool Helps to…

A computer program based on data from nearly a half-million tissue images and powered by artificial intelligence (AI) can accurately diagnose cases of adenocarcinoma, the most common form of lung...

New AI Tool Finds Rare Variants Linked t…

Using an advanced artificial intelligence (AI) tool, researchers at the Icahn School of Medicine at Mount Sinai have identified rare coding variants in 17 genes that shed light on the...