Score: 0

Protein Inverse Folding From Structure Feedback

Published: June 3, 2025 | arXiv ID: 2506.03028v1

By: Junde Xu , Zijun Gao , Xinyi Zhou and more

Potential Business Impact:

Designs proteins that fold into specific shapes.

Business Areas:

Bioinformatics Biotechnology, Data and Analytics, Science and Engineering

The inverse folding problem, aiming to design amino acid sequences that fold into desired three-dimensional structures, is pivotal for various biotechnological applications. Here, we introduce a novel approach leveraging Direct Preference Optimization (DPO) to fine-tune an inverse folding model using feedback from a protein folding model. Given a target protein structure, we begin by sampling candidate sequences from the inverse-folding model, then predict the three-dimensional structure of each sequence with the folding model to generate pairwise structural-preference labels. These labels are used to fine-tune the inverse-folding model under the DPO objective. Our results on the CATH 4.2 test set demonstrate that DPO fine-tuning not only improves sequence recovery of baseline models but also leads to a significant improvement in average TM-Score from 0.77 to 0.81, indicating enhanced structure similarity. Furthermore, iterative application of our DPO-based method on challenging protein structures yields substantial gains, with an average TM-Score increase of 79.5\% with regard to the baseline model. This work establishes a promising direction for enhancing protein sequence design ability from structure feedback by effectively utilizing preference optimization.

EnerBridge-DPO: Energy-Guided Protein Inverse Folding with Markov Bridges and Direct Preference Optimization

Machine Learning (CS)

Designs stronger, more stable body-building blocks.

11 Jun 2025 1

91%

Improving Protein Sequence Design through Designability Preference Optimization

Machine Learning (CS)

Designs proteins that reliably fold into shapes.

30 May 2025 3

89%

Zero-shot protein stability prediction by inverse folding models: a free energy interpretation

Machine Learning (CS)

Improves protein stability predictions for new medicines.

5 Jun 2025 0

View PDF Login to Bookmark

Page Count

18 pages

Protein Inverse Folding From Structure Feedback

Designs proteins that fold into specific shapes.

Technical Abstract

EnerBridge-DPO: Energy-Guided Protein Inverse Folding with Markov Bridges and Direct Preference Optimization

Improving Protein Sequence Design through Designability Preference Optimization

Zero-shot protein stability prediction by inverse folding models: a free energy interpretation