Contrastive Multi-Task Learning with Solvent-Aware Augmentation for Drug Discovery
By: Jing Lan , Hexiao Ding , Hongzhao Chen and more
Potential Business Impact:
Predicts drug-protein bonds more accurately in liquids
Accurate prediction of protein-ligand interactions is essential for computer-aided drug discovery. However, existing methods often fail to capture solvent-dependent conformational changes and lack the ability to jointly learn multiple related tasks. To address these limitations, we introduce a pre-training method that incorporates ligand conformational ensembles generated under diverse solvent conditions as augmented input. This design enables the model to learn both structural flexibility and environmental context in a unified manner. The training process integrates molecular reconstruction to capture local geometry, interatomic distance prediction to model spatial relationships, and contrastive learning to build solvent-invariant molecular representations. Together, these components lead to significant improvements, including a 3.7% gain in binding affinity prediction, an 82% success rate on the PoseBusters Astex docking benchmarks, and an area under the curve of 97.1% in virtual screening. The framework supports solvent-aware, multi-task modeling and produces consistent results across benchmarks. A case study further demonstrates sub-angstrom docking accuracy with a root-mean-square deviation of 0.157 angstroms, offering atomic-level insight into binding mechanisms and advancing structure-based drug design.
Similar Papers
Structure-Aware Contrastive Learning with Fine-Grained Binding Representations for Drug Discovery
Machine Learning (CS)
Finds new medicines faster by looking at how they fit.
S$^2$Drug: Bridging Protein Sequence and 3D Structure in Contrastive Representation Learning for Virtual Screening
Machine Learning (CS)
Finds new medicines faster using protein clues.
AANet: Virtual Screening under Structural Uncertainty via Alignment and Aggregation
Machine Learning (CS)
Finds new medicines even with missing puzzle pieces.