Contrastive Geometric Learning Unlocks Unified Structure- and Ligand-Based Drug Design
By: Lisa Schneckenreiter , Sohvi Luukkonen , Lukas Friedrich and more
Structure-based and ligand-based computational drug design have traditionally relied on disjoint data sources and modeling assumptions, limiting their joint use at scale. In this work, we introduce Contrastive Geometric Learning for Unified Computational Drug Design (ConGLUDe), a single contrastive geometric model that unifies structure- and ligand-based training. ConGLUDe couples a geometric protein encoder that produces whole-protein representations and implicit embeddings of predicted binding sites with a fast ligand encoder, removing the need for pre-defined pockets. By aligning ligands with both global protein representations and multiple candidate binding sites through contrastive learning, ConGLUDe supports ligand-conditioned pocket prediction in addition to virtual screening and target fishing, while being trained jointly on protein-ligand complexes and large-scale bioactivity data. Across diverse benchmarks, ConGLUDe achieves state-of-the-art zero-shot virtual screening performance in settings where no binding pocket information is provided as input, substantially outperforms existing methods on a challenging target fishing task, and demonstrates competitive ligand-conditioned pocket selection. These results highlight the advantages of unified structure-ligand training and position ConGLUDe as a step toward general-purpose foundation models for drug discovery.
Similar Papers
A Geometric Graph-Based Deep Learning Model for Drug-Target Affinity Prediction
Biomolecules
Finds best medicines for diseases faster.
Contrastive Multi-Task Learning with Solvent-Aware Augmentation for Drug Discovery
Biomolecules
Predicts drug-protein bonds more accurately in liquids
Structure-Aware Contrastive Learning with Fine-Grained Binding Representations for Drug Discovery
Machine Learning (CS)
Finds new medicines faster by looking at how they fit.