GraphCliff: Short-Long Range Gating for Subtle Differences but Critical Changes
By: Hajung Kim, Jueon Park, Junseok Choe and more
Potential Business Impact:
Helps computers tell similar drugs apart.
Quantitative structure-activity relationship (QSAR) modeling assumes a smooth relationship between molecular structure and biological activity. However, activity cliffs, defined as pairs of structurally similar compounds with large potency differences, break this continuity. Recent benchmarks targeting activity cliffs have revealed that classical machine learning models using extended-connectivity fingerprints outperform graph neural networks. Our analysis shows that graph embeddings fail to adequately separate structurally similar molecules in the embedding space, which makes it difficult to distinguish molecules that are structurally similar but functionally different. Despite this limitation, molecular graphs remain an expressive and attractive representation because they preserve molecular topology. To retain the graph representation of molecules, we propose a new model, GraphCliff, which integrates short- and long-range information through a gating mechanism. Experimental results demonstrate that GraphCliff consistently improves performance on both non-cliff and cliff compounds. Furthermore, layer-wise node embedding analyses reveal reduced over-smoothing and enhanced discriminative power relative to strong baseline graph models.
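The activity-cliff definition in the abstract (structurally similar pairs with large potency differences) can be made concrete with a minimal sketch. This is an illustration only, not the authors' pipeline: fingerprints are modeled as plain sets of "on" bit indices, similarity is Tanimoto, and the thresholds (similarity of at least 0.9, potency ratio of at least 10-fold) are assumed values chosen for the example. The function names and the toy compounds are hypothetical.

```python
from itertools import combinations


def tanimoto(a: frozenset, b: frozenset) -> float:
    """Tanimoto (Jaccard) similarity between two sets of fingerprint bits."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def find_activity_cliffs(compounds, sim_threshold=0.9, fold_threshold=10.0):
    """Return name pairs that are structurally similar yet differ in potency.

    compounds maps a name to (fingerprint bit set, potency), where potency
    is a positive concentration such as Ki in nM. Both thresholds are
    illustrative assumptions, not values taken from the paper.
    """
    cliffs = []
    for (n1, (fp1, p1)), (n2, (fp2, p2)) in combinations(compounds.items(), 2):
        similarity = tanimoto(fp1, fp2)
        fold_change = max(p1, p2) / min(p1, p2)
        if similarity >= sim_threshold and fold_change >= fold_threshold:
            cliffs.append((n1, n2))
    return cliffs


# Hypothetical toy data: bit sets stand in for real fingerprints.
compounds = {
    "mol_a": (frozenset(range(0, 20)), 5.0),     # reference compound
    "mol_b": (frozenset(range(0, 19)), 800.0),   # near-identical bits, much weaker
    "mol_c": (frozenset(range(1, 21)), 6.0),     # similar bits, similar potency
    "mol_d": (frozenset(range(40, 60)), 900.0),  # unrelated structure
}

cliff_pairs = find_activity_cliffs(compounds)
print(cliff_pairs)
```

Only the pair (mol_a, mol_b) satisfies both conditions here: mol_c is similar to mol_a but has nearly the same potency, and mol_d differs in potency but shares no bits. In practice the bit sets would come from a cheminformatics toolkit rather than hand-written ranges.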
Similar Papers
A Semi-supervised Molecular Learning Framework for Activity Cliff Estimation
Computational Engineering, Finance, and Science
Finds better medicines faster, even with little data.
Structure-Aware Compound-Protein Affinity Prediction via Graph Neural Network with Group Lasso Regularization
Machine Learning (CS)
Finds better drug parts by seeing tiny changes.