Rep3Net: An Approach Exploiting Multimodal Representation for Molecular Bioactivity Prediction
By: Sabrina Islam, Md. Atiqur Rahman, Md. Bakhtiar Hasan and more
Potential Business Impact:
Finds new medicines faster by predicting how they work.
In early-stage drug discovery, predicting the bioactivity of molecules against target proteins plays a crucial role. Traditional QSAR models that rely on molecular descriptor data often struggle to predict bioactivity effectively, because descriptors alone capture little of the structural and contextual information embedded within each compound. To address this challenge, we propose Rep3Net, a unified deep learning architecture that incorporates not only descriptor data but also spatial and relational information, through graph-based representations of compounds, and contextual information, through ChemBERTa-generated embeddings of SMILES strings. Using the concatenated multimodal features, our model produces reliable bioactivity predictions on a Poly [ADP-ribose] polymerase 1 (PARP-1) dataset. PARP-1 is a crucial agent in DNA damage repair and has become a significant therapeutic target in malignancies that depend on it for survival and growth. A comprehensive analysis and comparison with conventional standalone models, including GCN, GAT, XGBoost, and others, demonstrates that our architecture achieves the highest predictive performance among the models compared. For computational screening of compounds in drug discovery, our architecture provides a scalable framework for bioactivity prediction.
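The abstract gives no implementation details, so the following is only a minimal sketch of what fusing three representations by concatenation could look like. The function names, the toy vector sizes, and the linear prediction head are all hypothetical stand-ins, not the Rep3Net architecture itself; in practice the three inputs would come from a descriptor calculator, a graph encoder, and ChemBERTa respectively.

```python
import random


def concat_features(descriptors, graph_embedding, text_embedding):
    """Fuse three per-molecule feature vectors by simple concatenation."""
    return list(descriptors) + list(graph_embedding) + list(text_embedding)


def make_linear_head(input_dim, seed=0):
    """Return randomly initialised (weights, bias) for a toy linear head.

    Hypothetical stand-in for a trained prediction network.
    """
    rng = random.Random(seed)
    weights = [rng.uniform(-0.1, 0.1) for _ in range(input_dim)]
    return weights, 0.0


def predict(features, head):
    """Compute a scalar score as the dot product of weights and features."""
    weights, bias = head
    if len(features) != len(weights):
        raise ValueError("feature length does not match head input size")
    return sum(w * x for w, x in zip(weights, features)) + bias


# Toy inputs (assumed values, for illustration only).
descriptors = [0.3, 1.2, -0.7]            # stands in for molecular descriptors
graph_embedding = [0.1, 0.4]              # stands in for a pooled graph-encoder output
text_embedding = [0.9, -0.2, 0.5, 0.0]    # stands in for a ChemBERTa SMILES embedding

fused = concat_features(descriptors, graph_embedding, text_embedding)
head = make_linear_head(len(fused))
score = predict(fused, head)
```

Concatenation keeps each modality's features intact and leaves it to the downstream head to learn how to weight them, which is one common reason to choose it over averaging or summing embeddings of different sizes.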
Similar Papers
PolyRecommender: A Multimodal Recommendation System for Polymer Discovery
Machine Learning (CS)
Finds new materials for better batteries and plastics.
Learning Cell-Aware Hierarchical Multi-Modal Representations for Robust Molecular Modeling
Machine Learning (CS)
Predicts drug effects better by looking at cells.
Fine-Tuning ChemBERTa for Predicting Inhibitory Activity Against TDP1 Using Deep Learning
Machine Learning (CS)
Finds cancer-fighting drugs faster.