Score: 0

Rep3Net: An Approach Exploiting Multimodal Representation for Molecular Bioactivity Prediction

Published: November 29, 2025 | arXiv ID: 2512.00521v1

By: Sabrina Islam , Md. Atiqur Rahman , Md. Bakhtiar Hasan and more

Potential Business Impact:

Finds new medicines faster by predicting how they work.

Business Areas:
Bioinformatics Biotechnology, Data and Analytics, Science and Engineering

In early stage drug discovery, bioactivity prediction of molecules against target proteins plays a crucial role. Trdaitional QSAR models that utilizes molecular descriptor based data often struggles to predict bioactivity of molecules effectively due to its limitation in capturing structural and contextual information embedded within each compound. To address this challenge, we propose Rep3Net, a unified deep learning architecture that not only incorporates descriptor data but also includes spatial and relational information through graph-based represenation of compounds and contextual information through ChemBERTa generated embeddings from SMILES strings. Our model employing multimodal concatenated features produce reliable bioactivity prediction on Poly [ADP-ribose] polymerase 1 (PARP-1) dataset. PARP-1 is a crucial agent in DNA damage repair and has become a significant theraputic target in malignancies that depend on it for survival and growth. A comprehensive analysis and comparison with conventional standalone models including GCN, GAT, XGBoost, etc. demonstrates that our architecture achieves the highest predictive performance. In computational screening of compounds in drug discovery, our architecture provides a scalable framework for bioactivity prediction.

Page Count
11 pages

Category
Computer Science:
Machine Learning (CS)