Task-Specific Sparse Feature Masks for Molecular Toxicity Prediction with Chemical Language Models
By: Kwun Sy Lee , Jiawei Chen , Fuk Sheng Ford Chung and more
Potential Business Impact:
Shows drug parts that make them safe or unsafe.
Reliable in silico molecular toxicity prediction is a cornerstone of modern drug discovery, offering a scalable alternative to experimental screening. However, the black-box nature of state-of-the-art models remains a significant barrier to adoption, as high-stakes safety decisions demand verifiable structural insights alongside predictive performance. To address this, we propose a novel multi-task learning (MTL) framework designed to jointly enhance accuracy and interpretability. Our architecture integrates a shared chemical language model with task-specific attention modules. By imposing an L1 sparsity penalty on these modules, the framework is constrained to focus on a minimal set of salient molecular fragments for each distinct toxicity endpoint. The resulting framework is trained end-to-end and is readily adaptable to various transformer-based backbones. Evaluated on the ClinTox, SIDER, and Tox21 benchmark datasets, our approach consistently outperforms both single-task and standard MTL baselines. Crucially, the sparse attention weights provide chemically intuitive visualizations that reveal the specific fragments influencing predictions, thereby enhancing insight into the model's decision-making process.
Similar Papers
Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification?
Artificial Intelligence
Fixes drug molecules to make them safer.
CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction
Machine Learning (CS)
Finds drug dangers before testing on people.
Quantum-Enhanced Multi-Task Learning with Learnable Weighting for Pharmacokinetic and Toxicity Prediction
Machine Learning (CS)
Finds safer medicines faster using quantum tricks.