Score: 1

Task-Specific Sparse Feature Masks for Molecular Toxicity Prediction with Chemical Language Models

Published: December 12, 2025 | arXiv ID: 2512.11412v1

By: Kwun Sy Lee , Jiawei Chen , Fuk Sheng Ford Chung and more

Potential Business Impact:

Shows drug parts that make them safe or unsafe.

Business Areas:
Bioinformatics Biotechnology, Data and Analytics, Science and Engineering

Reliable in silico molecular toxicity prediction is a cornerstone of modern drug discovery, offering a scalable alternative to experimental screening. However, the black-box nature of state-of-the-art models remains a significant barrier to adoption, as high-stakes safety decisions demand verifiable structural insights alongside predictive performance. To address this, we propose a novel multi-task learning (MTL) framework designed to jointly enhance accuracy and interpretability. Our architecture integrates a shared chemical language model with task-specific attention modules. By imposing an L1 sparsity penalty on these modules, the framework is constrained to focus on a minimal set of salient molecular fragments for each distinct toxicity endpoint. The resulting framework is trained end-to-end and is readily adaptable to various transformer-based backbones. Evaluated on the ClinTox, SIDER, and Tox21 benchmark datasets, our approach consistently outperforms both single-task and standard MTL baselines. Crucially, the sparse attention weights provide chemically intuitive visualizations that reveal the specific fragments influencing predictions, thereby enhancing insight into the model's decision-making process.

Country of Origin
🇭🇰 Hong Kong

Page Count
6 pages

Category
Computer Science:
Computational Engineering, Finance, and Science