Score: 0

Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry

Published: November 24, 2025 | arXiv ID: 2511.19264v1

By: Amirtha Varshini A S, Duminda S. Ranasinghe, Hok Hei Tam

Potential Business Impact:

Shows how computers design new medicines.

Business Areas:
Biopharma Biotechnology, Health Care, Science and Engineering

Generative Flow Networks, or GFlowNets, offer a promising framework for molecular design, but their internal decision policies remain opaque. This limits adoption in drug discovery, where chemists require clear and interpretable rationales for proposed structures. We present an interpretability framework for SynFlowNet, a GFlowNet trained on documented chemical reactions and purchasable starting materials that generates both molecules and the synthetic routes that produce them. Our approach integrates three complementary components. Gradient based saliency combined with counterfactual perturbations identifies which atomic environments influence reward and how structural edits change molecular outcomes. Sparse autoencoders reveal axis aligned latent factors that correspond to physicochemical properties such as polarity, lipophilicity, and molecular size. Motif probes show that functional groups including aromatic rings and halogens are explicitly encoded and linearly decodable from the internal embeddings. Together, these results expose the chemical logic inside SynFlowNet and provide actionable and mechanistic insight that supports transparent and controllable molecular design.

Page Count
13 pages

Category
Computer Science:
Machine Learning (CS)