Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry
By: Amirtha Varshini A S, Duminda S. Ranasinghe, Hok Hei Tam
Potential Business Impact:
Shows how computers design new medicines.
Generative Flow Networks, or GFlowNets, offer a promising framework for molecular design, but their internal decision policies remain opaque. This limits adoption in drug discovery, where chemists require clear and interpretable rationales for proposed structures. We present an interpretability framework for SynFlowNet, a GFlowNet trained on documented chemical reactions and purchasable starting materials that generates both molecules and the synthetic routes that produce them. Our approach integrates three complementary components. Gradient based saliency combined with counterfactual perturbations identifies which atomic environments influence reward and how structural edits change molecular outcomes. Sparse autoencoders reveal axis aligned latent factors that correspond to physicochemical properties such as polarity, lipophilicity, and molecular size. Motif probes show that functional groups including aromatic rings and halogens are explicitly encoded and linearly decodable from the internal embeddings. Together, these results expose the chemical logic inside SynFlowNet and provide actionable and mechanistic insight that supports transparent and controllable molecular design.
Similar Papers
Boosted GFlowNets: Improving Exploration via Sequential Learning
Machine Learning (CS)
Finds rare, valuable things by exploring better.
Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation
Machine Learning (CS)
Finds new medicines by building molecules atom by atom.
GFlowNets for Learning Better Drug-Drug Interaction Representations
Machine Learning (CS)
Finds dangerous medicine mixes doctors miss.