LLM-Augmented Chemical Synthesis and Design Decision Programs
By: Haorui Wang, Jeff Guo, Lingkai Kong and more
Potential Business Impact:
Computers plan how to build new medicines faster.
Retrosynthesis, the process of breaking down a target molecule into simpler precursors through a series of valid reactions, stands at the core of organic chemistry and drug development. Although recent machine learning (ML) research has advanced single-step retrosynthetic modeling and subsequent route searches, these solutions remain restricted by the extensive combinatorial space of possible pathways. Concurrently, large language models (LLMs) have exhibited remarkable chemical knowledge, hinting at their potential to tackle complex decision-making tasks in chemistry. In this work, we explore whether LLMs can successfully navigate the highly constrained, multi-step retrosynthesis planning problem. We introduce an efficient scheme for encoding reaction pathways and present a new route-level search strategy, moving beyond the conventional step-by-step reactant prediction. Through comprehensive evaluations, we show that our LLM-augmented approach excels at retrosynthesis planning and extends naturally to the broader challenge of synthesizable molecular design.
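To make the definition above concrete, here is a minimal sketch of a multi-step route as a data structure, with a check that every branch ends in a purchasable starting material. It is not the encoding scheme or search strategy from the paper, which the abstract does not detail. The names `Step` and `unsolved_leaves` and the tiny building-block set are hypothetical, and molecules are compared as plain SMILES strings on the assumption that they were canonicalized beforehand.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    """One retrosynthetic disconnection (hypothetical illustration type)."""
    product: str                 # SMILES of the molecule being broken down
    reactants: tuple[str, ...]   # SMILES of the precursors it is made from


def unsolved_leaves(target, route, building_blocks):
    """Return molecules that are neither purchasable nor expanded by the route.

    An empty list means the route reduces the target entirely to building
    blocks. Matching is by exact string, so SMILES are assumed canonical.
    """
    by_product = {step.product: step for step in route}
    frontier = [target]
    seen = set()
    missing = []
    while frontier:
        mol = frontier.pop()
        if mol in seen:
            continue  # guard against revisiting shared or cyclic intermediates
        seen.add(mol)
        if mol in building_blocks:
            continue  # purchasable: this branch is finished
        step = by_product.get(mol)
        if step is None:
            missing.append(mol)  # dead end: no step produces this molecule
        else:
            frontier.extend(step.reactants)
    return sorted(missing)


# Illustrative two-step route: aspirin <- salicylic acid <- phenol + CO2.
ASPIRIN = "CC(=O)Oc1ccccc1C(=O)O"
SALICYLIC_ACID = "O=C(O)c1ccccc1O"
ACETIC_ANHYDRIDE = "CC(=O)OC(C)=O"
PHENOL = "Oc1ccccc1"
CO2 = "O=C=O"

route = [
    Step(ASPIRIN, (SALICYLIC_ACID, ACETIC_ANHYDRIDE)),
    Step(SALICYLIC_ACID, (PHENOL, CO2)),
]
building_blocks = {ACETIC_ANHYDRIDE, PHENOL, CO2}  # assumed purchasable set

print(unsolved_leaves(ASPIRIN, route, building_blocks))
```

A route-level method proposes or edits a whole list of steps like `route` at once, whereas a step-by-step search grows it one disconnection at a time. Either way, a check of this kind decides whether the plan is complete.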