DMol: A Schedule-Driven Diffusion Model for Highly Efficient and Versatile Molecule Generation
By: Peizhi Niu , Yu-Hsiang Wang , Vishal Rana and more
Potential Business Impact:
Creates new medicines faster and better.
We introduce a new graph diffusion model for small molecule generation, \emph{DMol}, which outperforms the state-of-the-art DiGress model in terms of validity by roughly $1.5\%$ across all benchmarking datasets while reducing the number of diffusion steps by at least $10$-fold, and the running time to roughly one half. The performance improvements are a result of a careful change in the objective function and a ``graph noise" scheduling approach which, at each diffusion step, allows one to only change a subset of nodes of varying size in the molecule graph. Another relevant property of the method is that it can be easily combined with junction-tree-like graph representations that arise by compressing a collection of relevant ring structures into supernodes. Unlike classical junction-tree techniques that involve VAEs and require complicated reconstruction steps, compressed DMol directly performs graph diffusion on a graph that compresses only a carefully selected set of frequent carbon rings into supernodes, which results in straightforward sample generation. This compressed DMol method offers additional validity improvements over generic DMol of roughly $2\%$, increases the novelty of the method, and further improves the running time due to reductions in the graph size.
Similar Papers
Straight-Line Diffusion Model for Efficient 3D Molecular Generation
Machine Learning (CS)
Makes computers design new molecules 100 times faster.
Graph Diffusion that can Insert and Delete
Machine Learning (CS)
Designs new molecules by adding or removing atoms.
Peptide2Mol: A Diffusion Model for Generating Small Molecules as Peptide Mimics for Targeted Protein Binding
Machine Learning (CS)
Creates better medicines by copying nature.