Score: 2

OMTRA: A Multi-Task Generative Model for Structure-Based Drug Design

Published: December 4, 2025 | arXiv ID: 2512.05080v1

By: Ian Dunn , Liv Toft , Tyler Katz and more

Potential Business Impact:

Finds new medicines by building molecules.

Business Areas:
Bioinformatics Biotechnology, Data and Analytics, Science and Engineering

Structure-based drug design (SBDD) focuses on designing small-molecule ligands that bind to specific protein pockets. Computational methods are integral in modern SBDD workflows and often make use of virtual screening methods via docking or pharmacophore search. Modern generative modeling approaches have focused on improving novel ligand discovery by enabling de novo design. In this work, we recognize that these tasks share a common structure and can therefore be represented as different instantiations of a consistent generative modeling framework. We propose a unified approach in OMTRA, a multi-modal flow matching model that flexibly performs many tasks relevant to SBDD, including some with no analogue in conventional workflows. Additionally, we curate a dataset of 500M 3D molecular conformers, complementing protein-ligand data and expanding the chemical diversity available for training. OMTRA obtains state of the art performance on pocket-conditioned de novo design and docking; however, the effects of large-scale pretraining and multi-task training are modest. All code, trained models, and dataset for reproducing this work are available at https://github.com/gnina/OMTRA

Repos / Data Links

Page Count
29 pages

Category
Computer Science:
Machine Learning (CS)