Score: 1

RiboGen: RNA Sequence and Structure Co-Generation with Equivariant MultiFlow

Published: March 3, 2025 | arXiv ID: 2503.02058v4

By: Dana Rubin , Allan dos Santos Costa , Manvitha Ponnapati and more

BigTech Affiliations: Massachusetts Institute of Technology

Potential Business Impact:

Designs new RNA molecules with their 3D shapes.

Business Areas:
Bioinformatics Biotechnology, Data and Analytics, Science and Engineering

Ribonucleic acid (RNA) plays fundamental roles in biological systems, from carrying genetic information to performing enzymatic function. Understanding and designing RNA can enable novel therapeutic application and biotechnological innovation. To enhance RNA design, in this paper we introduce RiboGen, the first deep learning model to simultaneously generate RNA sequence and all-atom 3D structure. RiboGen leverages the standard Flow Matching with Discrete Flow Matching in a multimodal data representation. RiboGen is based on Euclidean Equivariant neural networks for efficiently processing and learning three-dimensional geometry. Our experiments show that RiboGen can efficiently generate chemically plausible and self-consistent RNA samples, suggesting that co-generation of sequence and structure is a competitive approach for modeling RNA.

Country of Origin
🇺🇸 United States

Page Count
9 pages

Category
Quantitative Biology:
Biomolecules