Learning Cell-Aware Hierarchical Multi-Modal Representations for Robust Molecular Modeling
By: Mengran Li , Zelin Zang , Wenbin Xing and more
Potential Business Impact:
Predicts drug effects better by looking at cells.
Understanding how chemical perturbations propagate through biological systems is essential for robust molecular property prediction. While most existing methods focus on chemical structures alone, recent advances highlight the crucial role of cellular responses such as morphology and gene expression in shaping drug effects. However, current cell-aware approaches face two key limitations: (1) modality incompleteness in external biological data, and (2) insufficient modeling of hierarchical dependencies across molecular, cellular, and genomic levels. We propose CHMR (Cell-aware Hierarchical Multi-modal Representations), a robust framework that jointly models local-global dependencies between molecules and cellular responses and captures latent biological hierarchies via a novel tree-structured vector quantization module. Evaluated on nine public benchmarks spanning 728 tasks, CHMR outperforms state-of-the-art baselines, yielding average improvements of 3.6% on classification and 17.2% on regression tasks. These results demonstrate the advantage of hierarchy-aware, multimodal learning for reliable and biologically grounded molecular representations, offering a generalizable framework for integrative biomedical modeling. The code is in https://github.com/limengran98/CHMR.
Similar Papers
Multi-Modal Molecular Representation Learning via Structure Awareness
Machine Learning (CS)
Finds better medicines by understanding molecule shapes.
Hierarchical Molecular Language Models (HMLMs)
Molecular Networks
Helps AI understand how cells talk to each other.
Bidirectional Hierarchical Protein Multi-Modal Representation Learning
Machine Learning (CS)
Helps predict how proteins work by combining two views.