Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning
By: Yijun Yang , Zhao-Yang Wang , Qiuping Liu and more
Potential Business Impact:
Helps doctors pick best cancer treatments by predicting results.
Providing effective treatment and making informed clinical decisions are essential goals of modern medicine and clinical care. We are interested in simulating disease dynamics for clinical decision-making, leveraging recent advances in large generative models. To this end, we introduce the Medical World Model (MeWM), the first world model in medicine that visually predicts future disease states based on clinical decisions. MeWM comprises (i) vision-language models to serve as policy models, and (ii) tumor generative models as dynamics models. The policy model generates action plans, such as clinical treatments, while the dynamics model simulates tumor progression or regression under given treatment conditions. Building on this, we propose the inverse dynamics model that applies survival analysis to the simulated post-treatment tumor, enabling the evaluation of treatment efficacy and the selection of the optimal clinical action plan. As a result, the proposed MeWM simulates disease dynamics by synthesizing post-treatment tumors, with state-of-the-art specificity in Turing tests evaluated by radiologists. Simultaneously, its inverse dynamics model outperforms medical-specialized GPTs in optimizing individualized treatment protocols across all metrics. Notably, MeWM improves clinical decision-making for interventional physicians, boosting F1-score in selecting the optimal TACE protocol by 13%, paving the way for future integration of medical world models as the second readers.
Similar Papers
Beyond Generative AI: World Models for Clinical Prediction, Counterfactuals, and Planning
Machine Learning (CS)
Helps doctors predict patient health and plan treatments.
Surgical Vision World Model
Image and Video Processing
Trains robots to perform surgery using videos.
Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation
CV and Pattern Recognition
Helps doctors understand patient health from pictures and words.