Developing a PET/CT Foundation Model for Cross-Modal Anatomical and Functional Imaging

Published: March 4, 2025 | arXiv ID: 2503.02824v1

By: Yujin Oh, Robert Seifert, Yihan Cao, and more

Potential Business Impact:

Could help clinicians detect, stage, and monitor cancer more quickly and accurately from PET/CT scans.

Business Areas:
Image Recognition, Data and Analytics, Software

In oncology, Positron Emission Tomography-Computed Tomography (PET/CT) is widely used in cancer diagnosis, staging, and treatment monitoring, as it combines anatomical detail from CT with functional information on metabolic activity and molecular marker expression from PET. However, existing artificial intelligence-driven PET/CT analyses rely predominantly on task-specific models trained from scratch or on limited datasets, which restricts their generalizability and robustness. To address this, we propose a foundation model approach specifically designed for multimodal PET/CT imaging. We introduce the Cross-Fraternal Twin Masked Autoencoder (FratMAE), a novel framework that effectively integrates whole-body anatomical and functional or molecular information. FratMAE employs separate Vision Transformer (ViT) encoders for PET and CT scans, along with cross-attention decoders that enable synergistic interactions between modalities during masked autoencoder training. Additionally, it incorporates textual metadata to enhance PET representation learning. By pre-training on PET/CT datasets, FratMAE captures intricate cross-modal relationships and global uptake patterns, achieving superior performance on downstream tasks and demonstrating its potential as a generalizable foundation model.
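The abstract describes the architecture only at a high level. As an illustration, the following is a minimal PyTorch sketch of how separate per-modality ViT-style encoders and a cross-attention masked-autoencoder decoder could fit together. All module names, dimensions, and the masking scheme are assumptions made for this sketch, not the authors' implementation; the textual-metadata component of FratMAE is omitted entirely.

```python
# Hypothetical sketch of the FratMAE idea: separate ViT-style encoders
# for PET and CT, and a decoder for one modality that cross-attends to
# the encoded tokens of the other during masked autoencoder training.
import torch
import torch.nn as nn


class PatchEncoder(nn.Module):
    """Per-modality transformer encoder over flattened image patches."""

    def __init__(self, patch_dim=512, dim=256, depth=4, heads=8):
        super().__init__()
        self.proj = nn.Linear(patch_dim, dim)
        layer = nn.TransformerEncoderLayer(dim, heads, dim * 4, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, depth)

    def forward(self, patches):                  # (B, N_visible, patch_dim)
        return self.blocks(self.proj(patches))   # (B, N_visible, dim)


class CrossAttentionDecoder(nn.Module):
    """Reconstructs patches of one modality while attending to the
    other modality's encoded tokens (one decoder block, for brevity)."""

    def __init__(self, dim=256, heads=8, patch_dim=512):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm1 = nn.LayerNorm(dim)
        self.norm2 = nn.LayerNorm(dim)
        self.head = nn.Linear(dim, patch_dim)    # predict raw patch values

    def forward(self, tokens, other_tokens):
        x = self.norm1(tokens + self.self_attn(tokens, tokens, tokens)[0])
        x = self.norm2(x + self.cross_attn(x, other_tokens, other_tokens)[0])
        return self.head(x)


# Toy forward pass: 1 volume, 64 patches per modality, 75% masked out.
# A full MAE would append learned mask tokens and compute the loss only
# on masked positions; here we reconstruct the visible patches to keep
# the example short.
pet = torch.randn(1, 64, 512)
ct = torch.randn(1, 64, 512)
keep = 16                                        # 25% of patches visible
pet_enc = PatchEncoder()(pet[:, :keep])          # separate weights per modality
ct_enc = PatchEncoder()(ct[:, :keep])
pet_recon = CrossAttentionDecoder()(pet_enc, ct_enc)  # PET decoder sees CT
loss = nn.functional.mse_loss(pet_recon, pet[:, :keep])
print(loss.item())
```

A symmetric CT decoder cross-attending to PET tokens would complete the twin structure suggested by the paper's name.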

Country of Origin
🇺🇸 United States

Page Count
11 pages

Category
Computer Science:
Computer Vision and Pattern Recognition