T-SYNTH: A Knowledge-Based Dataset of Synthetic Breast Images
By: Christopher Wiedeman, Anastasiia Sarmakeeva, Elena Sizikova and more
Potential Business Impact:
Creates synthetic breast X-ray images to train and test medical imaging algorithms.
One of the key impediments to developing and assessing robust medical imaging algorithms is limited access to large-scale datasets with suitable annotations. Synthetic data generated with plausible physical and biological constraints may address some of these data limitations. We propose the use of physics simulations to generate synthetic images with pixel-level segmentation annotations, which are notoriously difficult to obtain. Specifically, we apply this approach to breast imaging analysis and release T-SYNTH, a large-scale open-source dataset of paired 2D digital mammography (DM) and 3D digital breast tomosynthesis (DBT) images. Our initial experimental results indicate that T-SYNTH images show promise for augmenting limited real patient datasets for detection tasks in DM and DBT. Our data and code are publicly available at https://github.com/DIDSR/tsynth-release.
Similar Papers
SynBT: High-quality Tumor Synthesis for Breast Tumor Segmentation by 3D Diffusion Model
CV and Pattern Recognition
Creates fake tumors to train cancer-spotting AI.
From Healthy Scans to Annotated Tumors: A Tumor Fabrication Framework for 3D Brain MRI Synthesis
CV and Pattern Recognition
Creates fake tumor scans to train AI doctors.
SYN-LUNGS: Towards Simulating Lung Nodules with Anatomy-Informed Digital Twins for AI Training
Machine Learning (CS)
Creates fake lung scans to train cancer detectors.