Score: 1

Synthetic medical data generation: state of the art and application to trauma mechanism classification

Published: August 4, 2025 | arXiv ID: 2508.02771v1

By: Océane Doremus , Ariel Guerra-Adames , Marta Avalos-Fernandez and more

Potential Business Impact:

Creates fake patient data for medical research.

Faced with the challenges of patient confidentiality and scientific reproducibility, research on machine learning for health is turning towards the conception of synthetic medical databases. This article presents a brief overview of state-of-the-art machine learning methods for generating synthetic tabular and textual data, focusing their application to the automatic classification of trauma mechanisms, followed by our proposed methodology for generating high-quality, synthetic medical records combining tabular and unstructured text data.

Page Count
6 pages

Category
Computer Science:
Machine Learning (CS)