Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era
By: Dawei Li , Yue Huang , Ming Li and more
Potential Business Impact:
Creates fake data to train computers faster.
Generative models such as Large Language Models, Diffusion Models, and generative adversarial networks have recently revolutionized the creation of synthetic data, offering scalable solutions to data scarcity, privacy, and annotation challenges in data mining. This tutorial introduces the foundations and latest advances in synthetic data generation, covers key methodologies and practical frameworks, and discusses evaluation strategies and applications. Attendees will gain actionable insights into leveraging generative synthetic data to enhance data mining research and practice. More information can be found on our website: https://syndata4dm.github.io/.
Similar Papers
New Money: A Systematic Review of Synthetic Data Generation for Finance
Machine Learning (CS)
Creates fake money data to train computers safely.
Causal Synthetic Data Generation in Recruitment
Machine Learning (CS)
Creates fair job rankings without using real people's private info.
Generative Artificial Intelligence and Agents in Research and Teaching
Computers and Society
Helps computers create text, art, and ideas.