Evaluating Privacy-Utility Tradeoffs in Synthetic Smart Grid Data
By: Andre Catarino , Rui Melo , Rui Abreu and more
Potential Business Impact:
Creates fake electricity use data to protect privacy.
The widespread adoption of dynamic Time-of-Use (dToU) electricity tariffs requires accurately identifying households that would benefit from such pricing structures. However, the use of real consumption data poses serious privacy concerns, motivating the adoption of synthetic alternatives. In this study, we conduct a comparative evaluation of four synthetic data generation methods, Wasserstein-GP Generative Adversarial Networks (WGAN), Conditional Tabular GAN (CTGAN), Diffusion Models, and Gaussian noise augmentation, under different synthetic regimes. We assess classification utility, distribution fidelity, and privacy leakage. Our results show that architectural design plays a key role: diffusion models achieve the highest utility (macro-F1 up to 88.2%), while CTGAN provide the strongest resistance to reconstruction attacks. These findings highlight the potential of structured generative models for developing privacy-preserving, data-driven energy systems.
Similar Papers
Privacy-Preserving Fair Synthetic Tabular Data
Machine Learning (CS)
Creates private, fair data for sharing without bias.
SMOTE-DP: Improving Privacy-Utility Tradeoff with Synthetic Data
Machine Learning (CS)
Makes private data useful without losing secrets.
Synthesizing Grid Data with Cyber Resilience and Privacy Guarantees
Systems and Control
Protects power grids from hackers using private data.