Diffusion-Driven Synthetic Tabular Data Generation for Enhanced DoS/DDoS Attack Classification
By: Aravind B , Anirud R. S. , Sai Surya Teja N and more
Potential Business Impact:
Makes computer security better at finding rare attacks.
Class imbalance refers to a situation where certain classes in a dataset have significantly fewer samples than oth- ers, leading to biased model performance. Class imbalance in network intrusion detection using Tabular Denoising Diffusion Probability Models (TabDDPM) for data augmentation is ad- dressed in this paper. Our approach synthesizes high-fidelity minority-class samples from the CIC-IDS2017 dataset through iterative denoising processes. For the minority classes that have smaller samples, synthetic samples were generated and merged with the original dataset. The augmented training data enables an ANN classifier to achieve near-perfect recall on previously underrepresented attack classes. These results establish diffusion models as an effective solution for tabular data imbalance in security domains, with potential applications in fraud detection and medical diagnostics.
Similar Papers
Latent Diffusion for Internet of Things Attack Data Generation in Intrusion Detection
Machine Learning (CS)
Makes smart home devices safer from hackers.
Cyberattack Detection in Critical Infrastructure and Supply Chains
Cryptography and Security
Teaches computers to spot new cyberattacks.
Bias-Corrected Data Synthesis for Imbalanced Learning
Machine Learning (Stat)
Fixes computer guessing when most examples are wrong.