Score: 0

Enhanced Web Payload Classification Using WAMM: An AI-Based Framework for Dataset Refinement and Model Evaluation

Published: December 29, 2025 | arXiv ID: 2512.23610v1

By: Heba Osama , Omar Elebiary , Youssef Qassim and more

Potential Business Impact:

AI stops sneaky website attacks better than old rules.

Business Areas:

Machine Learning Artificial Intelligence, Data and Analytics, Software

Web applications increasingly face evasive and polymorphic attack payloads, yet traditional web application firewalls (WAFs) based on static rule sets such as the OWASP Core Rule Set (CRS) often miss obfuscated or zero-day patterns without extensive manual tuning. This work introduces WAMM, an AI-driven multiclass web attack detection framework designed to reveal the limitations of rule-based systems by reclassifying HTTP requests into OWASP-aligned categories for a specific technology stack. WAMM applies a multi-phase enhancement pipeline to the SR-BH 2020 dataset that includes large-scale deduplication, LLM-guided relabeling, realistic attack data augmentation, and LLM-based filtering, producing three refined datasets. Four machine and deep learning models are evaluated using a unified feature space built from statistical and text-based representations. Results show that using an augmented and LLM-filtered dataset on the same technology stack, XGBoost reaches 99.59% accuracy with microsecond-level inference while deep learning models degrade under noisy augmentation. When tested against OWASP CRS using an unseen augmented dataset, WAMM achieves true positive block rates between 96 and 100% with improvements of up to 86%. These findings expose gaps in widely deployed rule-based defenses and demonstrate that curated training pipelines combined with efficient machine learning models enable a more resilient, real-time approach to web attack detection suitable for production WAF environments.

Adaptive Dual-Layer Web Application Firewall (ADL-WAF) Leveraging Machine Learning for Enhanced Anomaly and Threat Detection

Cryptography and Security

Stops hackers from breaking into websites better.

16 Nov 2025 0

87%

LLM-Driven Feature-Level Adversarial Attacks on Android Malware Detectors

Cryptography and Security

Makes bad apps trick phone security software.

24 Dec 2025 1

87%

LLM-based Multi-class Attack Analysis and Mitigation Framework in IoT/IIoT Networks

Cryptography and Security

Makes smart devices safer from hackers.

30 Oct 2025 0

View PDF Login to Bookmark

Page Count

14 pages

Enhanced Web Payload Classification Using WAMM: An AI-Based Framework for Dataset Refinement and Model Evaluation

AI stops sneaky website attacks better than old rules.

Technical Abstract

Adaptive Dual-Layer Web Application Firewall (ADL-WAF) Leveraging Machine Learning for Enhanced Anomaly and Threat Detection

LLM-Driven Feature-Level Adversarial Attacks on Android Malware Detectors

LLM-based Multi-class Attack Analysis and Mitigation Framework in IoT/IIoT Networks