Score: 1

Evasion-Resilient Detection of DNS-over-HTTPS Data Exfiltration: A Practical Evaluation and Toolkit

Published: December 23, 2025 | arXiv ID: 2512.20423v1

By: Adam Elaoumari

Potential Business Impact:

Finds secret data hidden in internet code.

Business Areas:

Intrusion Detection Information Technology, Privacy and Security

The purpose of this project is to assess how well defenders can detect DNS-over-HTTPS (DoH) file exfiltration, and which evasion strategies can be used by attackers. While providing a reproducible toolkit to generate, intercept and analyze DoH exfiltration, and comparing Machine Learning vs threshold-based detection under adversarial scenarios. The originality of this project is the introduction of an end-to-end, containerized pipeline that generates configurable file exfiltration over DoH using several parameters (e.g., chunking, encoding, padding, resolver rotation). It allows for file reconstruction at the resolver side, while extracting flow-level features using a fork of DoHLyzer. The pipeline contains a prediction side, which allows the training of machine learning models based on public labelled datasets and then evaluates them side-by-side with threshold-based detection methods against malicious and evasive DNS-Over-HTTPS traffic. We train Random Forest, Gradient Boosting and Logistic Regression classifiers on a public DoH dataset and benchmark them against evasive DoH exfiltration scenarios. The toolkit orchestrates traffic generation, file capture, feature extraction, model training and analysis. The toolkit is then encapsulated into several Docker containers for easy setup and full reproducibility regardless of the platform it is run on. Future research regarding this project is directed at validating the results on mixed enterprise traffic, extending the protocol coverage to HTTP/3/QUIC request, adding a benign traffic generation, and working on real-time traffic evaluation. A key objective is to quantify when stealth constraints make DoH exfiltration uneconomical and unworthy for the attacker.

CO-DEFEND: Continuous Decentralized Federated Learning for Secure DoH-Based Threat Detection

Machine Learning (CS)

Finds hidden bad internet traffic without sharing secrets.

2 Apr 2025 0

85%

Improving the Identification of Real-world Malware's DNS Covert Channels Using Locality Sensitive Hashing

Cryptography and Security

Finds hidden computer viruses using website names.

25 Nov 2025 0

85%

Evading Data Provenance in Deep Neural Networks

CV and Pattern Recognition

Bypasses AI data copyright detection stealthily

1 Aug 2025 2

View PDF Login to Bookmark

Repos / Data Links

github.com github.com github.com github.com github.com github.com github.com github.com github.com github.com github.com github.com github.com

Page Count

61 pages

Evasion-Resilient Detection of DNS-over-HTTPS Data Exfiltration: A Practical Evaluation and Toolkit

Finds secret data hidden in internet code.

Technical Abstract

CO-DEFEND: Continuous Decentralized Federated Learning for Secure DoH-Based Threat Detection

Improving the Identification of Real-world Malware's DNS Covert Channels Using Locality Sensitive Hashing

Evading Data Provenance in Deep Neural Networks