Towards Benchmarking Design Pattern Detection Under Obfuscation: Reproducing and Evaluating Attention-Based Detection Method
By: Manthan Shenoy, Andreas Rausch
Potential Business Impact:
Makes computer code checkers understand true meaning.
This paper investigates the semantic robustness of attention-based classifiers for design pattern detection, particularly focusing on their reliance on structural and behavioral semantics. We reproduce the DPDAtt, an attention-based design pattern detection approach using learning-based classifiers, and evaluate its performance under obfuscation. To this end, we curate an obfuscated version of the DPDAtt Corpus, where the name identifiers in code such as class names, method names, etc., and string literals like print statements and comment blocks are replaced while preserving control flow, inheritance, and logic. Our findings reveal that these trained classifiers in DPDAtt depend significantly on superficial syntactic features, leading to substantial misclassification when such cues are removed through obfuscation. This work highlights the need for more robust detection tools capable of capturing deeper semantic meanings in source code. We propose our curated Obfuscated corpus (containing 34 Java source files) as a reusable proof-of-concept benchmark for evaluating state-of-the-art design pattern detectors on their true semantic generalization capabilities.
Similar Papers
Generalized Design Choices for Deepfake Detectors
CV and Pattern Recognition
Finds fake videos more reliably.
Multi-Language Detection of Design Pattern Instances
Software Engineering
Finds computer code patterns in many languages.
DPS: Design Pattern Summarisation Using Code Features
Software Engineering
Summarizes computer code to explain design ideas.