Score: 0

Adversarial Question Answering Robustness: A Multi-Level Error Analysis and Mitigation Study

Published: January 6, 2026 | arXiv ID: 2601.02700v1

By: Agniv Roy Choudhury, Vignesh Ponselvan Rajasingh

Potential Business Impact:

Makes AI better at answering questions, even tricky ones.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Question answering (QA) systems achieve impressive performance on standard benchmarks like SQuAD, but remain vulnerable to adversarial examples. This project investigates the adversarial robustness of transformer models on the AddSent adversarial dataset through systematic experimentation across model scales and targeted mitigation strategies. We perform comprehensive multi-level error analysis using five complementary categorization schemes, identifying negation confusion and entity substitution as the primary failure modes. Through systematic evaluation of adversarial fine-tuning ratios, we identify 80% clean + 20% adversarial data as optimal. Data augmentation experiments reveal a capacity bottleneck in small models. Scaling from ELECTRA-small (14M parameters) to ELECTRA-base (110M parameters) eliminates the robustness-accuracy trade-off, achieving substantial improvements on both clean and adversarial data. We implement three targeted mitigation strategies, with Entity-Aware contrastive learning achieving best performance: 89.89% AddSent Exact Match (EM) and 90.73% SQuAD EM, representing 94.9% closure of the adversarial gap. To our knowledge, this is the first work integrating comprehensive linguistic error analysis with Named Entity Recognition (NER)-guided contrastive learning for adversarial QA, demonstrating that targeted mitigation can achieve near-parity between clean and adversarial performance.

Differential Robustness in Transformer Language Models: Empirical Evaluation Under Adversarial Text Attacks

Cryptography and Security

Makes AI smarter and harder to trick.

5 Sep 2025 0

88%

MultiQ&A: An Analysis in Measuring Robustness via Automated Crowdsourcing of Question Perturbations and Answers

Computation and Language

Makes AI tell the truth, not make things up.

6 Feb 2025 0

88%

The Impact of Scaling Training Data on Adversarial Robustness

CV and Pattern Recognition

Makes AI smarter and harder to trick.

30 Sep 2025 2

View PDF Login to Bookmark

Page Count

12 pages

Adversarial Question Answering Robustness: A Multi-Level Error Analysis and Mitigation Study

Makes AI better at answering questions, even tricky ones.

Technical Abstract

Differential Robustness in Transformer Language Models: Empirical Evaluation Under Adversarial Text Attacks

MultiQ&A: An Analysis in Measuring Robustness via Automated Crowdsourcing of Question Perturbations and Answers

The Impact of Scaling Training Data on Adversarial Robustness