Score: 0

Semantic Reconstruction of Adversarial Plagiarism: A Context-Aware Framework for Detecting and Restoring "Tortured Phrases" in Scientific Literature

Published: December 11, 2025 | arXiv ID: 2512.10435v1

By: Agniva Maiti, Prajwal Panth, Suresh Chandra Satapathy

Potential Business Impact:

Finds hidden copied text in science papers.

Business Areas:

Semantic Search Internet Services

The integrity and reliability of scientific literature is facing a serious threat by adversarial text generation techniques, specifically from the use of automated paraphrasing tools to mask plagiarism. These tools generate "tortured phrases", statistically improbable synonyms (e.g. "counterfeit consciousness" for "artificial intelligence"), that preserve the local grammar while obscuring the original source. Most existing detection methods depend heavily on static blocklists or general-domain language models, which suffer from high false-negative rates for novel obfuscations and cannot determine the source of the plagiarized content. In this paper, we propose Semantic Reconstruction of Adversarial Plagiarism (SRAP), a framework designed not only to detect these anomalies but to mathematically recover the original terminology. We use a two-stage architecture: (1) statistical anomaly detection with a domain-specific masked language model (SciBERT) using token-level pseudo-perplexity, and (2) source-based semantic reconstruction using dense vector retrieval (FAISS) and sentence-level alignment (SBERT). Experiments on a parallel corpus of adversarial scientific text show that while zero-shot baselines fail completely (0.00 percent restoration accuracy), our retrieval-augmented approach achieves 23.67 percent restoration accuracy, significantly outperforming baseline methods. We also show that static decision boundaries are necessary for robust detection in jargon-heavy scientific text, since dynamic thresholding fails under high variance. SRAP enables forensic analysis by linking obfuscated expressions back to their most probable source documents.

Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text

Computation and Language

Makes AI writing trick detectors into thinking it's human.

8 Jun 2025 1

88%

SPARTA: Evaluating Reasoning Segmentation Robustness through Black-Box Adversarial Paraphrasing in Text Autoencoder Latent Space

Computation and Language

Makes AI understand different ways of asking the same thing.

28 Oct 2025 0

87%

Same Question, Different Words: A Latent Adversarial Framework for Prompt Robustness

Computation and Language

Makes AI understand questions asked in different ways.

3 Mar 2025 1

View PDF Login to Bookmark

Country of Origin

🇮🇳 India

Page Count

17 pages

Semantic Reconstruction of Adversarial Plagiarism: A Context-Aware Framework for Detecting and Restoring "Tortured Phrases" in Scientific Literature

Finds hidden copied text in science papers.

Technical Abstract

Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text

SPARTA: Evaluating Reasoning Segmentation Robustness through Black-Box Adversarial Paraphrasing in Text Autoencoder Latent Space

Same Question, Different Words: A Latent Adversarial Framework for Prompt Robustness