
Model-Agnostic Sentiment Distribution Stability Analysis for Robust LLM-Generated Texts Detection

Published: August 9, 2025 | arXiv ID: 2508.06913v1

By: Siyuan Li, Xi Lin, Guangyan Li and more

Potential Business Impact:

Finds fake writing by checking emotions.

The rapid advancement of large language models (LLMs) has resulted in increasingly sophisticated AI-generated content, posing significant challenges in distinguishing LLM-generated text from human-written language. Existing detection methods, primarily based on lexical heuristics or fine-tuned classifiers, often suffer from limited generalizability and are vulnerable to paraphrasing, adversarial perturbations, and cross-domain shifts. In this work, we propose SentiDetect, a model-agnostic framework for detecting LLM-generated text by analyzing the divergence in sentiment distribution stability. Our method is motivated by the empirical observation that LLM outputs tend to exhibit emotionally consistent patterns, whereas human-written texts display greater emotional variability. To capture this phenomenon, we define two complementary metrics, sentiment distribution consistency and sentiment distribution preservation, which quantify stability under sentiment-altering and semantic-preserving transformations. We evaluate SentiDetect on five diverse datasets and a range of advanced LLMs, including Gemini-1.5-Pro, Claude-3, GPT-4-0613, and LLaMa-3.3. Experimental results demonstrate its superiority over state-of-the-art baselines, with F1 score improvements of over 16% on Gemini-1.5-Pro and over 11% on GPT-4-0613. SentiDetect also shows greater robustness to paraphrasing, adversarial attacks, and text length variations, outperforming existing detectors in challenging scenarios.
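The abstract does not give formulas for its two metrics, so the following is only a rough sketch of the general idea of measuring how much a text's sentiment distribution shifts under a transformation. Everything in it is an assumption for illustration: the toy word lists, the three-class distribution, the use of Jensen-Shannon divergence, and the names `sentiment_distribution`, `js_divergence`, and `stability_score` are hypothetical and are not taken from SentiDetect.

```python
import math
from collections import Counter

# Hypothetical toy lexicon (assumption); a real system would use a trained
# sentiment model rather than word lists.
POSITIVE = {"good", "great", "love", "happy", "excellent"}
NEGATIVE = {"bad", "terrible", "hate", "sad", "awful"}


def sentiment_distribution(text):
    """Return smoothed (negative, neutral, positive) probabilities over tokens."""
    tokens = [t.strip(".,!?;:").lower() for t in text.split()]
    counts = Counter()
    for tok in tokens:
        if tok in POSITIVE:
            counts["pos"] += 1
        elif tok in NEGATIVE:
            counts["neg"] += 1
        else:
            counts["neu"] += 1
    # Add-one smoothing over the three classes keeps every probability > 0.
    total = sum(counts.values()) + 3
    return [(counts[k] + 1) / total for k in ("neg", "neu", "pos")]


def js_divergence(p, q):
    """Jensen-Shannon divergence in base 2, which lies in [0, 1]."""
    m = [(a + b) / 2 for a, b in zip(p, q)]

    def kl(x, y):
        return sum(a * math.log2(a / b) for a, b in zip(x, y) if a > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def stability_score(original, transformed):
    """Illustrative stability: 1 means the sentiment distribution is unchanged."""
    p = sentiment_distribution(original)
    q = sentiment_distribution(transformed)
    return 1.0 - js_divergence(p, q)
```

In this sketch, a detector would compare `stability_score` between a text and its rewritten variants (for example, a paraphrase or a sentiment-flipped version produced by some external rewriting step) and treat unusually stable scores as a signal; how SentiDetect actually combines its metrics is not stated in the abstract.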

Robust Fake News Detection using Large Language Models under Adversarial Sentiment Attacks

Computation and Language

Stops fake news from tricking computers by changing feelings.

21 Jan 2026 2

90%

Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm

Computation and Language

Computers understand feelings and opinions in text.

5 Jan 2025 2

89%

Towards Consistent Detection of Cognitive Distortions: LLM-Based Annotation and Dataset-Agnostic Evaluation

Computation and Language

Computers learn to spot bad thoughts better.

3 Nov 2025 1


Country of Origin

🇨🇳 China

Page Count

9 pages
