Score: 3

Detecting LLM-Generated Spam Reviews by Integrating Language Model Embeddings and Graph Neural Network

Published: October 2, 2025 | arXiv ID: 2510.01801v1

By: Xin Liu , Rongwu Xu , Xinyi Jia and more

Potential Business Impact:

Catches fake online reviews written by AI.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

The rise of large language models (LLMs) has enabled the generation of highly persuasive spam reviews that closely mimic human writing. These reviews pose significant challenges for existing detection systems and threaten the credibility of online platforms. In this work, we first create three realistic LLM-generated spam review datasets using three distinct LLMs, each guided by product metadata and genuine reference reviews. Evaluations by GPT-4.1 confirm the high persuasion and deceptive potential of these reviews. To address this threat, we propose FraudSquad, a hybrid detection model that integrates text embeddings from a pre-trained language model with a gated graph transformer for spam node classification. FraudSquad captures both semantic and behavioral signals without relying on manual feature engineering or massive training resources. Experiments show that FraudSquad outperforms state-of-the-art baselines by up to 44.22% in precision and 43.01% in recall on three LLM-generated datasets, while also achieving promising results on two human-written spam datasets. Furthermore, FraudSquad maintains a modest model size and requires minimal labeled training data, making it a practical solution for real-world applications. Our contributions include new synthetic datasets, a practical detection framework, and empirical evidence highlighting the urgency of adapting spam detection to the LLM era. Our code and datasets are available at: https://anonymous.4open.science/r/FraudSquad-5389/.

Weak Links in LinkedIn: Enhancing Fake Profile Detection in the Age of LLMs

Social and Information Networks

Makes fake online profiles easier to spot.

21 Jul 2025 1

90%

An Investigation of Large Language Models and Their Vulnerabilities in Spam Detection

Cryptography and Security

New AI can't always spot tricky spam emails.

14 Apr 2025 1

90%

A Theoretically Grounded Hybrid Ensemble for Reliable Detection of LLM-Generated Text

Computation and Language

Finds fake writing in schoolwork better.

27 Nov 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇦 🇨🇳 Canada, China

Repos / Data Links

github.com github.com github.com

Page Count

13 pages

Detecting LLM-Generated Spam Reviews by Integrating Language Model Embeddings and Graph Neural Network

Catches fake online reviews written by AI.

Technical Abstract

Weak Links in LinkedIn: Enhancing Fake Profile Detection in the Age of LLMs

An Investigation of Large Language Models and Their Vulnerabilities in Spam Detection

A Theoretically Grounded Hybrid Ensemble for Reliable Detection of LLM-Generated Text