Score: 0

Evaluating AI capabilities in detecting conspiracy theories on YouTube

Published: May 29, 2025 | arXiv ID: 2505.23570v2

By: Leonardo La Rocca, Francesco Corso, Francesco Pierri

Potential Business Impact:

Finds fake news videos on YouTube.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

As a leading online platform with a vast global audience, YouTube's extensive reach also makes it susceptible to hosting harmful content, including disinformation and conspiracy theories. This study explores the use of open-weight Large Language Models (LLMs), both text-only and multimodal, for identifying conspiracy theory videos shared on YouTube. Leveraging a labeled dataset of thousands of videos, we evaluate a variety of LLMs in a zero-shot setting and compare their performance to a fine-tuned RoBERTa baseline. Results show that text-based LLMs achieve high recall but lower precision, leading to increased false positives. Multimodal models lag behind their text-only counterparts, indicating limited benefits from visual data integration. To assess real-world applicability, we evaluate the most accurate models on an unlabeled dataset, finding that RoBERTa achieves performance close to LLMs with a larger number of parameters. Our work highlights the strengths and limitations of current LLM-based approaches for online harmful content detection, emphasizing the need for more precise and robust systems.

Page Count
28 pages

Category
Computer Science:
Computation and Language