Score: 2

MMM-Fact: A Multimodal, Multi-Domain Fact-Checking Dataset with Multi-Level Retrieval Difficulty

Published: October 29, 2025 | arXiv ID: 2510.25120v1

By: Wenyan Xu , Dawei Xiang , Tianqi Ding and more

Potential Business Impact:

Helps computers spot fake news with text, images, and video.

Business Areas:

Semantic Web Internet Services

Misinformation and disinformation demand fact checking that goes beyond simple evidence-based reasoning. Existing benchmarks fall short: they are largely single modality (text-only), span short time horizons, use shallow evidence, cover domains unevenly, and often omit full articles -- obscuring models' real-world capability. We present MMM-Fact, a large-scale benchmark of 125,449 fact-checked statements (1995--2025) across multiple domains, each paired with the full fact-check article and multimodal evidence (text, images, videos, tables) from four fact-checking sites and one news outlet. To reflect verification effort, each statement is tagged with a retrieval-difficulty tier -- Basic (1--5 sources), Intermediate (6--10), and Advanced (>10) -- supporting fairness-aware evaluation for multi-step, cross-modal reasoning. The dataset adopts a three-class veracity scheme (true/false/not enough information) and enables tasks in veracity prediction, explainable fact-checking, complex evidence aggregation, and longitudinal analysis. Baselines with mainstream LLMs show MMM-Fact is markedly harder than prior resources, with performance degrading as evidence complexity rises. MMM-Fact offers a realistic, scalable benchmark for transparent, reliable, multimodal fact-checking.

XFacta: Contemporary, Real-World Dataset and Evaluation for Multimodal Misinformation Detection with Multimodal LLMs

Computation and Language

Finds fake news shared with pictures and words.

4 Aug 2025 1

90%

Facts are Harder Than Opinions -- A Multilingual, Comparative Analysis of LLM-Based Fact-Checking Reliability

Computers and Society

Helps computers spot fake news in many languages.

4 Jun 2025 1

90%

RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-Checking

Computation and Language

Tests computers' ability to spot fake news.

14 Jun 2025 1

View PDF Login to Bookmark

Country of Origin

🇨🇳 🇺🇸 China, United States

Repos / Data Links

huggingface.co

Page Count

5 pages

MMM-Fact: A Multimodal, Multi-Domain Fact-Checking Dataset with Multi-Level Retrieval Difficulty

Helps computers spot fake news with text, images, and video.

Technical Abstract

XFacta: Contemporary, Real-World Dataset and Evaluation for Multimodal Misinformation Detection with Multimodal LLMs

Facts are Harder Than Opinions -- A Multilingual, Comparative Analysis of LLM-Based Fact-Checking Reliability

RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-Checking