Multimedia Verification Through Multi-Agent Deep Research Multimodal Large Language Models
By: Huy Hoan Le, Van Sy Thinh Nguyen, Thi Le Chi Dang, and more
Potential Business Impact:
Detects fake videos and images online.
This paper presents our submission to the ACMMM25 Grand Challenge on Multimedia Verification. We developed a multi-agent verification system that combines Multimodal Large Language Models (MLLMs) with specialized verification tools to detect multimedia misinformation. Our system operates through six stages: raw data processing, planning, information extraction, deep research, evidence collection, and report generation. The core Deep Researcher Agent employs four tools: reverse image search, metadata analysis, fact-checking databases, and verified news processing, which extracts spatial, temporal, attribution, and motivational context. We demonstrate our approach on a challenge dataset sample involving complex multimedia content. Our system successfully verified content authenticity, extracted precise geolocation and timing information, and traced source attribution across multiple platforms, effectively addressing real-world multimedia verification scenarios.
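The staged pipeline described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the stage and tool names are taken from the abstract, but the function names, data structures, and stubbed tool outputs are assumptions for the sake of the sketch.

```python
from dataclasses import dataclass, field

# The six stages named in the abstract, run in order.
STAGES = [
    "raw_data_processing",
    "planning",
    "information_extraction",
    "deep_research",
    "evidence_collection",
    "report_generation",
]

# The four tools the Deep Researcher Agent fans out to (names from the
# abstract; outputs stubbed for illustration).
DEEP_RESEARCH_TOOLS = [
    "reverse_image_search",
    "metadata_analysis",
    "fact_checking_databases",
    "verified_news_processing",
]


@dataclass
class VerificationReport:
    """Hypothetical container for the evidence a run accumulates."""
    item_id: str
    evidence: dict = field(default_factory=dict)
    stages_completed: list = field(default_factory=list)


def run_pipeline(item_id: str) -> VerificationReport:
    """Run each stage in sequence; the deep_research stage invokes each tool.

    In the real system each tool would contribute spatial, temporal,
    attribution, or motivational context; here each is a placeholder string.
    """
    report = VerificationReport(item_id=item_id)
    for stage in STAGES:
        if stage == "deep_research":
            for tool in DEEP_RESEARCH_TOOLS:
                report.evidence[tool] = f"{tool} result for {item_id}"
        report.stages_completed.append(stage)
    return report


report = run_pipeline("sample-video-001")
print(report.stages_completed)
print(sorted(report.evidence))
```

The sequential stage list with a fan-out inside the deep-research step mirrors the architecture the abstract outlines: a single orchestrated flow whose research stage aggregates evidence from multiple independent tools before the report is generated.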
Similar Papers
DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search
CV and Pattern Recognition
Lets computers search the web for answers.
Fact-Checking at Scale: Multimodal AI for Authenticity and Context Verification in Online Media
Multimedia
Checks if online videos and pictures are real.