MADTempo: An Interactive System for Multi-Event Temporal Video Retrieval with Query Augmentation
By: Huu-An Vu, Van-Khanh Mai, Trong-Tam Nguyen and more
Potential Business Impact:
Finds videos by understanding how events connect.
The rapid expansion of video content across online platforms has accelerated the need for retrieval systems capable of understanding not only isolated visual moments but also the temporal structure of complex events. Existing approaches often fall short in modeling temporal dependencies across multiple events and in handling queries that reference unseen or rare visual concepts. To address these challenges, we introduce MADTempo, a video retrieval framework developed by our team, AIO_Trinh, that unifies temporal search with web-scale visual grounding. Our temporal search mechanism captures event-level continuity by aggregating similarity scores across sequential video segments, enabling coherent retrieval of multi-event queries. Complementing this mechanism, a Google Image Search-based fallback module expands query representations with external web imagery, effectively bridging gaps in pretrained visual embeddings and improving robustness against out-of-distribution (OOD) queries. Together, these components advance the temporal reasoning and generalization capabilities of modern video retrieval systems, paving the way for more semantically aware and adaptive retrieval across large-scale video corpora.
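To make the temporal search idea more concrete, the sketch below shows one way similarity scores could be aggregated across sequential video segments so that the events of a multi-event query are matched in order. This is an illustrative assumption, not the paper's actual implementation: the function name temporal_search, the max_gap parameter, and the dynamic-programming aggregation are all hypothetical, and the only inputs assumed are precomputed, temporally ordered embeddings for the query events and the video segments.

```python
import numpy as np

def temporal_search(query_embs, segment_embs, max_gap=10):
    """Hypothetical sketch of multi-event temporal search by score aggregation.

    query_embs:   (E, D) array, one embedding per query event, in temporal order.
    segment_embs: (S, D) array, one embedding per video segment, in temporal order.
    max_gap:      maximum number of segments allowed between consecutive events.

    Returns the best aggregate similarity and the segment chosen for each event.
    """
    # Cosine similarity between every query event and every video segment.
    q = query_embs / np.linalg.norm(query_embs, axis=1, keepdims=True)
    s = segment_embs / np.linalg.norm(segment_embs, axis=1, keepdims=True)
    sim = q @ s.T                      # shape (E, S)

    E, S = sim.shape
    # dp[e, j] = best aggregate score with event e placed at segment j and
    # all earlier events placed at strictly earlier segments.
    dp = np.full((E, S), -np.inf)
    back = np.zeros((E, S), dtype=int)
    dp[0] = sim[0]
    for e in range(1, E):
        for j in range(e, S):
            lo = max(e - 1, j - max_gap)
            prev = dp[e - 1, lo:j]
            if prev.size == 0:
                continue
            k = int(np.argmax(prev)) + lo
            dp[e, j] = dp[e - 1, k] + sim[e, j]
            back[e, j] = k

    # Recover the highest-scoring ordered placement of events.
    j = int(np.argmax(dp[-1]))
    best = float(dp[-1, j])
    path = [j]
    for e in range(E - 1, 0, -1):
        j = back[e, j]
        path.append(j)
    return best, path[::-1]
```

The ordering constraint (each event must land on a strictly later segment than the previous one, within max_gap segments) is what turns per-segment similarity into the event-level continuity described in the abstract; the web-image fallback would then only need to supply alternative query embeddings for events the pretrained model scores poorly.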
Similar Papers
Enhanced Multimodal Video Retrieval System: Integrating Query Expansion and Cross-modal Temporal Event Retrieval
Information Retrieval
Finds video clips using many search words.
Unified Interactive Multimodal Moment Retrieval via Cascaded Embedding-Reranking and Temporal-Aware Score Fusion
CV and Pattern Recognition
Finds specific video moments using smart searching.
A Lightweight Moment Retrieval System with Global Re-Ranking and Robust Adaptive Bidirectional Temporal Search
CV and Pattern Recognition
Finds video clips faster in huge collections.