Score: 2

The 1st Solution for MOSEv2 Challenge 2025: Long-term and Concept-aware Video Segmentation via SeC

Published: September 23, 2025 | arXiv ID: 2509.19183v1

By: Mingqi Gao , Jingkun Chen , Yunqi Miao and more

Potential Business Impact:

Helps computers track moving things in videos better.

Business Areas:
Semantic Search Internet Services

This technical report explores the MOSEv2 track of the LSVOS Challenge, which targets complex semi-supervised video object segmentation. By analysing and adapting SeC, an enhanced SAM-2 framework, we conduct a detailed study of its long-term memory and concept-aware memory, showing that long-term memory preserves temporal continuity under occlusion and reappearance, while concept-aware memory supplies semantic priors that suppress distractors; together, these traits directly benefit several MOSEv2's core challenges. Our solution achieves a JF score of 39.89% on the test set, ranking 1st in the MOSEv2 track of the LSVOS Challenge.

Country of Origin
🇨🇳 🇬🇧 China, United Kingdom

Repos / Data Links

Page Count
6 pages

Category
Computer Science:
CV and Pattern Recognition