Action Spotting and Precise Event Detection in Sports: Datasets, Methods, and Challenges
By: Hao Xu , Arbind Agrahari Baniya , Sam Well and more
Potential Business Impact:
Helps computers find sports highlights automatically.
Video event detection is central to modern sports analytics, enabling automated understanding of key moments for performance evaluation, content creation, and tactical feedback. While deep learning has significantly advanced tasks like Temporal Action Localization (TAL), Action Spotting (AS), and Precise Event Spotting (PES), existing surveys often overlook the fine-grained temporal demands and domain-specific challenges posed by sports. This survey first provides a clear conceptual distinction between TAL, AS, and PES, then introduces a methods-based taxonomy covering recent deep learning approaches for AS and PES, including feature-based pipelines, end-to-end architectures, and multimodal strategies. We further review benchmark datasets and evaluation protocols, identifying critical limitations such as reliance on broadcast-quality footage and lenient multi-label metrics that hinder real-world deployment. Finally, we outline open challenges and future directions toward more temporally precise, generalizable, and practical event spotting in sports video analysis.
Similar Papers
Precise Event Spotting in Sports Videos: Solving Long-Range Dependency and Class Imbalance
CV and Pattern Recognition
Spots exact moments of sports actions in videos.
Few-Shot Precise Event Spotting via Unified Multi-Entity Graph and Distillation
CV and Pattern Recognition
Teaches computers to spot sports actions with little data.
Towards Precise Action Spotting: Addressing Temporal Misalignment in Labels with Dynamic Label Assignment
CV and Pattern Recognition
Fixes computer action spotting when labels are messy.