RacketVision: A Multiple Racket Sports Benchmark for Unified Ball and Racket Analysis
By: Linfeng Dong , Yuchen Yang , Hao Wu and more
Potential Business Impact:
Helps computers understand racket sports better.
We introduce RacketVision, a novel dataset and benchmark for advancing computer vision in sports analytics, covering table tennis, tennis, and badminton. The dataset is the first to provide large-scale, fine-grained annotations for racket pose alongside traditional ball positions, enabling research into complex human-object interactions. It is designed to tackle three interconnected tasks: fine-grained ball tracking, articulated racket pose estimation, and predictive ball trajectory forecasting. Our evaluation of established baselines reveals a critical insight for multi-modal fusion: while naively concatenating racket pose features degrades performance, a CrossAttention mechanism is essential to unlock their value, leading to trajectory prediction results that surpass strong unimodal baselines. RacketVision provides a versatile resource and a strong starting point for future research in dynamic object tracking, conditional motion forecasting, and multimodal analysis in sports. Project page at https://github.com/OrcustD/RacketVision
Similar Papers
RacketVision: A Multiple Racket Sports Benchmark for Unified Ball and Racket Analysis
CV and Pattern Recognition
Helps computers understand racket sports better.
Bridging the Gap: Doubles Badminton Analysis with Singles-Trained Models
CV and Pattern Recognition
Helps computers understand fast badminton games.
Automated Tennis Player and Ball Tracking with Court Keypoints Detection (Hawk Eye System)
CV and Pattern Recognition
Analyzes tennis games automatically for better coaching.