Score: 1

Enhancing Fitness Movement Recognition with Attention Mechanism and Pre-Trained Feature Extractors

Published: September 2, 2025 | arXiv ID: 2509.02511v1

By: Shanjid Hasan Nishat, Srabonti Deb, Mohiuddin Ahmed

Potential Business Impact:

Lets phones recognize exercises from videos.

Business Areas:
Image Recognition Data and Analytics, Software

Fitness movement recognition, a focused subdomain of human activity recognition (HAR), plays a vital role in health monitoring, rehabilitation, and personalized fitness training by enabling automated exercise classification from video data. However, many existing deep learning approaches rely on computationally intensive 3D models, limiting their feasibility in real-time or resource-constrained settings. In this paper, we present a lightweight and effective framework that integrates pre-trained 2D Convolutional Neural Networks (CNNs) such as ResNet50, EfficientNet, and Vision Transformers (ViT) with a Long Short-Term Memory (LSTM) network enhanced by spatial attention. These models efficiently extract spatial features while the LSTM captures temporal dependencies, and the attention mechanism emphasizes informative segments. We evaluate the framework on a curated subset of the UCF101 dataset, achieving a peak accuracy of 93.34\% with the ResNet50-based configuration. Comparative results demonstrate the superiority of our approach over several state-of-the-art HAR systems. The proposed method offers a scalable and real-time-capable solution for fitness activity recognition with broader applications in vision-based health and activity monitoring.

Page Count
6 pages

Category
Computer Science:
CV and Pattern Recognition