MMA: A Momentum Mamba Architecture for Human Activity Recognition with Inertial Sensors
By: Thai-Khanh Nguyen , Uyen Vo , Tan M. Nguyen and more
Potential Business Impact:
Helps computers understand body movements better.
Human activity recognition (HAR) from inertial sensors is essential for ubiquitous computing, mobile health, and ambient intelligence. Conventional deep models such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and transformers have advanced HAR but remain limited by vanishing or exloding gradients, high computational cost, and difficulty in capturing long-range dependencies. Structured state-space models (SSMs) like Mamba address these challenges with linear complexity and effective temporal modeling, yet they are restricted to first-order dynamics without stable longterm memory mechanisms. We introduce Momentum Mamba, a momentum-augmented SSM that incorporates second-order dynamics to improve stability of information flow across time steps, robustness, and long-sequence modeling. Two extensions further expand its capacity: Complex Momentum Mamba for frequency-selective memory scaling. Experiments on multiple HAR benchmarks demonstrate consistent gains over vanilla Mamba and Transformer baselines in accuracy, robustness, and convergence speed. With only moderate increases in training cost, momentum-augmented SSMs offer a favorable accuracy-efficiency balance, establishing them as a scalable paradigm for HAR and a promising principal framework for broader sequence modeling applications.
Similar Papers
RadMamba: Efficient Human Activity Recognition through Radar-based Micro-Doppler-Oriented Mamba State-Space Model
CV and Pattern Recognition
Lets radar see your movements without cameras.
eMamba: Efficient Acceleration Framework for Mamba Models in Edge Computing
Machine Learning (CS)
Makes smart devices run AI faster, using less power.
SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation
CV and Pattern Recognition
Helps computers understand human body movements better.