Joint Image-Instance Spatial-Temporal Attention for Few-shot Action Recognition
By: Zefeng Qian, Chongyang Zhang, Yifei Huang and more
Potential Business Impact:
Helps computers learn new actions from few examples.
Few-shot Action Recognition (FSAR) is a crucial challenge in computer vision: recognizing actions from a limited set of examples. Recent approaches mainly use image-level features to construct temporal dependencies and generate prototypes for each action category. However, many of these methods rely on image-level features that incorporate background noise and focus insufficiently on the real foreground (action-related instances), which compromises recognition capability, particularly in the few-shot scenario. To tackle this issue, we propose a novel joint Image-Instance level Spatial-temporal attention approach (I2ST) for Few-shot Action Recognition. The core concept of I2ST is to perceive the action-related instances and integrate them with image features via spatial-temporal attention. Specifically, I2ST consists of two key components: Action-related Instance Perception and Joint Image-Instance Spatial-temporal Attention. Given the basic representations from the feature extractor, Action-related Instance Perception perceives action-related instances under the guidance of a text-guided segmentation model. Subsequently, Joint Image-Instance Spatial-temporal Attention constructs the feature dependency between instances and images...
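To make the idea of attending jointly over image-level and instance-level features more concrete, here is a minimal NumPy sketch. It is not the authors' implementation: the function names `joint_attention` and `classify`, the tensor shapes (T frames, K instances per frame, D feature dimensions), the use of plain single-head self-attention without learned projections, and the cosine-similarity prototype matching are all assumptions made for illustration. The text-guided segmentation step that produces the instance features is left out entirely.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def joint_attention(image_feats, instance_feats):
    """Hypothetical joint image-instance attention over one video.

    image_feats:    (T, D)    one feature vector per frame
    instance_feats: (T, K, D) K instance feature vectors per frame
    Returns a (D,) video descriptor.
    """
    T, D = image_feats.shape
    # Put frame tokens and instance tokens from all frames into one
    # sequence so attention spans both space (instances) and time (frames).
    tokens = np.concatenate(
        [image_feats, instance_feats.reshape(-1, D)], axis=0
    )  # shape: (T + T*K, D)
    # Assumption: identity query/key/value projections, single head.
    scores = tokens @ tokens.T / np.sqrt(D)
    attended = softmax(scores, axis=-1) @ tokens
    # Average all attended tokens into one descriptor.
    return attended.mean(axis=0)


def classify(query_desc, prototypes):
    """Return the index of the prototype closest in cosine similarity.

    query_desc: (D,)   descriptor of the query video
    prototypes: (C, D) one prototype per action class
    """
    q = query_desc / np.linalg.norm(query_desc)
    p = prototypes / np.linalg.norm(prototypes, axis=1, keepdims=True)
    return int(np.argmax(p @ q))
```

In a few-shot episode under these assumptions, each class prototype would be the mean of the `joint_attention` descriptors of that class's support videos, and a query video would be assigned to the class returned by `classify`.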
Similar Papers
Hierarchical Relation-augmented Representation Generalization for Few-shot Action Recognition
CV and Pattern Recognition
Teaches computers to learn new actions from few examples.
Temporal Alignment-Free Video Matching for Few-shot Action Recognition
CV and Pattern Recognition
Teaches computers to recognize actions from few examples.
Spatial-Temporal Perception with Causal Inference for Naturalistic Driving Action Recognition
CV and Pattern Recognition
Helps cars watch drivers to prevent accidents.