Score: 0

Generalizing WiFi Gesture Recognition via Large-Model-Aware Semantic Distillation and Alignment

Published: October 15, 2025 | arXiv ID: 2510.13390v1

By: Feng-Qi Cui , Yu-Tong Guo , Tianyue Zheng and more

Potential Business Impact:

Lets WiFi understand your hand movements.

Business Areas:

Image Recognition Data and Analytics, Software

WiFi-based gesture recognition has emerged as a promising RF sensing paradigm for enabling non-contact and privacy-preserving human-computer interaction in AIoT environments. However, existing methods often suffer from limited generalization and semantic expressiveness due to the domain-sensitive nature of Channel State Information and the lack of high-level gesture abstraction. To address these challenges, we propose a novel generalization framework, termed Large-Model-Aware Semantic Distillation and Alignment (GLSDA), which leverages the semantic prior of pre-trained large foundation models to enhance gesture representation learning in both in-domain and cross-domain scenarios. Specifically, we first design a dual-path CSI encoding pipeline that captures geometric and dynamic gesture patterns via CSI-Ratio phase sequences and Doppler spectrograms. These representations are then fed into a Multiscale Semantic Encoder, which learns robust temporal embeddings and aligns them with gesture semantics through cross-modal attention mechanisms. To further enhance category discrimination, we introduce a Semantic-Aware Soft Supervision scheme that encodes inter-class correlations and reduces label ambiguity, especially for semantically similar gestures. Finally, we develop a Robust Dual-Distillation strategy to compress the aligned model into a lightweight student network, jointly distilling intermediate features and semantic-informed soft labels from the teacher model. Extensive experiments on the Widar3.0 benchmark show that GLSDA consistently outperforms state-of-the-art methods in both in-domain and cross-domain gesture recognition tasks, while significantly reducing model size and inference latency. Our method offers a scalable and deployable solution for generalized RF-based gesture interfaces in real-world AIoT applications.

WiFi-based Cross-Domain Gesture Recognition Using Attention Mechanism

CV and Pattern Recognition

Wi-Fi signals recognize gestures in any room.

4 Dec 2025 0

88%

SASG-DA: Sparse-Aware Semantic-Guided Diffusion Augmentation For Myoelectric Gesture Recognition

CV and Pattern Recognition

Improves robot arm control with better muscle signal reading.

11 Nov 2025 0

87%

Scale What Counts, Mask What Matters: Evaluating Foundation Models for Zero-Shot Cross-Domain Wi-Fi Sensing

CV and Pattern Recognition

Makes Wi-Fi track people and actions anywhere.

24 Nov 2025 1

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

8 pages

Generalizing WiFi Gesture Recognition via Large-Model-Aware Semantic Distillation and Alignment

Lets WiFi understand your hand movements.

Technical Abstract

WiFi-based Cross-Domain Gesture Recognition Using Attention Mechanism

SASG-DA: Sparse-Aware Semantic-Guided Diffusion Augmentation For Myoelectric Gesture Recognition

Scale What Counts, Mask What Matters: Evaluating Foundation Models for Zero-Shot Cross-Domain Wi-Fi Sensing