Score: 3

MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution

Published: June 17, 2025 | arXiv ID: 2506.14511v1

By: Zhiwen Shao , Yifan Cheng , Feiran Li and more

Potential Business Impact:

Reads tiny facial movements to understand emotions.

Business Areas:

Motion Capture Media and Entertainment, Video

Facial micro-expression recognition (MER) is a challenging problem, due to transient and subtle micro-expression (ME) actions. Most existing methods depend on hand-crafted features, key frames like onset, apex, and offset frames, or deep networks limited by small-scale and low-diversity datasets. In this paper, we propose an end-to-end micro-action-aware deep learning framework with advantages from transformer, graph convolution, and vanilla convolution. In particular, we propose a novel F5C block composed of fully-connected convolution and channel correspondence convolution to directly extract local-global features from a sequence of raw frames, without the prior knowledge of key frames. The transformer-style fully-connected convolution is proposed to extract local features while maintaining global receptive fields, and the graph-style channel correspondence convolution is introduced to model the correlations among feature patterns. Moreover, MER, optical flow estimation, and facial landmark detection are jointly trained by sharing the local-global features. The two latter tasks contribute to capturing facial subtle action information for MER, which can alleviate the impact of insufficient training data. Extensive experiments demonstrate that our framework (i) outperforms the state-of-the-art MER methods on CASME II, SAMM, and SMIC benchmarks, (ii) works well for optical flow estimation and facial landmark detection, and (iii) can capture facial subtle muscle actions in local regions associated with MEs. The code is available at https://github.com/CYF-cuber/MOL.

Micro-Expression Recognition via Fine-Grained Dynamic Perception

CV and Pattern Recognition

Helps computers understand tiny, fast facial changes.

7 Sep 2025 3

89%

MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception

CV and Pattern Recognition

Helps computers spot tiny, hidden facial emotions.

11 May 2025 1

89%

Rethinking Key-frame-based Micro-expression Recognition: A Robust and Accurate Framework Against Key-frame Errors

CV and Pattern Recognition

Helps computers read emotions from tiny face changes.

8 Aug 2025 2

View PDF Login to Bookmark

Country of Origin

🇨🇳 🇦🇺 China, Australia

Repos / Data Links

github.com

Page Count

14 pages

MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution

Reads tiny facial movements to understand emotions.

Technical Abstract

Micro-Expression Recognition via Fine-Grained Dynamic Perception

MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception

Rethinking Key-frame-based Micro-expression Recognition: A Robust and Accurate Framework Against Key-frame Errors