FIELDS: Face reconstruction with accurate Inference of Expression using Learning with Direct Supervision
By: Chen Ling, Henglin Shi, Hedvig Kjellström
Potential Business Impact:
Makes computer faces show real feelings.
Facial expressions convey the bulk of emotional information in human communication, yet existing 3D face reconstruction methods often miss subtle affective details due to reliance on 2D supervision and lack of 3D ground truth. We propose FIELDS (Face reconstruction with accurate Inference of Expression using Learning with Direct Supervision) to address these limitations by extending self-supervised 2D image consistency cues with direct 3D expression parameter supervision and an auxiliary emotion recognition branch. Our encoder is guided by authentic expression parameters from spontaneous 4D facial scans, while an intensity-aware emotion loss encourages the 3D expression parameters to capture genuine emotion content without exaggeration. This dual-supervision strategy bridges the 2D/3D domain gap and mitigates expression-intensity bias, yielding high-fidelity 3D reconstructions that preserve subtle emotional cues. From a single image, FIELDS produces emotion-rich face models with highly realistic expressions, significantly improving in-the-wild facial expression recognition performance without sacrificing naturalness.
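The dual-supervision idea described in the abstract can be illustrated with a small sketch of a combined training objective. Everything below is an assumption made for illustration, not the authors' formulation: the function names, the loss weights, the use of mean squared error for the 3D expression parameters, and in particular the choice to scale a cross-entropy emotion term by an annotated intensity in [0, 1]. The abstract names these three ingredients but does not specify their form.

```python
import math
from dataclasses import dataclass
from typing import Sequence


@dataclass
class LossWeights:
    # Hypothetical weights; the abstract does not give any values.
    image: float = 1.0
    expression: float = 1.0
    emotion: float = 0.5


def mse(pred: Sequence[float], target: Sequence[float]) -> float:
    """Mean squared error between two equal-length parameter vectors."""
    if len(pred) != len(target):
        raise ValueError("vectors must have the same length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def cross_entropy(probs: Sequence[float], label: int, eps: float = 1e-12) -> float:
    """Negative log-probability assigned to the true emotion class."""
    return -math.log(max(probs[label], eps))


def combined_loss(
    image_loss: float,
    pred_expr: Sequence[float],
    gt_expr: Sequence[float],
    emotion_probs: Sequence[float],
    emotion_label: int,
    intensity: float,
    weights: LossWeights = LossWeights(),
) -> float:
    """Sum of three terms (illustrative only):

    1. image_loss: a precomputed self-supervised 2D image consistency loss.
    2. direct 3D supervision: MSE between predicted and scan-derived
       expression parameters (assumed form).
    3. emotion term: cross-entropy scaled by intensity, so that
       low-intensity faces contribute less pressure toward strongly
       expressive predictions (assumed form of "intensity-aware").
    """
    expr_loss = mse(pred_expr, gt_expr)
    emo_loss = intensity * cross_entropy(emotion_probs, emotion_label)
    return (
        weights.image * image_loss
        + weights.expression * expr_loss
        + weights.emotion * emo_loss
    )
```

In this sketch a sample with intensity 0 contributes nothing through the emotion term, so only the 2D consistency and 3D parameter terms drive the update. That is one plausible way to discourage exaggeration; the actual loss used by FIELDS may differ.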