Efficient Label Refinement for Face Parsing Under Extreme Poses Using 3D Gaussian Splatting
By: Ankit Gahlawat, Anirban Mukherjee, Dinesh Babu Jayagopi
Potential Business Impact:
Makes computers understand faces from any angle.
Accurate face parsing under extreme viewing angles remains a significant challenge due to limited labeled data in such poses. Manual annotation is costly and often impractical at scale. We propose a novel label refinement pipeline that leverages 3D Gaussian Splatting (3DGS) to generate accurate segmentation masks from noisy multiview predictions. By jointly fitting two 3DGS models, one to RGB images and one to their initial segmentation maps, our method enforces multiview consistency through shared geometry, enabling the synthesis of pose-diverse training data with only minimal post-processing. Fine-tuning a face parsing model on this refined dataset significantly improves accuracy on challenging head poses, while maintaining strong performance on standard views. Extensive experiments, including human evaluations, demonstrate that our approach achieves superior results compared to state-of-the-art methods, despite requiring no ground-truth 3D annotations and using only a small set of initial images. Our method offers a scalable and effective solution for improving face parsing robustness in real-world settings.
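The shared-geometry idea in the abstract can be illustrated with a toy example. The sketch below is not the authors' implementation and does not use any real 3DGS library. It assumes a single camera ray crossing a few primitives, and it assumes that "shared geometry" means the RGB attributes and the label attributes are composited with the same depth order and opacities. The names `blend_weights` and `render_pixel` and all numeric values are hypothetical.

```python
import numpy as np


def blend_weights(opacity_sorted):
    """Front-to-back alpha compositing weights: w_i = a_i * prod_{j<i} (1 - a_j)."""
    # Transmittance reaching primitive i is the product of (1 - a_j) over all nearer primitives.
    transmittance = np.concatenate(([1.0], np.cumprod(1.0 - opacity_sorted)[:-1]))
    return opacity_sorted * transmittance


def render_pixel(depths, opacities, rgb, label_scores):
    """Composite RGB and label attributes along one ray with the SAME weights.

    Assumption for illustration: geometry (depth order and opacity) is shared,
    while each primitive carries two attribute vectors, a color and per-class scores.
    """
    order = np.argsort(depths)            # nearest primitive first
    w = blend_weights(opacities[order])   # weights depend on geometry only
    color = w @ rgb[order]                # rendered RGB value
    scores = w @ label_scores[order]      # rendered per-class scores
    return color, int(np.argmax(scores))


# Three hypothetical primitives on one ray; the nearest one (depth 1.0) has class 0.
depths = np.array([2.0, 1.0, 3.0])
opacities = np.array([0.5, 0.8, 0.9])
rgb = np.array([[0.0, 1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]])
label_scores = np.array([[0.0, 1.0], [1.0, 0.0], [0.0, 1.0]])

color, label = render_pixel(depths, opacities, rgb, label_scores)
```

Because the weights come from geometry alone, a label that disagrees with the geometry in one view is pulled toward what the other views support, which is the intuition behind refining noisy per-view masks this way.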