Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models
By: Francisco Caetano , Christiaan Viviers , Peter H. N. De With and more
Potential Business Impact:
Makes one AI do many picture tasks at once.
Flow Matching has emerged as a powerful framework for learning continuous transformations between distributions, enabling high-fidelity generative modeling. This work introduces Symmetrical Flow Matching (SymmFlow), a new formulation that unifies semantic segmentation, classification, and image generation within a single model. Using a symmetric learning objective, SymmFlow models forward and reverse transformations jointly, ensuring bi-directional consistency, while preserving sufficient entropy for generative diversity. A new training objective is introduced to explicitly retain semantic information across flows, featuring efficient sampling while preserving semantic structure, allowing for one-step segmentation and classification without iterative refinement. Unlike previous approaches that impose strict one-to-one mapping between masks and images, SymmFlow generalizes to flexible conditioning, supporting both pixel-level and image-level class labels. Experimental results on various benchmarks demonstrate that SymmFlow achieves state-of-the-art performance on semantic image synthesis, obtaining FID scores of 11.9 on CelebAMask-HQ and 7.0 on COCO-Stuff with only 25 inference steps. Additionally, it delivers competitive results on semantic segmentation and shows promising capabilities in classification tasks. The code will be publicly available.
Similar Papers
MedSymmFlow: Bridging Generative Modeling and Classification in Medical Imaging through Symmetrical Flow Matching
CV and Pattern Recognition
Helps doctors see how sure they are about medical pictures.
Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality
CV and Pattern Recognition
Makes AI create better medical pictures faster.
Iterative Flow Matching -- Path Correction and Gradual Refinement for Enhanced Generative Modeling
Machine Learning (CS)
Makes computer pictures look more real.