Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face Personalization
By: Lianyu Pang, Ji Zhou, Qiping Wang, and more
Potential Business Impact:
Lets AI generate images of a specific person's face that stay faithful to their identity while still following text prompts.
Tuning-free face personalization methods have developed along two distinct paradigms: text embedding approaches that map facial features into the text embedding space, and adapter-based methods that inject features through auxiliary cross-attention layers. While both paradigms have shown promise, existing methods struggle to simultaneously achieve high identity fidelity and flexible text controllability. We introduce UniID, a unified tuning-free framework that synergistically integrates both paradigms. Our key insight is that when merging these approaches, they should mutually reinforce only identity-relevant information while preserving the original diffusion prior for non-identity attributes. We realize this through a principled training-inference strategy: during training, we employ an identity-focused learning scheme that guides both branches to capture identity features exclusively; at inference, we introduce a normalized rescaling mechanism that recovers the text controllability of the base diffusion model while enabling complementary identity signals to enhance each other. This principled design enables UniID to achieve high-fidelity face personalization with flexible text controllability. Extensive experiments against six state-of-the-art methods demonstrate that UniID achieves superior performance in both identity preservation and text controllability. Code will be available at https://github.com/lyuPang/UniID
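The abstract does not spell out the normalized rescaling mechanism, so the following is only a toy sketch of one plausible reading, not the UniID formulation. It assumes an adapter-style setup in which a text cross-attention branch and an identity cross-attention branch each produce a per-token output, the two are summed, and the sum is rescaled so that each token keeps the norm the text branch alone would have produced. All names (`cross_attention`, `merge_with_norm_rescaling`, `id_scale`) and all dimensions are hypothetical.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def norm(v):
    return math.sqrt(dot(v, v))


def softmax(scores):
    # Subtract the max for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Single-head scaled dot-product attention over lists of vectors."""
    scale = math.sqrt(len(queries[0]))
    outputs = []
    for q in queries:
        weights = softmax([dot(q, k) / scale for k in keys])
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, values))
             for j in range(len(values[0]))]
        )
    return outputs


def merge_with_norm_rescaling(text_out, id_out, id_scale=1.0, eps=1e-8):
    """Hypothetical merge: add the identity branch, then rescale each token
    so its norm matches the text-only output (an assumption, not the paper's
    stated rule)."""
    merged = []
    for t, i in zip(text_out, id_out):
        summed = [a + id_scale * b for a, b in zip(t, i)]
        factor = norm(t) / (norm(summed) + eps)
        merged.append([factor * x for x in summed])
    return merged


def rand_matrix(rows, cols, rng):
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


# Toy dimensions chosen only for illustration.
rng = random.Random(0)
queries = rand_matrix(16, 8, rng)      # image latent tokens
text_tokens = rand_matrix(5, 8, rng)   # text-embedding tokens
id_tokens = rand_matrix(4, 8, rng)     # identity tokens from a face encoder

text_out = cross_attention(queries, text_tokens, text_tokens)
id_out = cross_attention(queries, id_tokens, id_tokens)
merged = merge_with_norm_rescaling(text_out, id_out, id_scale=0.8)
```

In this sketch the identity branch can change the direction of each token's feature vector but not its magnitude, which is one simple way to keep the merged activations on the scale the base model expects. Setting `id_scale=0` returns the text-only output.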
Similar Papers
A Training-Free Approach for Multi-ID Customization via Attention Adjustment and Spatial Control
CV and Pattern Recognition
Creates realistic pictures of many people together.
Beyond Inference Intervention: Identity-Decoupled Diffusion for Face Anonymization
CV and Pattern Recognition
Makes faces look different but still real.
PSTF-AttControl: Per-Subject-Tuning-Free Personalized Image Generation with Controllable Face Attributes
CV and Pattern Recognition
Changes faces in pictures precisely and easily.