Auto-regressive transformation for image alignment
By: Kanggeon Lee, Soochahn Lee, Kyoung Mu Lee
Potential Business Impact:
Makes pictures match even when they're tricky.
Existing methods for image alignment struggle in cases involving feature-sparse regions, extreme scale and field-of-view differences, and large deformations, often resulting in suboptimal accuracy. Robustness to these challenges improves through iterative refinement of the transformation field while focusing on critical regions in multi-scale image representations. We thus propose Auto-Regressive Transformation (ART), a novel method that iteratively estimates the coarse-to-fine transformations within an auto-regressive framework. Leveraging hierarchical multi-scale features, our network refines the transformations using randomly sampled points at each scale. By incorporating guidance from the cross-attention layer, the model focuses on critical regions, ensuring accurate alignment even in challenging, feature-limited conditions. Extensive experiments across diverse datasets demonstrate that ART significantly outperforms state-of-the-art methods, establishing it as a powerful new method for precise image alignment with broad applicability.
Similar Papers
A Training-Free Style-aligned Image Generation with Scale-wise Autoregressive Model
CV and Pattern Recognition
Makes AI pictures match the style you want.
Personalized Text-to-Image Generation with Auto-Regressive Models
CV and Pattern Recognition
Makes AI draw pictures of *your* stuff.
Progress by Pieces: Test-Time Scaling for Autoregressive Image Generation
CV and Pattern Recognition
Makes AI pictures better and faster.