Score: 1

MixRI: Mixing Features of Reference Images for Novel Object Pose Estimation

Published: January 11, 2026 | arXiv ID: 2601.06883v1

By: Xinhang Liu , Jiawei Shi , Zheng Dang and more

Potential Business Impact:

Lets robots see and grab new things instantly.

Business Areas:

Image Recognition Data and Analytics, Software

We present MixRI, a lightweight network that solves the CAD-based novel object pose estimation problem in RGB images. It can be instantly applied to a novel object at test time without finetuning. We design our network to meet the demands of real-world applications, emphasizing reduced memory requirements and fast inference time. Unlike existing works that utilize many reference images and have large network parameters, we directly match points based on the multi-view information between the query and reference images with a lightweight network. Thanks to our reference image fusion strategy, we significantly decrease the number of reference images, thus decreasing the time needed to process these images and the memory required to store them. Furthermore, with our lightweight network, our method requires less inference time. Though with fewer reference images, experiments on seven core datasets in the BOP challenge show that our method achieves comparable results with other methods that require more reference images and larger network parameters.

AlignPose: Generalizable 6D Pose Estimation via Multi-view Feature-metric Alignment

CV and Pattern Recognition

Helps robots see objects from many angles.

23 Dec 2025 0

86%

Enhancing Rotation-Invariant 3D Learning with Global Pose Awareness and Attention Mechanisms

CV and Pattern Recognition

Helps computers tell apart similar 3D shapes.

11 Nov 2025 2

86%

Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes

CV and Pattern Recognition

Lets computers see objects in 3D from photos.

4 Aug 2025 2

View PDF Login to Bookmark

Country of Origin

🇨🇳 🇨🇭 China, Switzerland

Page Count

18 pages

MixRI: Mixing Features of Reference Images for Novel Object Pose Estimation

Lets robots see and grab new things instantly.

Technical Abstract

AlignPose: Generalizable 6D Pose Estimation via Multi-view Feature-metric Alignment

Enhancing Rotation-Invariant 3D Learning with Global Pose Awareness and Attention Mechanisms

Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes