Score: 0

Understanding Cross-Model Perceptual Invariances Through Ensemble Metamers

Published: April 2, 2025 | arXiv ID: 2504.01739v2

By: Lukas Boehm , Jonas Leo Mueller , Christoffer Loeffler and more

Potential Business Impact:

Makes AI see the world like humans.

Business Areas:
Visual Search Internet Services

Understanding the perceptual invariances of artificial neural networks is essential for improving explainability and aligning models with human vision. Metamers - stimuli that are physically distinct yet produce identical neural activations - serve as a valuable tool for investigating these invariances. We introduce a novel approach to metamer generation by leveraging ensembles of artificial neural networks, capturing shared representational subspaces across diverse architectures, including convolutional neural networks and vision transformers. To characterize the properties of the generated metamers, we employ a suite of image-based metrics that assess factors such as semantic fidelity and naturalness. Our findings show that convolutional neural networks generate more recognizable and human-like metamers, while vision transformers produce realistic but less transferable metamers, highlighting the impact of architectural biases on representational invariances.

Country of Origin
🇩🇪 Germany

Page Count
23 pages

Category
Computer Science:
CV and Pattern Recognition