Score: 0

Seeing Further on the Shoulders of Giants: Knowledge Inheritance for Vision Foundation Models

Published: August 20, 2025 | arXiv ID: 2508.14707v1

By: Jiabo Huang, Chen Chen, Lingjuan Lyu

Potential Business Impact:

Combines old AI to make new, smarter AI.

Business Areas:

Image Recognition Data and Analytics, Software

Vision foundation models (VFMs) are predominantly developed using data-centric methods. These methods require training on vast amounts of data usually with high-quality labels, which poses a bottleneck for most institutions that lack both large-scale data and high-end GPUs. On the other hand, many open-source vision models have been pretrained on domain-specific data, enabling them to distill and represent core knowledge in a form that is transferable across diverse applications. Even though these models are highly valuable assets, they remain largely under-explored in empowering the development of a general-purpose VFM. In this paper, we presents a new model-driven approach for training VFMs through joint knowledge transfer and preservation. Our method unifies multiple pre-trained teacher models in a shared latent space to mitigate the ``imbalanced transfer'' issue caused by their distributional gaps. Besides, we introduce a knowledge preservation strategy to take a general-purpose teacher as a knowledge base for integrating knowledge from the remaining purpose-specific teachers using an adapter module. By unifying and aggregating existing models, we build a powerful VFM to inherit teachers' expertise without needing to train on a large amount of labeled data. Our model not only provides generalizable visual features, but also inherently supports multiple downstream tasks. Extensive experiments demonstrate that our VFM outperforms existing data-centric models across four fundamental vision tasks, including image classification, object detection, semantic and instance segmentation.

Seeing Further on the Shoulders of Giants: Knowledge Inheritance for Vision Foundation Models

CV and Pattern Recognition

Combines old models to make new smart vision.

20 Aug 2025 0

90%

An Investigation of Visual Foundation Models Robustness

CV and Pattern Recognition

Makes computer vision work better in bad conditions.

22 Aug 2025 1

90%

Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation

CV and Pattern Recognition

Teaches computers to see diseases in X-rays.

10 Mar 2025 0

View PDF Login to Bookmark

Page Count

9 pages

Seeing Further on the Shoulders of Giants: Knowledge Inheritance for Vision Foundation Models

Combines old AI to make new, smarter AI.

Technical Abstract

Seeing Further on the Shoulders of Giants: Knowledge Inheritance for Vision Foundation Models

An Investigation of Visual Foundation Models Robustness

Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation