IAP: Improving Continual Learning of Vision-Language Models via Instance-Aware Prompting
By: Hao Fu , Hanbin Zhao , Jiahua Dong and more
Potential Business Impact:
Helps AI learn new things without forgetting old ones.
Recent pre-trained vision-language models (PT-VLMs) often face a Multi-Domain Task Incremental Learning (MTIL) scenario in practice, where several classes and domains of multi-modal tasks are incrementally arrived. Without access to previously seen tasks and unseen tasks, memory-constrained MTIL suffers from forward and backward forgetting. To alleviate the above challenges, parameter-efficient fine-tuning techniques (PEFT), such as prompt tuning, are employed to adapt the PT-VLM to the diverse incrementally learned tasks. To achieve effective new task adaptation, existing methods only consider the effect of PEFT strategy selection, but neglect the influence of PEFT parameter setting (e.g., prompting). In this paper, we tackle the challenge of optimizing prompt designs for diverse tasks in MTIL and propose an Instance-Aware Prompting (IAP) framework. Specifically, our Instance-Aware Gated Prompting (IA-GP) strategy enhances adaptation to new tasks while mitigating forgetting by adaptively assigning prompts across transformer layers at the instance level. Our Instance-Aware Class-Distribution-Driven Prompting (IA-CDDP) improves the task adaptation process by determining an accurate task-label-related confidence score for each instance. Experimental evaluations across 11 datasets, using three performance metrics, demonstrate the effectiveness of our proposed method. The source codes are available at https://github.com/FerdinandZJU/IAP.
Similar Papers
Prompt-based Adaptation in Large-scale Vision Models: A Survey
CV and Pattern Recognition
Helps computers learn new things with less data.
All You Need is One: Capsule Prompt Tuning with a Single Vector
Computation and Language
Makes AI understand tasks better with less effort.
Towards Generalizable AI-Generated Image Detection via Image-Adaptive Prompt Learning
CV and Pattern Recognition
Finds fake pictures from new AI tools.