Assessing Color Vision Test in Large Vision-language Models
By: Hongfei Ye, Bin Chen, Wenxi Liu and more
Potential Business Impact:
Teaches computers to see and understand colors.
With the widespread adoption of large vision-language models, the capacity for color vision in these models is crucial. However, the color vision abilities of large vision-language models have not yet been thoroughly explored. To address this gap, we define a color vision testing task for large vision-language models and construct a dataset that covers multiple categories of test questions and tasks of varying difficulty levels (a sample of the data is shown in an anonymous GitHub repository: https://anonymous.4open.science/r/color-vision-test-dataset-3BCD). Furthermore, we analyze the types of errors made by large vision-language models and propose fine-tuning strategies to enhance their performance in color vision tests.
Similar Papers
Diagnosing Vision Language Models' Perception by Leveraging Human Methods for Color Vision Deficiencies
CV and Pattern Recognition
AI can't see colors like people with color blindness.
ColorBlindnessEval: Can Vision-Language Models Pass Color Blindness Tests?
CV and Pattern Recognition
Tests if AI can see numbers like colorblind people.
ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness
CV and Pattern Recognition
Tests if computers understand colors like people.