Context-aware Multimodal AI Reveals Hidden Pathways in Five Centuries of Art Evolution
By: Jin Kim , Byunghwee Lee , Taekho You and more
Potential Business Impact:
AI finds art's meaning from pictures and history.
The rise of multimodal generative AI is transforming the intersection of technology and art, offering deeper insights into large-scale artwork. Although its creative capabilities have been widely explored, its potential to represent artwork in latent spaces remains underexamined. We use cutting-edge generative AI, specifically Stable Diffusion, to analyze 500 years of Western paintings by extracting two types of latent information with the model: formal aspects (e.g., colors) and contextual aspects (e.g., subject). Our findings reveal that contextual information differentiates between artistic periods, styles, and individual artists more successfully than formal elements. Additionally, using contextual keywords extracted from paintings, we show how artistic expression evolves alongside societal changes. Our generative experiment, infusing prospective contexts into historical artworks, successfully reproduces the evolutionary trajectory of artworks, highlighting the significance of mutual interaction between society and art. This study demonstrates how multimodal AI expands traditional formal analysis by integrating temporal, cultural, and historical contexts.
Similar Papers
Unraveling Hidden Representations: A Multi-Modal Layer Analysis for Better Synthetic Content Forensics
Artificial Intelligence
Spots fake pictures and sounds fast.
Speaking images. A novel framework for the automated self-description of artworks
CV and Pattern Recognition
Makes old art explain itself in videos.
Ways of Seeing, and Selling, AI Art
Computers and Society
Helps AI art sell by making it look like real art.