Retrieval-Augmented Generation for Natural Language Art Provenance Searches in the Getty Provenance Index
By: Mathew Henrickson
Potential Business Impact:
Helps researchers trace the ownership history of artworks by asking questions in plain language.
This research presents a Retrieval-Augmented Generation (RAG) framework for art provenance studies, focusing on the Getty Provenance Index. Provenance research establishes the ownership history of artworks, which is essential for verifying authenticity, supporting restitution and legal claims, and understanding the cultural and historical context of art objects. The process is complicated by fragmented, multilingual archival data that hinders efficient retrieval. Current search portals require precise metadata, limiting exploratory searches. Our method enables natural-language and multilingual searches through semantic retrieval and contextual summarization, reducing dependence on metadata structures. We assess RAG's capability to retrieve and summarize auction records using a 10,000-record sample from the Getty Provenance Index - German Sales. The results show this approach provides a scalable solution for navigating art market archives, offering a practical tool for historians and cultural heritage professionals conducting historically sensitive research.
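The retrieve-then-summarize flow described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the sample records are invented, `embed` is a toy bag-of-words stand-in for a multilingual embedding model (so it matches shared words only, not meaning across languages), and `build_prompt` only assembles the text a language model would be asked to summarize. All function names here are hypothetical.

```python
import math
from collections import Counter

# Hypothetical sample records; NOT taken from the Getty Provenance Index.
RECORDS = [
    "Landschaft mit Kühen, Öl auf Leinwand, Auktion Berlin 1935",
    "Bildnis einer Dame, Öl auf Holz, Auktion München 1937",
    "Stillleben mit Blumen, Aquarell, Auktion Berlin 1938",
]


def embed(text):
    """Toy stand-in for an embedding model: lowercase token counts."""
    tokens = text.lower().replace(",", " ").split()
    return Counter(tokens)


def cosine(a, b):
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, records, k=2):
    """Return the k records most similar to the query."""
    q = embed(query)
    ranked = sorted(records, key=lambda r: cosine(q, embed(r)), reverse=True)
    return ranked[:k]


def build_prompt(query, hits):
    """Assemble retrieved records into a prompt for a summarizing model."""
    context = "\n".join(f"- {h}" for h in hits)
    return (
        "Answer using only these auction records:\n"
        f"{context}\n\nQuestion: {query}"
    )


query = "Stillleben Aquarell Berlin"
hits = retrieve(query, RECORDS, k=2)
prompt = build_prompt(query, hits)
```

In a real system the token counts would be replaced by dense vectors from a multilingual embedding model, which is what would let an English question match German sale records.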
Similar Papers
Provenance Analysis of Archaeological Artifacts via Multimodal RAG Systems
Information Retrieval
Helps archaeologists identify ancient objects faster.
A Systematic Literature Review of Retrieval-Augmented Generation: Techniques, Metrics, and Challenges
Digital Libraries
Makes AI smarter by giving it more facts.
A Systematic Literature Review of Retrieval-Augmented Generation: Techniques, Metrics, and Challenges
Digital Libraries
Helps AI answer questions with new facts.