Automated Archival Descriptions with Federated Intelligence of LLMs
By: Jinghua Groppe , Andreas Marquet , Annabel Walz and more
Potential Business Impact:
AI writes labels for old documents automatically.
Enforcing archival standards requires specialized expertise, and manually creating metadata descriptions for archival materials is a tedious and error-prone task. This work aims at exploring the potential of agentic AI and large language models (LLMs) in addressing the challenges of implementing a standardized archival description process. To this end, we introduce an agentic AI-driven system for automated generation of high-quality metadata descriptions of archival materials. We develop a federated optimization approach that unites the intelligence of multiple LLMs to construct optimal archival metadata. We also suggest methods to overcome the challenges associated with using LLMs for consistent metadata generation. To evaluate the feasibility and effectiveness of our techniques, we conducted extensive experiments using a real-world dataset of archival materials, which covers a variety of document types and data formats. The evaluation results demonstrate the feasibility of our techniques and highlight the superior performance of the federated optimization approach compared to single-model solutions in metadata quality and reliability.
Similar Papers
Leveraging Retrieval Augmented Generative LLMs For Automated Metadata Description Generation to Enhance Data Catalogs
Information Retrieval
Helps find data faster by writing better descriptions.
Leveraging Machine Learning and Large Language Models for Automated Image Clustering and Description in Legal Discovery
Information Retrieval
Organizes huge picture piles for faster searching.
LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources
Artificial Intelligence
Lets computers analyze private data using plain English.