Leveraging Large Language Models for Automated Definition Extraction with TaxoMatic A Case Study on Media Bias
By: Timo Spinde , Luyang Lin , Smi Hinterreiter and more
Potential Business Impact:
Finds and explains key ideas in science papers.
This paper introduces TaxoMatic, a framework that leverages large language models to automate definition extraction from academic literature. Focusing on the media bias domain, the framework encompasses data collection, LLM-based relevance classification, and extraction of conceptual definitions. Evaluated on a dataset of 2,398 manually rated articles, the study demonstrates the frameworks effectiveness, with Claude-3-sonnet achieving the best results in both relevance classification and definition extraction. Future directions include expanding datasets and applying TaxoMatic to additional domains.
Similar Papers
LLMTaxo: Leveraging Large Language Models for Constructing Taxonomy of Factual Claims from Social Media
Computation and Language
Organizes social media facts into easy-to-understand topics.
Transforming Expert Knowledge into Scalable Ontology via Large Language Models
Artificial Intelligence
Helps computers understand and connect different ideas.
Unraveling Media Perspectives: A Comprehensive Methodology Combining Large Language Models, Topic Modeling, Sentiment Analysis, and Ontology Learning to Analyse Media Bias
Artificial Intelligence
Finds unfairness in news stories.