Score: 1

Leveraging Large Language Models for Automated Definition Extraction with TaxoMatic A Case Study on Media Bias

Published: April 1, 2025 | arXiv ID: 2504.00343v1

By: Timo Spinde , Luyang Lin , Smi Hinterreiter and more

Potential Business Impact:

Finds and explains key ideas in science papers.

Business Areas:
Text Analytics Data and Analytics, Software

This paper introduces TaxoMatic, a framework that leverages large language models to automate definition extraction from academic literature. Focusing on the media bias domain, the framework encompasses data collection, LLM-based relevance classification, and extraction of conceptual definitions. Evaluated on a dataset of 2,398 manually rated articles, the study demonstrates the frameworks effectiveness, with Claude-3-sonnet achieving the best results in both relevance classification and definition extraction. Future directions include expanding datasets and applying TaxoMatic to additional domains.

Country of Origin
🇩🇪 Germany

Repos / Data Links

Page Count
8 pages

Category
Computer Science:
Computation and Language