SemEval-2025 Task 5: LLMs4Subjects -- LLM-based Automated Subject Tagging for a National Technical Library's Open-Access Catalog

Published: April 9, 2025 | arXiv ID: 2504.07199v3

By: Jennifer D'Souza, Sameer Sadruddin, Holger Israel and more

Potential Business Impact:

Helps libraries sort science papers automatically.

Business Areas:
Semantic Web Internet Services

We present SemEval-2025 Task 5: LLMs4Subjects, a shared task on automated subject tagging for scientific and technical records in English and German using the GND taxonomy. Participants developed LLM-based systems to recommend top-k subjects, evaluated through quantitative metrics (precision, recall, F1-score) and qualitative assessments by subject specialists. Results highlight the effectiveness of LLM ensembles, synthetic data generation, and multilingual processing, offering insights into applying LLMs for digital library classification.
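The quantitative metrics named in the abstract can be illustrated for a single record with a ranked top-k list. This is a minimal sketch of one common definition of precision, recall, and F1 at a cutoff k, not the task's official scorer: the function name, the subject identifiers, and the choice to divide precision by k are all assumptions made for illustration, and the abstract does not state which k values or averaging scheme the organizers used.

```python
from typing import Sequence, Set, Tuple


def precision_recall_f1_at_k(
    ranked: Sequence[str], gold: Set[str], k: int
) -> Tuple[float, float, float]:
    """Return (precision@k, recall@k, F1@k) for one record.

    `ranked` is the system's subject list, best first, without duplicates.
    `gold` is the set of subjects assigned by human indexers.
    Assumption: precision is divided by k, not by the number of returned items.
    """
    top_k = list(ranked[:k])
    # Count recommended subjects that also appear in the gold annotation.
    hits = sum(1 for subject in top_k if subject in gold)
    precision = hits / k if k > 0 else 0.0
    recall = hits / len(gold) if gold else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom > 0 else 0.0
    return precision, recall, f1


# Hypothetical subject identifiers, used only to exercise the function.
ranked_subjects = ["gnd:A", "gnd:B", "gnd:C", "gnd:D", "gnd:E"]
gold_subjects = {"gnd:B", "gnd:E", "gnd:Z"}

p, r, f = precision_recall_f1_at_k(ranked_subjects, gold_subjects, k=4)
print(f"P@4={p:.3f} R@4={r:.3f} F1@4={f:.3f}")
```

With k=4 only one of the four recommended subjects is in the gold set, so precision is 1/4 and recall is 1/3. In a full evaluation these per-record values would typically be averaged over all records, possibly at several cutoffs; that aggregation is left out here because the abstract does not specify it.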

Page Count
14 pages

Category
Computer Science:
Computation and Language