amc: The Automated Mission Classifier for Telescope Bibliographies
By: John F. Wu , Joshua E. G. Peek , Sophie J. Miller and more
Telescope bibliographies record the pulse of astronomy research by capturing publication statistics and citation metrics for telescope facilities. Robust and scalable bibliographies ensure that we can measure the scientific impact of our facilities and archives. However, the growing rate of publications threatens to outpace our ability to manually label astronomical literature. We therefore present the Automated Mission Classifier (amc), a tool that uses large language models (LLMs) to identify and categorize telescope references by processing large quantities of paper text. A modified version of amc performs well on the TRACS Kaggle challenge, achieving a macro $F_1$ score of 0.84 on the held-out test set. amc is valuable for other telescopes beyond TRACS; we developed the initial software for identifying papers that featured scientific results by NASA missions. Additionally, we investigate how amc can also be used to interrogate historical datasets and surface potential label errors. Our work demonstrates that LLM-based applications offer powerful and scalable assistance for library sciences.
Similar Papers
Multi-Agent Taskforce Collaboration: Self-Correction of Compounding Errors in Long-Form Literature Review Generation
Computational Engineering, Finance, and Science
Helps computers write accurate science reports.
Zero-shot data citation function classification using transformer-based large language models (LLMs)
Machine Learning (CS)
Helps understand how science papers use data.
Textual interpretation of transient image classifications from large language models
Instrumentation and Methods for Astrophysics
Helps find real space explosions in telescope pictures.