Holistic Evaluations of Topic Models
By: Thomas Compton
Potential Business Impact:
Helps understand big groups of words better.
Topic models are attracting growing commercial and academic interest for their ability to summarize large volumes of unstructured text. As unsupervised machine learning methods, they enable researchers to explore data and help general users understand key themes in large text collections. However, they risk becoming a 'black box', where users input data and accept the output as an accurate summary without scrutiny. This article evaluates topic models from a database perspective, drawing insights from 1140 BERTopic model runs. The goal is to identify trade-offs in optimizing model parameters and to reflect on what these findings mean for the interpretation and responsible use of topic models.
Similar Papers
Experimental Evaluation of Dynamic Topic Modeling Algorithms
Information Retrieval
Tracks how online topics change over time.
Bridging the Evaluation Gap: Leveraging Large Language Models for Topic Model Evaluation
Computation and Language
Helps find science papers by understanding changing topics.
Toward Purpose-oriented Topic Model Evaluation enabled by Large Language Models
Computation and Language
Helps computers understand changing information better.