LLM-as-classifier: Semi-Supervised, Iterative Framework for Hierarchical Text Classification using Large Language Models
By: Doohee You, Andy Parisi, Zach Vander Velden, and more
Potential Business Impact:
Makes AI programs sort text into categories more reliably.
The advent of Large Language Models (LLMs) has provided unprecedented capabilities for analyzing unstructured text data. However, deploying these models as reliable, robust, and scalable classifiers in production environments presents significant methodological challenges. Standard fine-tuning approaches can be resource-intensive and often struggle with the dynamic data distributions common in real-world industry settings. In this paper, we propose a comprehensive, semi-supervised framework that leverages the zero- and few-shot capabilities of LLMs to build hierarchical text classifiers, offering a solution to these industry-wide challenges. Our methodology emphasizes an iterative, human-in-the-loop process that begins with domain knowledge elicitation and progresses through prompt refinement, hierarchical expansion, and multi-faceted validation. We introduce techniques for assessing and mitigating sequence-based biases and outline a protocol for continuous monitoring and adaptation. This framework is designed to bridge the gap between the raw power of LLMs and the practical need for accurate, interpretable, and maintainable classification systems in industry applications.
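As a rough illustration of two ideas at the heart of this abstract, the sketch below shows (a) two-stage hierarchical classification, where an LLM first picks a top-level category and then a child of that category, and (b) a simple probe for sequence-based bias that shuffles label order between calls. The taxonomy, the call_llm stand-in, and the prompt wording are illustrative assumptions, not the paper's actual implementation.

```python
import random

# Hypothetical two-level label hierarchy: the paper does not publish its
# taxonomy, so these categories are illustrative only.
HIERARCHY = {
    "Billing": ["Refund request", "Invoice error"],
    "Technical": ["Login failure", "Crash report"],
}

def call_llm(prompt: str) -> str:
    """Stand-in for a real chat-completion call; swap in your own client.
    For demonstration it naively echoes the first listed category."""
    line = next(l for l in prompt.splitlines() if l.startswith("Categories:"))
    return line.split(": ", 1)[1].split(", ")[0]

def classify_level(text: str, labels: list[str]) -> str:
    # Shuffle label order on each call to probe sequence-based bias: if
    # predictions flip with label order, the prompt is position-sensitive
    # and the ordering must be controlled or averaged over.
    shuffled = random.sample(labels, k=len(labels))
    prompt = (
        "Classify the text into exactly one of these categories.\n"
        f"Categories: {', '.join(shuffled)}\n"
        f"Text: {text}\n"
        "Answer with the category name only."
    )
    answer = call_llm(prompt).strip()
    # Fall back to the first canonical label if the model answers off-list.
    return answer if answer in labels else labels[0]

def classify_hierarchical(text: str) -> tuple[str, str]:
    # Stage 1: choose a top-level class; stage 2: choose one of its children.
    top = classify_level(text, list(HIERARCHY))
    sub = classify_level(text, HIERARCHY[top])
    return top, sub

if __name__ == "__main__":
    print(classify_hierarchical("I was charged twice for my subscription."))
```

In practice, the per-level prompts and the hierarchy itself would be refined iteratively with human review, as the abstract describes; the shuffling shown here is one cheap way to surface position sensitivity before deploying such a classifier.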
Similar Papers
Small sample-based adaptive text classification through iterative and contrastive description refinement
Machine Learning (CS)
Teaches computers to sort text without new training.
LLM-MemCluster: Empowering Large Language Models with Dynamic Memory for Text Clustering
Computation and Language
Lets computers group texts by meaning better.
LLM driven Text-to-Table Generation through Sub-Tasks Guidance and Iterative Refinement
Computation and Language
Helps computers turn messy text into organized tables.