Score: 0

Based on Data Balancing and Model Improvement for Multi-Label Sentiment Classification Performance Enhancement

Published: November 18, 2025 | arXiv ID: 2511.14073v2

By: Zijin Su , Huanzhu Lyu , Yuren Niu and more

Potential Business Impact:

Helps computers understand many feelings in one message.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Multi-label sentiment classification plays a vital role in natural language processing by detecting multiple emotions within a single text. However, existing datasets like GoEmotions often suffer from severe class imbalance, which hampers model performance, especially for underrepresented emotions. To address this, we constructed a balanced multi-label sentiment dataset by integrating the original GoEmotions data, emotion-labeled samples from Sentiment140 using a RoBERTa-base-GoEmotions model, and manually annotated texts generated by GPT-4 mini. Our data balancing strategy ensured an even distribution across 28 emotion categories. Based on this dataset, we developed an enhanced multi-label classification model that combines pre-trained FastText embeddings, convolutional layers for local feature extraction, bidirectional LSTM for contextual learning, and an attention mechanism to highlight sentiment-relevant words. A sigmoid-activated output layer enables multi-label prediction, and mixed precision training improves computational efficiency. Experimental results demonstrate significant improvements in accuracy, precision, recall, F1-score, and AUC compared to models trained on imbalanced data, highlighting the effectiveness of our approach.

Based on Data Balancing and Model Improvement for Multi-Label Sentiment Classification Performance Enhancement

Computation and Language

Helps computers understand many feelings in one message.

18 Nov 2025 0

90%

Optimizing Small Transformer-Based Language Models for Multi-Label Sentiment Analysis in Short Texts

Computation and Language

Makes computers understand feelings in short messages.

5 Sep 2025 1

89%

Multimodal Emotion Recognition and Sentiment Analysis in Multi-Party Conversation Contexts

CV and Pattern Recognition

Lets computers understand feelings from talking, faces, and videos.

9 Mar 2025 0

View PDF Login to Bookmark

Page Count

9 pages

Based on Data Balancing and Model Improvement for Multi-Label Sentiment Classification Performance Enhancement

Helps computers understand many feelings in one message.

Technical Abstract

Based on Data Balancing and Model Improvement for Multi-Label Sentiment Classification Performance Enhancement

Optimizing Small Transformer-Based Language Models for Multi-Label Sentiment Analysis in Short Texts

Multimodal Emotion Recognition and Sentiment Analysis in Multi-Party Conversation Contexts