Multilingual Sentiment Analysis of Summarized Texts: A Cross-Language Study of Text Shortening Effects
By: Mikhail Krasitskii, Grigori Sidorov, Olga Kolesnikova and more
Potential Business Impact:
Extractive summaries keep opinions clearer than abstractive ones, especially in morphologically complex languages.
Summarization significantly impacts sentiment analysis across languages with diverse morphologies. This study examines the effects of extractive and abstractive summarization on sentiment classification in English, German, French, Spanish, Italian, Finnish, Hungarian, and Arabic. We assess sentiment shifts after summarization using multilingual transformers (mBERT, XLM-RoBERTa, T5, and BART) and language-specific models (FinBERT, AraBERT). Results show that extractive summarization better preserves sentiment, especially in morphologically complex languages, while abstractive summarization improves readability but introduces sentiment distortion that reduces classification accuracy. Languages with rich inflectional morphology, such as Finnish, Hungarian, and Arabic, experience greater accuracy drops than English or German. The findings underscore the need for language-specific adaptations in sentiment analysis, and we propose a hybrid summarization approach that balances readability and sentiment preservation. These insights benefit multilingual sentiment applications, including social media monitoring, market analysis, and cross-lingual opinion mining.
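The abstract does not spell out how sentiment shift is measured, so the following is only a minimal sketch of one plausible way to quantify it. It assumes a classifier has already produced labels for each full text and for its summary; the function names (`accuracy`, `sentiment_shift`), the label strings, and the two reported quantities (accuracy drop and label flip rate) are illustrative assumptions, not the authors' method.

```python
from typing import Dict, Sequence


def accuracy(gold: Sequence[str], pred: Sequence[str]) -> float:
    """Fraction of predictions that match the gold labels."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    if not gold:
        raise ValueError("at least one example is required")
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def sentiment_shift(
    gold: Sequence[str],
    pred_full: Sequence[str],
    pred_summary: Sequence[str],
) -> Dict[str, float]:
    """Compare sentiment predictions on full texts and on their summaries.

    Hypothetical metrics (not taken from the paper):
      accuracy_drop: accuracy on full texts minus accuracy on summaries
      flip_rate: share of examples whose predicted label changed
    """
    acc_full = accuracy(gold, pred_full)
    acc_summary = accuracy(gold, pred_summary)
    flips = sum(a != b for a, b in zip(pred_full, pred_summary))
    return {
        "accuracy_full": acc_full,
        "accuracy_summary": acc_summary,
        "accuracy_drop": acc_full - acc_summary,
        "flip_rate": flips / len(gold),
    }


# Toy labels for four documents in one language (made up for illustration).
gold = ["pos", "neg", "neu", "pos"]
pred_full = ["pos", "neg", "neu", "neg"]
pred_summary = ["pos", "neu", "neu", "neg"]

result = sentiment_shift(gold, pred_full, pred_summary)
print(result)
```

Running the same comparison per language and per summarization type (extractive versus abstractive) would give a table of accuracy drops that could be compared across languages, which is the kind of contrast the abstract describes.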
Similar Papers
Hybrid Extractive Abstractive Summarization for Multilingual Sentiment Analysis
Computation and Language
Understands feelings in many languages faster.
Comparative Approaches to Sentiment Analysis Using Datasets in Major European and Arabic Languages
Computation and Language
Helps computers understand feelings in many languages.
Sentiment-Aware Extractive and Abstractive Summarization for Unstructured Text Mining
Computation and Language
Summaries understand feelings in online posts.