Score: 0

NepEMO: A Multi-Label Emotion and Sentiment Analysis on Nepali Reddit with Linguistic Insights and Temporal Trends

Published: December 28, 2025 | arXiv ID: 2512.22823v1

By: Sameer Sitoula , Tej Bahadur Shahi , Laxmi Prasad Bhatt and more

Potential Business Impact:

Helps computers understand feelings in Nepali posts.

Business Areas:

Social News Media and Entertainment

Social media (SM) platforms (e.g. Facebook, Twitter, and Reddit) are increasingly leveraged to share opinions and emotions, specifically during challenging events, such as natural disasters, pandemics, and political elections, and joyful occasions like festivals and celebrations. Among the SM platforms, Reddit provides a unique space for its users to anonymously express their experiences and thoughts on sensitive issues such as health and daily life. In this work, we present a novel dataset, called NepEMO, for multi-label emotion (MLE) and sentiment classification (SC) on the Nepali subreddit post. We curate and build a manually annotated dataset of 4,462 posts (January 2019- June 2025) written in English, Romanised Nepali and Devanagari script for five emotions (fear, anger, sadness, joy, and depression) and three sentiment classes (positive, negative, and neutral). We perform a detailed analysis of posts to capture linguistic insights, including emotion trends, co-occurrence of emotions, sentiment-specific n-grams, and topic modelling using Latent Dirichlet Allocation and TF-IDF keyword extraction. Finally, we compare various traditional machine learning (ML), deep learning (DL), and transformer models for MLE and SC tasks. The result shows that transformer models consistently outperform the ML and DL models for both tasks.

EmoBench-Reddit: A Hierarchical Benchmark for Evaluating the Emotional Intelligence of Multimodal Large Language Models

Computation and Language

Helps computers understand feelings in pictures and words.

14 Sep 2025 0

88%

Seeing is Not Understanding: A Benchmark on Perception-Cognition Disparities in Large Language Models

Computation and Language

Helps computers understand feelings in pictures and words.

14 Sep 2025 0

88%

When a Nation Speaks: Machine Learning and NLP in People's Sentiment Analysis During Bangladesh's 2024 Mass Uprising

Computation and Language

Helps understand people's feelings during protests.

17 Dec 2025 0

View PDF Login to Bookmark

Country of Origin

🇦🇺 Australia

Page Count

33 pages

NepEMO: A Multi-Label Emotion and Sentiment Analysis on Nepali Reddit with Linguistic Insights and Temporal Trends

Helps computers understand feelings in Nepali posts.

Technical Abstract

EmoBench-Reddit: A Hierarchical Benchmark for Evaluating the Emotional Intelligence of Multimodal Large Language Models

Seeing is Not Understanding: A Benchmark on Perception-Cognition Disparities in Large Language Models

When a Nation Speaks: Machine Learning and NLP in People's Sentiment Analysis During Bangladesh's 2024 Mass Uprising