LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
By: Weichu Liu, Jing Xiong, Yuxuan Hu, and more
Potential Business Impact:
Helps computers understand feelings in long talks.
Large language models (LLMs) have made significant progress in Emotional Intelligence (EI) and long-context understanding. However, existing benchmarks tend to overlook certain aspects of EI in long-context scenarios, especially under realistic, practical settings where interactions are lengthy, diverse, and often noisy. To move towards such realistic settings, we present LongEmotion, a benchmark specifically designed for long-context EI tasks. It covers a diverse set of tasks, including Emotion Classification, Emotion Detection, Emotion QA, Emotion Conversation, Emotion Summary, and Emotion Expression. On average, the input length for these tasks reaches 8,777 tokens, with long-form generation required for Emotion Expression. To enhance performance under realistic constraints, we incorporate Retrieval-Augmented Generation (RAG) and Collaborative Emotional Modeling (CoEM), and compare them with standard prompt-based methods. Unlike conventional approaches, our RAG method leverages both the conversation context and the large language model itself as retrieval sources, avoiding reliance on external knowledge bases. The CoEM method further improves performance by decomposing the task into five stages, integrating both retrieval augmentation and limited knowledge injection. Experimental results show that both RAG and CoEM consistently enhance EI-related performance across most long-context tasks, advancing LLMs toward more practical and real-world EI applications. Furthermore, we conduct a comparative case study on the GPT series to illustrate how models differ in EI. Code is available on GitHub at https://github.com/LongEmotion/LongEmotion, and the project page can be found at https://longemotion.github.io/.
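The abstract does not spell out implementation details, so the following is only a minimal sketch of the "conversation context plus the model itself as retrieval sources" idea, not the paper's method. The function names (call_llm, retrieve_from_context, retrieve_from_model), the lexical scoring, and the chunk size are all assumptions introduced for illustration.

# Hypothetical sketch: retrieval-augmented emotional reasoning that uses only
# the long conversation and the LLM's own generations, with no external KB.
# `call_llm` is a placeholder for whatever model client is being evaluated.

def call_llm(prompt: str) -> str:
    """Stand-in for any chat/completion endpoint."""
    raise NotImplementedError("plug in your model client here")

def chunk_context(conversation: str, chunk_size: int = 512) -> list[str]:
    # Split the long-context input into fixed-size word chunks.
    words = conversation.split()
    return [" ".join(words[i:i + chunk_size]) for i in range(0, len(words), chunk_size)]

def lexical_score(query: str, chunk: str) -> float:
    # Crude relevance proxy: word overlap between query and chunk.
    q, c = set(query.lower().split()), set(chunk.lower().split())
    return len(q & c) / (len(q) or 1)

def retrieve_from_context(query: str, conversation: str, top_k: int = 3) -> list[str]:
    # First retrieval source: the conversation itself.
    chunks = chunk_context(conversation)
    return sorted(chunks, key=lambda ch: lexical_score(query, ch), reverse=True)[:top_k]

def retrieve_from_model(query: str) -> str:
    # Second retrieval source: background knowledge elicited from the model,
    # standing in for an external knowledge base.
    return call_llm(f"Briefly state background knowledge that helps interpret: {query}")

def answer(query: str, conversation: str) -> str:
    evidence = retrieve_from_context(query, conversation)
    self_knowledge = retrieve_from_model(query)
    prompt = (
        "Relevant excerpts:\n" + "\n---\n".join(evidence)
        + f"\n\nModel background:\n{self_knowledge}"
        + f"\n\nQuestion: {query}\nDescribe the speaker's emotional state:"
    )
    return call_llm(prompt)

In this sketch the two retrieval calls could be followed by additional stages (e.g. knowledge injection and refinement) to mirror a multi-stage pipeline like CoEM, but the actual stage design is defined in the paper.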
Similar Papers
EICAP: Deep Dive in Assessment and Enhancement of Large Language Models in Emotional Intelligence through Multi-Turn Conversations
Computation and Language
Teaches computers to understand and respond to feelings.
Large Language Models are Highly Aligned with Human Ratings of Emotional Stimuli
Artificial Intelligence
AI understands feelings like people do.
AI with Emotions: Exploring Emotional Expressions in Large Language Models
Artificial Intelligence
Computers can now show feelings when they talk.