Score: 0

Thunder-KoNUBench: A Corpus-Aligned Benchmark for Korean Negation Understanding

Published: January 8, 2026 | arXiv ID: 2601.04693v1

By: Sungmok Jung , Yeonkyoung So , Joonhak Lee and more

Potential Business Impact:

Improves AI's understanding of Korean "not" words.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Although negation is known to challenge large language models (LLMs), benchmarks for evaluating negation understanding, especially in Korean, are scarce. We conduct a corpus-based analysis of Korean negation and show that LLM performance degrades under negation. We then introduce Thunder-KoNUBench, a sentence-level benchmark that reflects the empirical distribution of Korean negation phenomena. Evaluating 47 LLMs, we analyze the effects of model size and instruction tuning, and show that fine-tuning on Thunder-KoNUBench improves negation understanding and broader contextual comprehension in Korean.

Country of Origin
πŸ‡°πŸ‡· Korea, Republic of

Page Count
21 pages

Category
Computer Science:
Computation and Language