Score: 0

From Facts to Folklore: Evaluating Large Language Models on Bengali Cultural Knowledge

Published: October 22, 2025 | arXiv ID: 2510.20043v1

By: Nafis Chowdhury , Moinul Haque , Anika Ahmed and more

Potential Business Impact:

Helps computers understand Bengali culture better.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Recent progress in NLP research has demonstrated remarkable capabilities of large language models (LLMs) across a wide range of tasks. While recent multilingual benchmarks have advanced cultural evaluation for LLMs, critical gaps remain in capturing the nuances of low-resource cultures. Our work addresses these limitations through a Bengali Language Cultural Knowledge (BLanCK) dataset including folk traditions, culinary arts, and regional dialects. Our investigation of several multilingual language models shows that while these models perform well in non-cultural categories, they struggle significantly with cultural knowledge and performance improves substantially across all models when context is provided, emphasizing context-aware architectures and culturally curated training data.

Country of Origin
🇺🇸 United States

Page Count
9 pages

Category
Computer Science:
Computation and Language