Score: 0

Evaluating Cultural Knowledge Processing in Large Language Models: A Cognitive Benchmarking Framework Integrating Retrieval-Augmented Generation

Published: November 3, 2025 | arXiv ID: 2511.01649v1

By: Hung-Shin Lee , Chen-Chi Chang , Ching-Yuan Chen and more

Potential Business Impact:

Tests if computers understand different cultures.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

This study proposes a cognitive benchmarking framework to evaluate how large language models (LLMs) process and apply culturally specific knowledge. The framework integrates Bloom's Taxonomy with Retrieval-Augmented Generation (RAG) to assess model performance across six hierarchical cognitive domains: Remembering, Understanding, Applying, Analyzing, Evaluating, and Creating. Using a curated Taiwanese Hakka digital cultural archive as the primary testbed, the evaluation measures LLM-generated responses' semantic accuracy and cultural relevance.

Country of Origin
🇹🇼 Taiwan, Province of China

Page Count
31 pages

Category
Computer Science:
Computation and Language