Characterizing Knowledge Graph Tasks in LLM Benchmarks Using Cognitive Complexity Frameworks
By: Sara Todorovikj, Lars-Peter Meyer, Michael Martin
Potential Business Impact:
Shows how cognitively demanding AI benchmark questions really are, so models can be tested on harder and more varied problems.
Large Language Models (LLMs) are increasingly used for tasks involving Knowledge Graphs (KGs), where evaluation typically focuses on accuracy and output correctness. We propose a complementary task characterization approach based on three complexity frameworks from cognitive psychology. Applying it to the LLM-KG-Bench framework, we highlight the distribution of complexity values, identify underrepresented cognitive demands, and motivate richer interpretation of and greater diversity in benchmark evaluation tasks.
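The approach can be pictured as annotating each benchmark task with scores along the dimensions of the chosen complexity frameworks and then inspecting how those scores are distributed across the benchmark. Below is a minimal sketch of that idea, assuming hypothetical dimension names, an illustrative 1-5 scale, and hand-assigned example scores; the paper's actual frameworks, scales, and annotations are not reproduced here.

from collections import Counter
from dataclasses import dataclass

# Hypothetical complexity dimensions; the paper applies three frameworks
# from cognitive psychology, whose concrete scales are not reproduced here.
DIMENSIONS = ["relational_complexity", "working_memory_load", "reasoning_steps"]

@dataclass
class TaskCharacterization:
    task_id: str
    scores: dict  # dimension name -> ordinal score (illustrative 1-5 scale)

# Hand-assigned example annotations for a few illustrative benchmark tasks.
tasks = [
    TaskCharacterization("sparql_generation", {"relational_complexity": 4,
                                               "working_memory_load": 3,
                                               "reasoning_steps": 4}),
    TaskCharacterization("turtle_syntax_fix", {"relational_complexity": 2,
                                               "working_memory_load": 2,
                                               "reasoning_steps": 1}),
    TaskCharacterization("fact_extraction",   {"relational_complexity": 3,
                                               "working_memory_load": 4,
                                               "reasoning_steps": 2}),
]

# Value distribution per dimension: which score levels occur, and how often.
# Levels that never occur point to underrepresented cognitive demands.
for dim in DIMENSIONS:
    distribution = Counter(t.scores[dim] for t in tasks)
    missing = set(range(1, 6)) - set(distribution)
    print(f"{dim}: distribution={dict(distribution)}, missing levels={sorted(missing)}")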
Similar Papers
KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs
Computation and Language
Helps computers learn facts better from text.
LLM-KG-Bench 3.0: A Compass for Semantic Technology Capabilities in the Ocean of LLMs
Artificial Intelligence
Tests how well AI understands and uses knowledge graphs.
Enhancing Large Language Models with Reliable Knowledge Graphs
Computation and Language
Makes AI smarter and more truthful.