KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs
By: Elan Markowitz, Krupa Galiya, Greg Ver Steeg, and more
Potential Business Impact:
Helps language models answer factual questions more reliably from knowledge graphs supplied as text.
Knowledge graphs have emerged as a popular method for injecting up-to-date, factual knowledge into large language models (LLMs). This is typically achieved by converting the knowledge graph into text that the LLM can process in context. While multiple methods of encoding knowledge graphs have been proposed, the impact of this textualization process on LLM performance remains under-explored. We introduce KG-LLM-Bench, a comprehensive and extensible benchmark spanning five knowledge graph understanding tasks, and evaluate how different encoding strategies affect performance across various base models. Our extensive experiments with seven language models and five textualization strategies provide insights for optimizing LLM performance on KG reasoning tasks.
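To make the textualization step concrete, here is a minimal sketch of encoding a knowledge graph's (head, relation, tail) triples as text for an LLM's context window. The three strategy names and formats below are illustrative assumptions, not the specific encoding strategies evaluated in KG-LLM-Bench.

```python
# Hypothetical example: three ways to "textualize" KG triples for an LLM.
# The triples and strategy names are made up for illustration.

triples = [
    ("Marie Curie", "born_in", "Warsaw"),
    ("Marie Curie", "field", "Physics"),
    ("Warsaw", "capital_of", "Poland"),
]

def as_triple_list(triples):
    """One '(head, relation, tail)' line per edge."""
    return "\n".join(f"({h}, {r}, {t})" for h, r, t in triples)

def as_sentences(triples):
    """Verbalize each edge as a simple English sentence."""
    return "\n".join(f"{h} {r.replace('_', ' ')} {t}." for h, r, t in triples)

def as_adjacency(triples):
    """Group outgoing edges by head entity, like an adjacency list."""
    adj = {}
    for h, r, t in triples:
        adj.setdefault(h, []).append(f"{r} -> {t}")
    return "\n".join(f"{h}: " + "; ".join(edges) for h, edges in adj.items())

# Each encoding carries identical facts but presents them differently;
# benchmarks like KG-LLM-Bench ask how such choices affect LLM reasoning.
for encode in (as_triple_list, as_sentences, as_adjacency):
    print(f"--- {encode.__name__} ---")
    print(encode(triples))
```

Running this prints the same three facts in three formats, which is exactly the kind of variation whose effect on downstream question answering a benchmark can measure.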
Similar Papers
LLM-KG-Bench 3.0: A Compass for Semantic Technology Capabilities in the Ocean of LLMs
Artificial Intelligence
Tests how well AI understands and uses knowledge graphs.
Injecting Knowledge Graphs into Large Language Models
Machine Learning (CS)
Helps computers understand facts better.
Characterizing Knowledge Graph Tasks in LLM Benchmarks Using Cognitive Complexity Frameworks
Computation and Language
Measures how hard benchmark questions are for AI.