LLM-Empowered Knowledge Graph Construction: A Survey

Author: Haonan Bian, Xidian University
Published: ICAIS 2025 (arXiv: 2510.20345)

Core Framing

The paper surveys how LLMs are reshaping knowledge graph construction across three classical stages: ontology engineering, knowledge extraction, and knowledge fusion. The central argument is a paradigm shift from rule-based pipelines to language-driven, generative frameworks.

The most valuable insight is the inversion of the LLM-KG relationship: from “LLMs as tools for building KGs” to “KGs as infrastructure for grounding LLMs.”

Key Contributions

Three bottlenecks of traditional KG construction:

Scalability and data sparsity (rule-based systems fail to generalize)
Expert dependency and rigidity (schemas require extensive human intervention)
Pipeline fragmentation (cumulative error propagation across stages)

Three mechanisms LLMs enable:

Generative knowledge modeling (synthesizing structured representations from text)
Semantic unification (integrating heterogeneous sources via natural language)
Instruction-driven orchestration (coordinating workflows via prompts)

Two paradigms in ontology construction:

Top-down - LLMs as ontology assistants (semantic modeling, logical consistency)
Bottom-up - KGs for LLMs (automatic extraction, schema induction, dynamic evolution)

Two paradigms in knowledge extraction:

Schema-based - emphasizes structure, normalization, consistency
Schema-free - emphasizes flexibility, adaptability, open discovery

Future Directions Identified

KG-based reasoning for LLMs (structured knowledge enhancing inference)
Dynamic knowledge memory for agentic systems (KGs as persistent memory substrate)
Multimodal KG construction (integrating text, images, audio, video)
KGs as “cognitive middle layer” (beyond RAG, providing scaffolding for planning and decision-making)

Connections to heyMHK

Extends the vectors-vs-graphs conversation with systematic evidence
Connects to SECI framework: explicit knowledge (KGs) now serves implicit reasoning (LLMs)
The “dynamic memory substrate” concept echoes the digital garden model
Schema evolution mirrors the atoms→molecules→organisms maturation pattern

>heyMHK

LLM-Empowered Knowledge Graph Construction: A Survey

LLM-Empowered Knowledge Graph Construction: A Survey

Core Framing

Key Contributions

Future Directions Identified

Connections to heyMHK

Properties

Graph view

Table of Contents