LLM-Empowered Knowledge Graph Construction: A Survey
Author: Haonan Bian, Xidian University
Published: ICAIS 2025 (arXiv: 2510.20345)
Core Framing
The paper surveys how LLMs are reshaping knowledge graph construction across three classical stages: ontology engineering, knowledge extraction, and knowledge fusion. The central argument is a paradigm shift from rule-based pipelines to language-driven, generative frameworks.
The most valuable insight is the inversion of the LLM-KG relationship: from “LLMs as tools for building KGs” to “KGs as infrastructure for grounding LLMs.”
Key Contributions
Three bottlenecks of traditional KG construction:
- Scalability and data sparsity (rule-based systems fail to generalize)
- Expert dependency and rigidity (schemas require extensive human intervention)
- Pipeline fragmentation (cumulative error propagation across stages)
Three mechanisms LLMs enable:
- Generative knowledge modeling (synthesizing structured representations from text)
- Semantic unification (integrating heterogeneous sources via natural language)
- Instruction-driven orchestration (coordinating workflows via prompts)
Two paradigms in ontology construction:
- Top-down - LLMs as ontology assistants (semantic modeling, logical consistency)
- Bottom-up - KGs for LLMs (automatic extraction, schema induction, dynamic evolution)
Two paradigms in knowledge extraction:
- Schema-based - emphasizes structure, normalization, consistency
- Schema-free - emphasizes flexibility, adaptability, open discovery
Future Directions Identified
- KG-based reasoning for LLMs (structured knowledge enhancing inference)
- Dynamic knowledge memory for agentic systems (KGs as persistent memory substrate)
- Multimodal KG construction (integrating text, images, audio, video)
- KGs as “cognitive middle layer” (beyond RAG, providing scaffolding for planning and decision-making)
Connections to heyMHK
- Extends the vectors-vs-graphs conversation with systematic evidence
- Connects to SECI framework: explicit knowledge (KGs) now serves implicit reasoning (LLMs)
- The “dynamic memory substrate” concept echoes the digital garden model
- Schema evolution mirrors the atoms→molecules→organisms maturation pattern
Related: 07-molecule—vectors-vs-graphs, 06-molecule—seci-framework