Entity Linking
The task of identifying mentions of entities in text and linking them to entries in a knowledge base. Bridges unstructured text and structured knowledge.
The Challenge
Text mentions are ambiguous:
- “Apple” → Company? Fruit?
- “Washington” → George? State? DC?
- “Michael Jordan” → Basketball player? Professor?
Process
- Mention Detection: Find spans that refer to entities
- Candidate Generation: Retrieve possible KB entries
- Disambiguation: Select correct entry based on context
- NIL Handling: Manage mentions with no KB entry
Methods
Traditional: String matching + feature engineering Neural: Learned embeddings for mentions and entities Hybrid: Combine string features with semantic similarity
Why It Matters
Entity linking enables:
- Knowledge graph population from text
- Question answering with KB grounding
- Document enrichment with structured metadata
- Cross-document coreference
Related: 06-molecule—knowledge-graph-construction, 06-molecule—ontology-design-patterns