Reference Data Types and Their AI Value

Different types of open source reference data serve different enrichment purposes:

Occupational taxonomies (O*NET, ESCO, SOC)

  • Enable AI to understand job roles, required skills, task structures
  • Support HR, workforce planning, talent management assistance
  • Bridge between informal job descriptions and standardized categories

Knowledge graphs (Wikidata, DBpedia)

  • Provide entity metadata for disambiguation
  • Help AI distinguish between “Apple” the company and “apple” the fruit
  • Rich relationship data for contextual understanding

Skills taxonomies (ESCO, OSMT)

  • Map competencies to occupational requirements
  • Enable skills-based matching and gap analysis
  • Support career development and workforce planning

Technical ontologies (Common Core, CodeOntology)

  • Structure knowledge about software, engineering, systems
  • Enable more accurate assistance with technical documentation
  • Provide domain-specific vocabulary and relationships

Commonsense knowledge (ATOMIC, ConceptNet)

  • Capture everyday reasoning patterns
  • Help AI understand causes, effects, social dynamics
  • Fill gaps that technical knowledge doesn’t cover

The pattern: each type of reference data enables a different class of AI capability. The combination creates compound enrichment.

Related: 04-molecule—reference-data-multiplier, 06-atom—entity-linking-dimensionality, 06-molecule—knowledge-graph-construction