Three-Tier License Framework for Open Source Data

A risk-based classification for evaluating open source datasets for enterprise use:

Tier 1: Public Domain — No Restrictions

  • CC0 1.0, U.S. Government Works, ODC-PDDL
  • No attribution required, no compliance workflow needed
  • Lowest legal risk; prioritize these

Tier 2: Attribution Required — Simple Compliance

  • CC BY 4.0, Apache 2.0, BSD 3-Clause, Open Government licenses
  • Must credit source in documentation
  • Straightforward compliance; establish a standard attribution workflow

Tier 3: ShareAlike — Requires Legal Assessment

  • CC BY-SA, ODbL 1.0
  • Derivatives must use same license
  • The hard question: does internal enrichment constitute a “derivative work”?

The framework enables rapid triage. Tier 1 datasets can deploy immediately. Tier 2 needs a workflow. Tier 3 needs lawyers.

Related: 04-atom—data-governance, 04-molecule—reference-data-multiplier