Data Quality Contextual Relationships

Data quality is not intrinsic to data, it’s a relationship between data and its intended use. The same dataset can be high-quality for one purpose and low-quality for another.

The Contextual Principle

“Fitness for use” is the core concept. Quality questions must always be:

  • Quality for what task?
  • Quality for which users?
  • Quality under what constraints?

Examples

Same Dataset, Different Quality:

  • Customer addresses: High quality for marketing, low quality for emergency response (needs real-time updates)
  • Historical sales: High quality for trend analysis, low quality for demand forecasting (distribution shift)

Implications

No Universal Quality Score: Multi-dimensional assessment required Use-Case Documentation: Quality requirements must be specified per application Quality Degradation: As use cases change, quality assessments must be revisited Producer-Consumer Gap: Data producers may not understand consumer quality needs

Governance Response

  • Catalog data with intended uses, not just schemas
  • Define quality thresholds per use case
  • Monitor quality relative to specific applications
  • Create feedback loops between consumers and producers

Related: 04-atom—data-quality-dimensions-consensus-gap, 04-atom—data-governance