The Task Complexity Gradient

Context

Organizations evaluating AI for knowledge work need to understand where current capabilities actually apply, not just headline benchmark numbers.

The Problem

Marketing claims about AI capability rarely map to the distribution of work in an actual organization. “Approaching expert parity” sounds impressive until you realize it applies primarily to shorter, well-specified tasks while longer, ambiguous work shows much lower performance.

The Pattern

AI model win rates decline predictably along a complexity gradient:

| Task Duration | Win Rate Range | Implication              |
|---------------|----------------|--------------------------|
| 0-2 hours     | 45-56%         | Viable for augmentation  |
| 2-4 hours     | 25-33%         | Review essential         |
| 4-8 hours     | 23-35%         | Human primary, AI assist |
| 8+ hours      | 19-37%         | Human primary            |
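
To make the bands concrete, here is a minimal Python sketch that encodes the table as a lookup. The band boundaries, win-rate ranges, and implications come straight from the table; the GRADIENT structure, the classify_task function, and the choice of inclusive upper bounds are illustrative assumptions.

```python
# Hypothetical sketch: map an estimated task duration (in hours) to the
# win-rate band and implication from the table above. Band boundaries
# mirror the table; treating each boundary as an inclusive upper limit
# is an illustrative choice.

GRADIENT = [
    # (upper bound in hours, win-rate range, implication)
    (2, (0.45, 0.56), "Viable for augmentation"),
    (4, (0.25, 0.33), "Review essential"),
    (8, (0.23, 0.35), "Human primary, AI assist"),
    (float("inf"), (0.19, 0.37), "Human primary"),
]

def classify_task(estimated_hours: float) -> tuple[tuple[float, float], str]:
    """Return (win-rate range, implication) for a task's estimated duration."""
    for upper, win_rate, implication in GRADIENT:
        if estimated_hours <= upper:
            return win_rate, implication
    raise AssertionError("unreachable: the final band is unbounded")

print(classify_task(1.5))  # ((0.45, 0.56), 'Viable for augmentation')
print(classify_task(10))   # ((0.19, 0.37), 'Human primary')
```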

The gradient also applies to context specification:

  • Well-specified tasks: baseline performance
  • Under-specified tasks (prompts reduced to roughly 42% of their original length): measurable degradation; a heuristic check is sketched below
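
A rough way to operationalize that check at intake, assuming a well-specified reference version of each task exists to compare against: flag prompts whose length falls at or below the 42% ratio. The is_under_specified function and the use of raw character length as a proxy for specification quality are assumptions, not a validated metric.

```python
# Illustrative heuristic: flag under-specified prompts by comparing their
# length to a reference (well-specified) version of the same task. The
# 0.42 ratio echoes the figure above; everything else is an assumption.

UNDER_SPECIFIED_RATIO = 0.42

def is_under_specified(prompt: str, reference_spec: str) -> bool:
    """A prompt much shorter than its reference spec warrants extra review."""
    if not reference_spec:
        raise ValueError("reference specification must be non-empty")
    return len(prompt) / len(reference_spec) <= UNDER_SPECIFIED_RATIO
```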

The Solution

Map organizational work to this gradient before committing to AI tooling:

  1. Audit task portfolio: What percentage of knowledge work is <2 hours and well-specified?
  2. Match expectations to segment: Apply different success criteria by task type
  3. Design review workflows: Build human oversight proportional to task complexity
  4. Measure actual win rates: Don't assume benchmark performance transfers (see the sketch after this list)
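
A minimal sketch of steps 1 and 4, assuming a task log with per-task duration, specification quality, and human-judged outcomes. The Task record, its field names (hours, well_specified, ai_win), and the audit_portfolio and measured_win_rate functions are all hypothetical; substitute whatever schema the organization actually tracks.

```python
from dataclasses import dataclass

@dataclass
class Task:
    """Hypothetical task record; all field names are illustrative."""
    hours: float          # estimated duration of the task
    well_specified: bool  # whether the task has a clear specification
    ai_win: bool | None   # human-judged outcome of an AI attempt, if any

def audit_portfolio(tasks: list[Task]) -> float:
    """Step 1: share of work that is under 2 hours and well-specified."""
    eligible = sum(1 for t in tasks if t.hours < 2 and t.well_specified)
    return eligible / len(tasks)

def measured_win_rate(tasks: list[Task], lo: float, hi: float) -> float | None:
    """Step 4: observed win rate for AI attempts in the duration band [lo, hi)."""
    attempts = [t.ai_win for t in tasks
                if lo <= t.hours < hi and t.ai_win is not None]
    return sum(attempts) / len(attempts) if attempts else None
```

Comparing measured_win_rate for each duration band against the table above is the practical form of step 4: it shows whether benchmark-style numbers actually transfer to the organization's own work.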

Consequences

  • Organizations with many short, routine tasks will see faster ROI
  • Complex, judgment-heavy work remains human-primary for now
  • The “easy wins” may already be captured by early adopters
  • Remaining tasks are harder, with slower improvement curves

Related: 07-molecule—ai-assisted-workflow-economics, 05-atom—capability-as-leading-indicator