The 0.8 Cosine Similarity Threshold
Ideas with cosine similarity ≥ 0.8 can be treated as functionally identical for brainstorming purposes.
This empirically-derived threshold was established through testing and provides a practical cutoff for automated de-duplication. Below 0.8, ideas are sufficiently different to warrant separate consideration. Above 0.8, the differences are typically superficial, same core concept expressed differently.
Example at 0.82 similarity (considered identical):
- “MiniMend Sewing Kit: A compact, travel-sized sewing kit with pre-threaded needles…”
- “QuickFix Clothing Repair Kit: A compact kit with needles, thread, buttons…”
Example at 0.36 similarity (considered distinct):
- A battery-powered heating mug
- An LED desk lamp with Pomodoro timer
The threshold enables automated measurement of idea diversity at scale, though it has limitations: it doesn’t consider similarity to ideas that already exist in the world, and isn’t perfectly calibrated to human judgments of semantic difference.
Related:, 05-atom—idea-space-as-landscape