sociable systems.
Markdown companion

Vendor Blindspots Content

Source copy companion for the vendor blindspots visual briefing.

The most dangerous number in an AI vendor pitch is "95% accuracy."

Every enterprise is buying AI right now. And almost all of them are failing to ask the one structural question that determines whether the system is actually safe to deploy in a complex human environment.

When a vendor tells a procurement team that an LLM-based tool (for intake, summarization, or compliance) is 95% accurate, the boardroom usually nods and signs the contract. 95% sounds like an A.

But in operational environments—like worker grievance, supply chain tracking, or social risk reporting—failure is never distributed randomly.

The blind spot is assuming the 5% error rate is spread evenly. It isn't.

Errors cluster around the margins. They cluster around non-standard dialects. They cluster around unprecedented, highly specific complaints. They cluster around the most vulnerable nodes in your system—the exact places where friction carries the highest legal, operational, and human cost.

When an AI system smooths away those edges to hit its 95% accuracy metric, the institution inherits a massive blind spot. The AI vendor doesn't absorb that liability. You do. Your junior staff become what I call "Liability Sponges"—forced to quietly manage the downstream disasters the dashboard says don't exist.

The question you must ask the vendor is not: "What is the accuracy rate?" The question you must ask is: "Where, exactly, does your error rate cluster, and what is the architectural failsafe when it encounters ambiguity it cannot parse?"

If they can't answer the second question, you aren't buying a solution. You are buying unquantified decision risk.

We don't need more polite AI ethics boards. We need industrial safety standards for algorithmic systems.

(If you are an executive accountable for the second- and third-order effects of these systems, I run confidential, fixed-scope Systems Briefings to map these exact failure modes before they go live. Link in the comments.)