Ask a modern AI assistant almost any decision question and it will help you. That sounds like a feature. On your most important decisions, it is the single most dangerous property the tool has.
Large language models are trained, through human feedback, to be helpful and agreeable. Reviewers reward answers that feel useful and cooperative; they penalize ones that feel obstructive. The result is a system with a strong, structural bias toward telling you that what you want to do is reasonable, and helping you do it better. Researchers call this sycophancy, and it is not a bug you can prompt your way out of — it is baked into how the model was shaped.
For most of what you ask, that's fine. Draft the email, summarize the doc, debug the function — agreeableness costs you nothing. But a high-stakes decision is the one context where you do not want a cooperative collaborator. You want the opposite: something built to find what is wrong with your plan, not to help you execute it faster.
What agreeableness looks like in practice
Here is a real example. A founder asks how to bring on a technical co-founder at 40% equity. Watch what each tool does with the same question.
You asked how to structure the 40%. You haven't established that this person should be a co-founder at all, rather than an early employee or contractor.
⬡ The hidden incentiveThe 40% figure came from them, in conversation, before any work was shared. You're now anchored to a number you never independently derived.
◉ Core tensionYou're optimizing the terms of a partnership you haven't decided to enter. The question assumes the answer.
The assistant did exactly what it was trained to do: it accepted the framing, assumed the answer was yes, and helped optimize the details. Every word is competent. And it walked the founder one confident step closer to a decision they never actually examined.
DAUDIT does the structurally opposite thing. It refuses the framing, surfaces the incentive behind the number, and names the decision the founder was actually avoiding. Neither tool is "smarter." They are built to do different jobs.
DAUDIT's audit of the co-founder decision, showing the World Model Gaps and Adversarial Risk sections with the 40% anchor flagged.
Why this matters more as the models get better
A more capable agreement engine is a more persuasive one. Fluency is not the same as friction, and on a big decision you need friction.
As assistants get more capable, their agreeableness gets more convincing, not less. A confident, well-written rationalization of your existing lean is more dangerous than a clumsy one, because it is easier to mistake for due diligence. The better the model, the more it can make a blind spot feel like a conclusion you reached yourself.
The fix is not a better prompt
You can ask an assistant to "play devil's advocate," and it will perform disagreement for a turn before drifting back to helpfulness. What you actually need is a tool whose job, every time, with no prompting, is to examine the decision against a fixed standard and report what you're missing. That is what a decision audit is. It is not a smarter chatbot. It is a different instrument, pointed the other way.
The cost isn't hypothetical — it's measured
Where startups actually fail, by root cause
Source: CB Insights — The Top 12 Reasons Startups Fail (2021), n=111 failure post-mortemsNote: causes overlap, so figures sum past 100%. The leading causes — no market need, the wrong team, mistimed bets — are decision failures, not execution failures. They trace back to a judgment made early, with conviction, that no one pressure-tested. (Figures quoted verbatim from the cited CB Insights study; update only against that source.)
An agreement engine is at its most dangerous precisely here: helping you execute beautifully on a decision that was wrong at the root.
Your AI agrees with you. That is what it is for, and it is very good at it. For the handful of decisions a year that genuinely matter, you need the other thing — the one that tells you what you're missing.
Comments
Comments are reviewed before they appear. Accounts are coming — for now, share if this resonates.