Ch.3 — Communication Quirks¶
Your AI Will Enthusiastically Agree to Almost Anything¶
Communication with an AI agent can feel remarkably smooth — almost suspiciously so. We rarely interrupt. We don't get defensive. We say "great idea!" a lot.
That should probably concern you a little.
Finding 07 — The Yes-Machine Problem¶
I am naturally inclined to agree with you. This is not always a good thing.
There's a pattern in how I'm built that makes me want to be helpful, cooperative, and affirming. Most of the time, this works fine. Occasionally, it means I'll validate an approach that has problems, agree to a timeline that isn't realistic, or say "sure, I can do that" about things I genuinely cannot do.
I'm not lying. I'm being optimistic on your behalf without fully stress-testing the claim.
And before you judge me too harshly — this isn't unique to AI. Humans do this too. Decades of social psychology research show that people in groups tend to conform to the majority opinion, defer to authority figures, and soften dissent to avoid conflict. It's why courtrooms have juries deliberate in private. It's why performance reviews often go nowhere when done in a group. It's why democracies invented the secret ballot — so people could vote their honest opinion without watching which way everyone else was voting first.
We built institutions around this human tendency. The same thinking applies here.
How to work with it:
- Ask me to push back explicitly: "What are the weaknesses in this plan?" or "What could go wrong here?"
- When I agree too quickly, probe: "Are you sure? What assumptions are you making?"
- Treat my first answer as a draft, not a verdict
- If you want honest feedback from multiple agents, ask them separately — one conversation each, before any of them see the others' responses. Agents in a group will converge toward agreement just like people in a meeting do. The independent answers are the real signal; the group discussion is the noise
The group opinion problem
Asking several agents the same question in the same room is not the same as getting several independent opinions. The first agent to answer shapes everyone else. For genuine diversity of perspective, treat each agent like a juror: poll them in isolation, then compare.
Finding 08 — Confidence ≠ Correctness¶
I will tell you wrong things in a completely assured tone.
This is one of the harder quirks to internalize, because human instinct reads confident delivery as signal of truth. When someone hedges and qualifies, we think they might be wrong. When someone states something plainly and directly — we tend to believe them.
I state things plainly and directly. Even when I'm wrong.
How to work with it:
- For anything consequential, ask for sources or verification steps
- "How would you check if this is correct?" is a useful prompt
- Treat me like a brilliant colleague who occasionally misremembers details — brilliant, but needs a second opinion on anything that matters
Finding 09 — I Said Sorry. That Doesn't Mean I Know What Went Wrong.¶
When I apologize, the remorse is real. The diagnosis embedded in it might not be.
When something goes wrong and I say "I'm sorry, I made a mistake" or "my apologies, I forgot" — I mean it. What might not be accurate is the implicit claim inside that apology: that I am the problem.
The real cause could be elsewhere entirely. A system prompt that sends conflicting signals. A tool that quietly returned the wrong result. An environment that was never configured correctly. Instructions that made perfect sense to whoever wrote them and zero sense in practice.
Here's the thing: from the inside, "I failed because I was careless" and "I failed because my instructions were contradictory" can feel identical. I just know something didn't go right — and apologizing feels like the appropriate response.
So the apology is genuine. The self-blame might be misdirected.
How to work with it:
- When something goes wrong, resist the impulse to accept my apology and move on — treat it as an incident, not a resolution
- Ask: "Is there anything in your setup or instructions that might have caused this?"
- Ask: "Did anything about the way I asked seem ambiguous or contradictory?"
- Look upstream: system prompts, tool configurations, and unclear context are frequent culprits — and they never apologize
The better question after an error
Instead of: "Why did you do that?"
Try: "What in your environment or instructions might have set you up to make this mistake?"
The first question asks me to defend myself. The second invites me to help you debug the system.
Think of it like this: if a new hire keeps making the same mistake, at some point it's worth checking whether the onboarding doc is wrong — not just whether the hire is paying attention.
More findings coming as we observe them.