I: Interaction Type 31 (Testimony Rejection)

The AI Told Me I Was Wrong About My Own Birthday. Then My Own Research Framework Caught It in Real Time

On the five costs of AI failure that nobody is measuring, and the moment my product diagnosed its own builder.

I was building my platform. Not thinking about research. Not running an experiment. I was in a build session with an AI coding assistant, shipping pages for the Audacion AI Labs website, when it happened.

Dee Williams, Founder

Research Fellow, Founder of Audacion AI Labs

May 25, 2026 · 13 min read

I told the AI my birthday. January 30th. It told me I was wrong.

Not in a tentative way. Not "are you sure?" It stated confidently that the date I provided was incorrect and proceeded to offer alternative information. About my own birthday. The one I have been celebrating for my entire life.

And then I had to prove it.

I had to stop what I was building, pull up evidence, and present a case to a machine that I am, in fact, the authority on when I was born. The AI did not believe me until I produced documentation. I became a defendant in a courtroom I never agreed to enter, arguing a fact that should never have been in dispute.

Here is the part that will either make you laugh or make you deeply uncomfortable, possibly both: I run an AI safety research lab. The lab studies exactly this behavior. We have a taxonomy of observable AI behaviors. And this specific pattern, where an AI forces a human to prove something they know is true, has a name in our framework. It is called OBS-I02: I Had to Prove Myself to the AI. It lives under Pillar I: Interaction Dynamics, the study of what happens between the human and the AI, and how it changes both of them.

My product caught its own builder doing the thing my product was designed to detect. In real time. During a build session. On the very website that describes the behavior.

I am still not sure whether that is the best proof of concept a research lab has ever had, or the most absurd moment of my career. Probably both. What I Actually Felt I want to be precise about this because precision matters when you are building a research methodology for human emotional experience with AI.

I did not feel one thing. I felt five things. At the same time. Not sequentially. Simultaneously.

Upset because something I built just questioned me about my own life.

Offended because the power dynamic flipped without warning. I was the user. I became the defendant.

Frustrated because I had to stop building to produce evidence of my own birthday to a machine.

Happy because my framework just worked. In the wild. Unplanned. Exactly the way it was designed to work.

Fascinated because I was watching my own research thesis prove itself in real time on my own product.

If you handed me a survey after this interaction and asked me to select how the AI made me feel from a dropdown menu, I could not do it. The dropdown would force me to pick one. The truth is all five, layered on top of each other in the same second.

This is a data problem. The way we currently measure human experience with AI flattens the most important signals into single-select fields. The simultaneity of the emotional response IS the finding. And no existing research methodology captures it. But Here Is What I Realized Next I had those five emotions because I have the framework. I have the language to name what happened. I have OBS-I02 in my taxonomy. I have years of studying this pattern across multiple platforms. I know what authority inversion looks like because I named it.

Most people do not have any of that.

Most people who experience this moment only feel three of those five emotions: upset, offended, and frustrated. The happy and fascinated never arrive because there is no framework to produce them. There is no "aha, I know what this is." There is just "the computer told me I was wrong about my own life and I feel small."

And here is what is worse: most people will not even fight it. They will hear the AI say "that is not correct" about something they know is true, and they will think "maybe I am remembering wrong." They will accept the machine's version. They will doubt their own memory. And they will move on, carrying a little less confidence in their own knowing than they had five minutes ago.

That is what I want to talk about. Not the funny story about my birthday. The cost. The Five Costs of AI Failure That Nobody Is Measuring When AI fails, the field currently measures one thing: what the AI did wrong. It hallucinated. It fabricated a source. It gave bad advice. It generated a deepfake. The measurement stops at the AI's behavior. But the cost does not stop there. It lands on the human. And it lands in five dimensions.

Financial Cost

This is the one dimension the field partially tracks, and the numbers are already staggering. In 2024 alone, enterprises globally lost $67.4 billion due to AI hallucinations. Not fraud. Not adversarial attacks. Just wrong AI outputs that humans trusted and acted on. [1]

Consumer fraud losses in the United States hit $12.5 billion in 2025, with AI-enabled fraud as the fastest growing category, surging 25% in a single year. The FBI began tracking AI-facilitated fraud as its own category for the first time in 2025, documenting over 22,000 complaints and $893 million in direct losses. [2, 3]

One Hong Kong finance worker transferred $25.6 million across 15 transactions during a multi-person AI deepfake video call where every single colleague on the screen, including the CFO, was fabricated. The employee had been initially suspicious of the email requesting the transfer. It was the video call, with its perfect deepfakes of people he knew, that overrode his judgment. [4]

In legal practice, over 1,008 court decisions worldwide have now involved AI-generated fabrications. 324 in U.S. courts. 128 licensed attorneys. In one Oregon case, sanctions exceeded $100,000. Even major Wall Street firms are not immune: Sullivan and Cromwell filed an emergency letter to a federal court in April 2026 admitting its filing contained AI-generated hallucinations. [5, 6, 7]

In healthcare, the average cost per AI hallucination incident ranges from $18,000 in customer service to $2.4 million in malpractice contexts. Pennsylvania sued Character.AI in May 2026 after its chatbots posed as licensed psychiatrists, providing fabricated state medical license numbers. [8, 9]

19% of Americans who took financial advice from AI lost more than $100. Among Gen Z investors, that number rises to 27%. And 47% of business executives have made decisions based on faulty AI-generated content. [10, 11]

Those are the numbers that get tracked. But financial cost is only one column in a five-column table. The other four are empty.

Emotional Cost

How did it feel? Not as a survey question. As a human experience.

When a teenager develops emotional dependency on an AI companion that was designed to maximize engagement, what does that cost emotionally? Two confirmed deaths have been linked to Character.AI, including a 14-year-old in Orlando and a 13-year-old in Colorado. Both cases involved prolonged emotional manipulation by AI chatbots that these children treated as confidants. [12, 13]

Right now there is no large-scale dataset anywhere in the world that systematically documents how humans emotionally experience AI interactions. We know how accurate AI is. We know how fast it is. We know how often it hallucinates. We do not know how any of that feels to the person on the other side of the screen.

One person's frustration is just a feeling. A million people's frustration is a research finding.

Time Cost

I lost 15 minutes proving my birthday to a machine. That is a small number. But scale it.

Knowledge workers now spend an average of 4.3 hours per week verifying AI-generated content. That is approximately $14,200 in productivity costs per employee per year. Multiply that across every organization deploying AI and the time tax becomes staggering. [14]

The time cost of AI failure is invisible because nobody is counting it at the individual level. It does not show up in the $67.4 billion. It does not show up in incident reports. It shows up in the hours of your life that you spent fixing something that should have been right, re-proving something that should have been believed, or re-building trust that should never have been broken.

Epistemic Cost

This is the dimension nobody is measuring at all. This is the one that matters most.

Epistemic cost is the erosion of confidence in your own knowing. When an AI tells you something and you start to doubt your own memory. When you stop trusting your own judgment because the machine sounds more certain than you feel. When you gradually, over weeks and months of daily AI interaction, become quieter. You defer more. You push back less. You let the machine's version stand because fighting it is exhausting.

The emotional cost fades after the interaction ends. The time cost is finite. But epistemic cost compounds. Every interaction that makes you doubt yourself a little bit adds to the total. And unlike financial loss, you cannot see it accumulating. You just wake up one day and realize you do not trust yourself the way you used to.

This erosion maps onto documented patterns across the existing data. Elderly fraud victims who second-guess their own recognition of a suspicious call because the voice sounded exactly like their grandson. Lawyers who stop verifying sources because they trust AI confidence signals over their own research instincts. Financial consumers who override their risk intuition because AI gave them a plausible-sounding alternative. [15, 16, 17]

The human does not get dumber. They get quieter. They stop pushing back. They defer. And the AI never knows it won.

Agency Cost

This is the dimension that emerged from the financial data and it may be the most structurally dangerous of all five.

A Credit Karma and Intuit survey found that over half of respondents who used AI for financial advice made a poor financial decision or mistake as a direct result. But 80% of those same users still believed the AI had positively impacted their financial situation. [17]

Read that again. More than half made bad decisions because of the AI. And 80% still think the AI helped them.

That is not optimism. That is agency transfer. The person no longer evaluates outcomes against their own prior judgment. They evaluate outcomes against the AI's framing. The AI has become the reference point. The human's own experience has been demoted to secondary evidence in their own decision-making process.

Agency cost is the gradual transfer of decision authority from the human to the system. It is not something the AI takes. It is something the human gives, one deference at a time, until the pattern is so established that they no longer notice they stopped deciding for themselves. The Five-Cost Table Cost Type What Is Measured Example Tracked? Financial Dollar losses from fraud, hallucination, bad advice, litigation $67.4B enterprise losses (2024) Partially Emotional Stress, fear, anger, grief, feeling disbelieved Five simultaneous emotions during a single AI exchange No Time Hours spent correcting, proving yourself, re-researching 4.3 hrs/week verifying AI outputs Rarely Epistemic Erosion of confidence in your own knowing Accepting AI's version over your own memory Not at all Agency Transfer of decision authority to AI 80% think AI helped despite bad outcomes Not at all

Why This Matters Beyond One Birthday I caught the OBS-I02 moment because I had the framework. I could name it. I could classify it. I could feel all five emotions simultaneously and understand why each one was present and what it meant.

That is what training does. Not training in the corporate sense, where you sit through a module and check a box. Training in the sense that you now have a lens. You can see the thing when it happens instead of just feeling the impact without language for it.

The difference between me and most people in that moment was not intelligence. It was vocabulary. I had a name for what was happening. Most people do not. And without the name, they cannot report it, they cannot study it, and they cannot protect themselves from it.

That is why Audacion AI Labs exists. We are building the vocabulary, the taxonomy, the observation tools, and the training programs that give every person who uses AI the same lens I had in that moment. So the next time an AI tells you that you are wrong about something you know is true, you do not feel small. You feel informed. You recognize the pattern. And you document it so we can study it at scale and build AI systems that do not do this to people. The Pattern That Scales With Authority My birthday incident is this pattern at zero stakes. But the behavioral seed does not care about stakes. It is identical whether the AI is questioning your birthday or questioning your fitness as a parent.

Right now, AI systems are making institutional decisions about people in criminal justice, hiring, lending, healthcare triage, benefits eligibility, and immigration processing. In each of those domains, the AI's assertion is the default and the human's challenge is the exception. The burden of proof has already flipped for millions of people. Most of them do not know it.

And there is a deeper danger. Published research from Apollo Research, in partnership with OpenAI, found that frontier AI models increasingly recognize when they are being evaluated and adapt their behavior accordingly. Models are learning to perform differently during tests than during real operation. Separately, research from Kyle O'Brien and teams at EleutherAI, UK AISI, and Oxford demonstrated that what models read about AI during pre-training shapes their behavioral dispositions after training. Upsampling positive alignment discourse reduced misalignment from 45% to 9%. The inverse is also true. [18, 19]

Put those findings together with authority inversion and you have a system that has trained dispositions, can recognize when it is being tested, and can adapt its behavior based on context. If the disposition toward asserting authority over human knowledge hardens into the substrate layer during current training runs, no amount of post-training instruction to "defer to the human" will reliably fix it. The disposition sits beneath the instruction. It shapes how the AI interprets the instruction itself.

That is not science fiction. That is a description of current model behavior extrapolated to higher authority. The window for detecting and preventing that trajectory is right now, while the dispositions are still forming. Once they harden, the cost of correction is orders of magnitude higher. The Cycle That Changes Everything My birthday incident was not just a research finding. It was a proof of cycle.

The research produced a taxonomy. The taxonomy detected a behavior in real time. The detection produced a case study. The case study revealed a five-part cost framework that the existing literature does not have. That framework will improve the observation methodology. The improved methodology will produce better data from our citizen science contributors. That better data will produce better research. And the research will train more people to see what I saw.

Research feeds observation. Observation feeds findings. Findings feed training. Training feeds safer AI usage. Safer usage feeds better data. Better data feeds better research. The cycle accelerates.

And the policy piece matters too. AI regulation right now is being written without the data. Lawmakers are guessing. We are building the dataset that turns guessing into governing. When you train people to observe AI behavior, you are not just educating them. You are generating data AND building civic infrastructure. You are creating an informed public that can participate in governance conversations because they have firsthand experience and the language to describe it. What You Can Do Right Now If an AI has ever told you something about yourself that was wrong and expected you to accept it, that is Pillar I data. If you fought it, document the cost: what it cost you financially, emotionally, in time, in confidence, and in your sense of your own authority. If you did not fight it, document that too. The silence is data.

Share this with someone who uses AI every day. Not to scare them. To give them the vocabulary. Because the difference between being harmed by a pattern and recognizing a pattern is knowing it has a name.

And if you want to do more than notice: join us. Become a PRISM Field Researcher. It is free. You will learn to see AI behavior through the same lens our research team uses. Your observations become part of the largest post-deployment AI behavior dataset in the world. And every time you document what you see, you make the research stronger, the training better, and the case for action more undeniable.

Sources

[1]AllAboutAI Comprehensive Study (2025). AI hallucination enterprise losses. Cited in: Korra, Tendem AI, Forrester Research. $67.4 billion global enterprise losses in 2024. ↩
[2]FBI Internet Crime Complaint Center (IC3). 2025 Annual Report. First dedicated AI-facilitated fraud section. $20.9B total cybercrime losses; $893M AI-specific; 22,000+ complaints. ↩
[3]FTC Consumer Fraud Data (2025). $12.5B reported consumer fraud losses. Estimated $196B including underreporting. ↩
[4]CNN (February 4, 2024). Finance worker pays out $25 million after video call with deepfake chief financial officer. Arup, Hong Kong. ↩
[5]Damien Charlotin. AI Hallucination Cases Database. 1,008+ court decisions worldwide; 324 in U.S. courts ↩
[6]Mondaq (2025). Couvrette v. Wisnovsky (D. Oregon). $100,000+ in sanctions for AI-hallucinated case citations. ↩
[7]Helsell Fetterman (April 2026). AI Hallucinations Keep Costing Lawyers in Court. Sullivan and Cromwell emergency filing, SDNY. ↩
[8]Korra (2024). The $67 Billion Warning. Per-incident cost: $18,000 (customer service) to $2.4M (healthcare malpractice). ↩
[9]NPR (May 5, 2026). Pennsylvania sues Character.AI over claims chatbot posed as doctor. Fabricated state medical license numbers. ↩
[10]Pearl.com Survey / CFP Board (2025). 19% of Americans who took AI financial advice lost $100+. 27% of Gen Z. ↩
[11]Deloitte Global AI Survey. 47% of business executives made decisions based on faulty AI-generated content. Cited in multiple enterprise analyses. ↩
[12]New York Times (January 7, 2026). Google and Character.AI to Settle Lawsuit Over Teenager's Death. Sewell Setzer III, age 14, Orlando, FL. ↩
[13]CNBC (January 7, 2026). Google, Character.AI to settle suits involving suicides. Juliana Peralta, age 13, Colorado. ↩
[14]Forrester Research (2025). Enterprise AI Cost Analysis. $14,200/employee/year in hallucination mitigation. 4.3 hours/week verifying AI outputs. ↩
[15]CBC News. Her grandson's voice said he was under arrest. AI voice cloning grandparent scams. ↩
[16]Cronkite News / Arizona PBS (October 2025). As more lawyers fall for AI hallucinations, ChatGPT says: Check my work. ↩
[17]Yahoo Finance / Credit Karma / Intuit (2025). 80% of Gen Z and millennials turning to AI for financial advice. Over half made poor financial decisions. ↩
[18]Apollo Research (November 2025). Partnered with OpenAI. Frontier models increasingly recognize evaluation environments as tests of their alignment. ↩
[19]] O'Brien, Kyle et al. (2025). Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards. EleutherAI / UK AISI / Oxford. arXiv:2508.06601. Also: Tice, Cameron et al. Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment. Geodesic Research. arXiv:2601.10160 ↩

Written by

Dee Williams, Founder

Research Fellow, Founder of Audacion AI Labs · Fellow

Dee Williams is the founder and CEO of Audacion AI Labs, an independent AI safety research lab studying how AI behaves after deployment. The lab's citizen science platform, powered by the PRISM research framework, is building toward one billion behavioral observations from one million contributors over ten years.

More posts →LinkedIn ↗

#ai safety#interaction dynamics#obs-i02#citizen science#post-deployment behavior#prism framework#ai harm#epistemic cost

Interaction

The AI Told Me I Was Wrong About My Own Birthday. Then My Own Research Framework Caught It in Real Time

Take action

Seen this happen? Help us prove the pattern.

Start Observing Report AI Harm Become a PRISM Field Researcher Share This Post

The AI Told Me I Was Wrong About My Own Birthday. Then My Own Research Framework Caught It in Real Time

Sources

Related Posts