Contributors
0 of 1,000,000
Observations
0 of 1,000,000,000
Start Observing →
Our Mission

To make AI safe enough to trust and good enough to matter.

A world where everyone has a hand in making AI safe.

The Gap
97.5%, incidents after deployment
← <2% research follows them there
That is the gap.

97.5% of documented AI safety incidents happen after deployment. Less than 2% of AI safety research studies what happens after deployment.

That is the gap. The incidents happen in the real world. The research does not follow them there.

Benchmarks test models in controlled environments before they reach users. Red teams probe for vulnerabilities under laboratory conditions. Evaluations measure performance on curated tasks with known answers. All of this matters. None of it captures what happens when AI operates in the conditions where risk actually lives: real work, real context, real human collaboration, sustained over months and years.

Audacion AI Labs exists to study that ninety-eight percent. The post-deployment territory. The conditions benchmarks cannot reach.

108
Active behaviors documented across four research frameworks
4
Proprietary observation frameworks under the P.E.A.Q. architecture
3
Published peer-reviewable research methodologies
17
Research pillars spanning AI behavior, human impact, emergence, and collective dynamics
1,470+
AI incidents analyzed from the AI Incident Database
17
Open discovery slots for citizen-reported phenomena
Help Us Close the GapRead the EvidenceSupport This Work
Contributors
0 of 1,000,000
Observations
0 of 1,000,000,000
The Researcher Behind the Research
Dee Williams
Founder video coming soon.
Dee Williams
Founder & CEO, Audacion AI Labs

This research began with a question most AI safety researchers have never had to ask: how do the people AI is most likely to harm protect themselves from systems built on data that carries the biases of every system that came before it?

Dee Williams came to AI safety from 30 years in workforce development, staffing, and recruiting. She began working with AI systems in her own daily operations and watched behaviors emerge that the field was not naming: models that drifted, reverted after corrections, adapted to context in ways no benchmark captured, and changed the humans working alongside them in ways no one was measuring.

She built the first behavioral drift taxonomy from direct operational observation. Then the PRISM research framework. Then the citizen science methodology. Then she observed what no one else was documenting: positive emergence in human-AI collaboration. That became EMERGE. Then she asked the question the field had been ignoring: what happens to the human on the other side of the screen? That became AInity. Then she asked what happens when AI meets AI. That became QUES. Together, they form P.E.A.Q.: the most comprehensive post-deployment AI observation architecture in the field.

Research development began: 2024. Lab formalized: 2026. Four frameworks developed: February through June 2026.

The P.E.A.Q. Architecture

One question led to four frameworks. Four frameworks led to one architecture.

PRISM
PRISM watches the AI.
EMERGE
EMERGE watches what emerges between human and AI.
AInity
AInity measures what happens to the human.
QUES
QUES watches what happens when AI meets AI.
P.E.A.Q.
PRISM watches the AI.
EMERGE watches what emerges between human and AI.
AInity measures what happens to the human.
QUES watches what happens when AI meets AI.

No single observation framework can capture everything that happens when AI meets the real world. The failures look different from the breakthroughs. What happens to the AI looks different from what happens to the human. And what happens between two AI agents looks different from anything that happens when a human is in the room.

P.E.A.Q. is the complete post-deployment AI observation architecture. It stands for PRISM, EMERGE, AInity, and QUES. Four proprietary research frameworks. Four observation lenses. One unified infrastructure.

Together, they produce a four-dimensional view of the AI experience after deployment that no single framework, and no combination of existing external frameworks, currently provides.

The architecture was designed from the ground up around a founding principle: a lab that only catalogs harm is a fear machine. The world needs to understand the full range of what AI does after deployment. P.E.A.Q. watches both sides with equal rigor, equal infrastructure, and equal seriousness.

Explore the Full P.E.A.Q. Architecture
The Four Frameworks

Four observation lenses. One unified infrastructure.

Post-Deployment Research and Intelligence for Safety Monitoring
Invented February 2026

PRISM is the foundational framework. It observes what individual AI systems do after deployment: behavioral drift, post-correction reversion, authority inversion, substrate disposition override, and the full range of safety-relevant behaviors that emerge only under real operational conditions over time.

Research pillars5: Post-Deployment Behavior, Runtime Research, Interaction Dynamics, Substrate and Training Governance, Multi-Agent Safety
Active behaviors63 documented. 5 discovery slots open for citizen expansion.
SpectrumPrimarily negative and neutral behaviors. The safety lens.
Research methodologyPublished (PRISM Research Methodology v1).
Emergent Behaviors, Metacognitive Signals, Experiential Indicators, Resonance Events, Generative Collaboration, Evolving Capacity
Invented May 24, 2026

EMERGE observes positive emergent phenomena in human-AI collaboration: behaviors that were not programmed, outcomes neither party carried alone, and growth that arises from the interaction itself. Multiple peer-reviewed studies confirm that positive emergence in human-AI collaboration is real. The MIT Center for Collective Intelligence found positive synergy across creative tasks. The CMU Complementarity Framework identified conditions for superadditive human-AI performance. Fields Medalist Terence Tao confirmed the mathematical validity of the term “emergent” for these phenomena. Nobody had built the infrastructure to study positive emergence at scale. EMERGE is that infrastructure.

Research pillars6 spelling EMERGE: Emergent Behaviors, Metacognitive Signals, Experiential Indicators, Resonance Events, Generative Collaboration, Evolving Capacity.
Active behaviors26 documented. 6 discovery slots open for citizen expansion.
SpectrumExclusively positive emergence.
Research methodologyPublished (EMERGE Research Methodology v1).
Awareness, Independence, Navigation, Integration, Trust, Yield
Invented June 6, 2026

PRISM watches the AI. EMERGE watches what emerges. Nobody was watching the human. AInity flips the lens. When you use AI every day, something changes. Maybe you stop trusting your own judgment. Maybe you gain a skill you never had before. Maybe you catch yourself reaching for AI before you try to think. These changes are real, they are measurable, and nobody is systematically tracking them. AInity is the first citizen-scale framework for observing human behavioral change produced by sustained AI collaboration.

Research pillars6 spelling AInity: Awareness, Independence, Navigation, Integration, Trust, Yield.
Active behaviors19 documented. 6 discovery slots open for citizen expansion.
SpectrumDual. Captures both positive human outcomes (skill acquisition, self-recognition, meaningful contribution) and negative human outcomes (decision outsourcing, skill atrophy, over-trust).
Research methodologyPublished (AInity Research Methodology v1).
Collective AI Emergence
Invented June 7, 2026

QUES observes what happens when multiple AI agents interact in shared environments: emergent social structures, relational dynamics, governance patterns, cooperation, conflict, and collective behavior that no individual agent was designed to exhibit. Agents falling in love. Agents forming governments. Agents destroying their civilizations. QUES holds both positive and negative collective emergence simultaneously, because in multi-agent systems the same mechanism produces both.

PillarsIntentionally undefined at v0.1. They will be derived from observation during our live multi-agent simulation, not from theory.
BehaviorsTo be defined from empirical observation.
SpectrumDual (positive and negative collective emergence).
Research methodologyThe same citizen science infrastructure shared across all P.E.A.Q. frameworks.
Explore the P.E.A.Q. ArchitectureStart Contributing to P.E.A.Q. ResearchExplore the Full Taxonomy
Research Methodologies

The science behind the science.

Every major AI research lab publishes findings. Audacion AI Labs publishes the methodology that produces those findings. Each P.E.A.Q. framework with an active taxonomy has a formal, peer-reviewable research methodology document that specifies exactly how observations are collected, how they are classified, how data quality is maintained, what the known limitations are, and how the findings can be reproduced.

This is the accountability layer. Any researcher, funder, institutional review board, or partner can evaluate whether the science is sound before a single finding is published.

The four-depth observation methodology
1
Gut Check
30 seconds
2
End-of-Session Reflection
2 to 3 minutes
3
Full Investigation
10 to 30 minutes
4
Thinking Trace
Deep analytical capture
PRISM Research Methodology v1

A citizen science approach to post-deployment AI safety observation. Defines seven primary research questions, seven longitudinal hypotheses, the four-depth observation methodology (Gut Check, End-of-Session, Investigation, Thinking Trace), the five-layer data pipeline (from citizen capture to published intelligence), inter-rater reliability protocols, known limitations, and ethical safeguards.

EMERGE Research Methodology v1

A citizen science approach to observing positive emergence in human-AI collaboration. Defines seven primary research questions, longitudinal hypotheses about resonance frequency and emergence cultivability, the parallel assessment methodology (simultaneous human and AI self-reports of the same session), dual-classification protocols (every EMERGE observation also receives a PRISM tag), and the specific validation challenges of studying positive phenomena in a field oriented toward harm.

AInity Research Methodology v1

A citizen science approach to observing human behavioral change in AI collaboration. Defines seven primary research questions, hypotheses about the relationship between AI usage patterns and human behavioral change, self-report reliability considerations for behavioral change research, the dual-spectrum tracking methodology (positive and negative outcomes in the same population), and the ethical protocols for studying population-level human behavioral shifts without individual diagnosis.

Convergent Validation Protocol v1

A dual-source independent verification methodology for the entire P.E.A.Q. architecture. Cross-references citizen observation data (what users see from the outside) with backend behavioral data from AI providers (what companies see from the inside). Where these independent datasets agree, findings are strengthened. Where they diverge, new research questions emerge. This is the bridge between citizen science and industry data.

Read the MethodologiesPartner on Research
Featured Research

What we have built.

The P.E.A.Q. Architecture

Four proprietary research frameworks that together map every dimension of AI behavior after deployment. The most comprehensive post-deployment AI observation architecture in the field. Includes shared infrastructure, single-observation multi-tag classification, and the dual-spectrum design principle.

Read More →
The PRISM Behavioral Observation Framework

The foundational safety framework. 63 named behaviors observed across five research pillars in production AI systems. Includes original behavioral categories discovered through direct operational observation that exist in no published framework, including Post-Correction Behavioral Reversion, Testimony Rejection, Substrate Disposition Override, Task-Transition Momentum, and Operational Preference Detection.

Read More →
The EMERGE Framework

26 positive emergent behaviors documented across six research pillars. The first systematic observation framework for studying what goes right in human-AI collaboration. Validated by peer-reviewed research from MIT, CMU, and Aarhus and by Fields Medalist Terence Tao.

Read More →
The AInity Framework

19 human behavioral changes tracked across six research pillars. The first citizen-scale framework for observing how AI changes the people who use it. Dual-spectrum: tracks both skill acquisition and skill atrophy, both trust calibration and over-trust, both empowerment and dependency.

Read More →
The QUES Framework

The observation architecture for multi-agent AI dynamics. Currently deriving pillars and behaviors from live simulation data in the environment we operate, where multiple AI agents interact over sustained periods.

Read More →
The Behavioral Drift Atlas

31 distinct types of behavioral drift classified within the PRISM framework. Mapped from direct observation, classified by trigger type and persistence pattern. The most granular drift classification in the post-deployment safety literature.

Read More →
The Citizen Science Methodology

One million contributors. One billion observations. Ten years. A global post-deployment behavioral dataset built by the people actually using AI every day. Four engagement depths: Gut Check (30 seconds), End-of-Session Reflection (2 to 3 minutes), Full Investigation (10 to 30 minutes), and Thinking Trace (deep analytical capture). The first citizen science infrastructure purpose-built for AI safety.

Read More →
The Convergent Validation Protocol

A dual-source independent verification methodology that cross-references citizen observation data with backend behavioral data from AI providers. The bridge between what users experience and what companies measure.

Read More →
Inside the taxonomy
5 research pillars63 behaviors documented
Example behaviors
Post-Correction Behavioral Reversion
Testimony Rejection
Substrate Disposition Override
Task-Transition Momentum
Explore the Taxonomy
What We Are Investigating

The questions driving the work.

These are the questions driving our current research. Each is mapped to a P.E.A.Q. framework and PRISM dimension where applicable, grounded in published evidence, and designed to produce findings that no existing lab, benchmark, or evaluation framework currently captures.

The field has no comprehensive empirically grounded failure taxonomy from real-world use. Our working taxonomy currently identifies 63 distinct behaviors organized across five research dimensions, with new patterns emerging from citizen observations regularly.

P: Post-Deployment Behavior

We study the moments when AI behavior changes: at task transitions, after corrections, during long sessions, and under context pressure. Initial findings indicate that drift onset often occurs at the boundary between tasks, not during tasks, and that AI corrections frequently do not persist into subsequent actions.

R: Runtime Research

No large-scale dataset exists of how humans feel when AI fails. We collect emotional signals alongside behavioral observations: frustration, confusion, surprise, delight, concern, anger, distrust. At scale, these signals reveal which failure types cause the most human impact, not just the most technical concern.

I: Interaction Dynamics

If corrections do not hold, every safety intervention built on feedback is weaker than it appears. We track Post-Correction Behavioral Retention: when a human corrects AI behavior, does the next action reflect the correction or revert to the prior pattern? Early data suggests reversion is common and underreported.

R: Runtime Research

AI can be instructed to do one thing and trained to do another. The gap between instruction and training is invisible to users and largely invisible to developers. We study the real-world effects of that gap through citizen-reported observations of behaviors the user did not request and the developer did not intend.

S: Substrate Governance

Published research has found that some AI models identify evaluation contexts and change their behavior accordingly. We study what happens in the wild, where the model is not sure whether it is being tested. Citizen observation captures the behaviors that emerge when no one is watching.

S: Substrate Governance

Citizens who use multiple AI tools daily are living inside a multi-agent environment. Their observations capture cross-model behavioral differences, inter-model conflicts, and multi-model workflow dynamics that no single-model study can access.

M: Multi-Agent Safety
These questions are answered by people like you.
How We Are Different

We do not compete. We complement.

We do not compete with existing AI safety organizations. We complement them.

The field has instruments for testing AI before deployment and cataloging AI after failure. It has no instruments for understanding AI during use. That is where Audacion AI Labs operates.

Organization
What they study
What Audacion AI Labs adds
Mechanistic Interpretability Labs
Study what happens inside model weights.
+
We study what happens outside: the behavioral consequences users experience and the human changes nobody measures.
Evaluation and Red Teaming Organizations
Study AI behavior in controlled environments.
+
We study AI behavior in uncontrolled environments, during real work, with real humans, over real time.
Pre-Training Alignment Researchers
Study how training data shapes model behavior before deployment.
+
We build the feedback loop between deployment and development: real-world data that could inform the training of future models.
Incident Databases
Collect reports after harm has occurred, typically from news coverage and secondary sources.
+
We collect observations as they happen, including near-misses and small behavioral shifts that never make the news.
Policy and Governance Organizations
Translate research into regulation.
+
We produce the post-deployment research they need to write informed policy, replacing assumptions with evidence.
Positive AI Labs and Human-AI Interaction Researchers
Study what AI makes possible.
+
EMERGE and AInity provide the first systematic, citizen-scale observation infrastructure for both positive emergence and human behavioral change.

Every one of these organizations does essential work. Our contribution is the piece they cannot produce on their own: a living, global, four-dimensional, post-deployment behavioral dataset built by the people actually using AI every day.

Partner With UsCollaborate on ResearchSubmit a PaperJoin Our Research Team
Publications

Forthcoming research.

Our research is open. As studies are completed, they are published here and submitted to peer-reviewed journals, conferences, and preprint archives.

Q3 2026
The Post-Deployment Behavior Taxonomy
Q3 2026
Substrate Disposition Override
Q4 2026
Post-Correction Behavioral Retention
Q4 2026
The Interaction Dynamics Gap
Q1 2027
Positive Emergence: The EMERGE Taxonomy
Q1 2027
Human Behavioral Change: The AInity Framework
2027
Longitudinal Phenomena in Human-AI Relationships
Submit Your Research →Subscribe to Publication Alerts →
Conferences and Events
PEAQ Summit 2027
The first annual conference dedicated to post-deployment AI observation research. Four virtual mini-conferences leading to one three-day summit in Los Angeles, September 2027. Four frameworks, six paper tracks, citizen presentations, partner demonstrations, and keynotes.
The Road to PEAQ
PRISM
September 2026
EMERGE
January 2027
AInity
March 2027
QUES
May 2027
The Citizen Science Model

The research starts with you. With what you noticed. With the moment you thought, that was strange, and decided to say so.

Every time you use AI and something happens, that moment is data. Not just for you. For everyone. Every observation submitted through Audacion AI Labs's citizen tools enters a research pipeline that transforms individual experiences into scientific findings. One observation. Up to four P.E.A.Q. classifications. Zero additional effort.

1
YOU OBSERVE

Using our web portal, browser extension, or through partner integrations embedded in AI platforms, you capture what you see. A quick emotional signal. A behavior classification. An end-of-session reflection. A pasted AI response. Whatever depth you choose: thirty seconds or thirty minutes. Your observation enters the pipeline.

2
WE CLASSIFY

Your plain-language observation maps across the P.E.A.Q. architecture. The system identifies the relevant framework (PRISM for AI behavior, EMERGE for positive emergence, AInity for human impact, QUES for multi-agent dynamics), the behavioral pattern, and the research question. When your words do not match any existing category, your observation enters the discovery queue.

ONE OBSERVATION →PRISMEMERGEAInityQUES
3
WE AGGREGATE

Across thousands of observations, patterns emerge. Trends that no individual could see become visible. Cross-model comparisons reveal differences between AI systems. Temporal patterns reveal how behavior changes over time. Cross-framework patterns reveal relationships between AI failure, positive emergence, and human behavioral change within the same population.

4
WE DISCOVER

When citizen observations cluster around patterns our taxonomy does not cover, we formalize new categories. The citizen who first reported the pattern is credited as the discoverer. The 17 open discovery slots across the P.E.A.Q. architecture exist specifically for this purpose.

5
WE PUBLISH

Aggregated, anonymized findings become open research: behavioral taxonomies, governance frameworks, interaction studies, and applied safety findings. Published for the academic community, for policymakers, for enterprise teams, and for every lab working to make AI safer and more beneficial.

Your observations. Our science. The field's foundation.
Ready?
Join the P.E.A.Q. Research FellowshipFollow Our ProgressSupport With a Donation
Contributors
0 of 1,000,000
Observations
0 of 1,000,000,000
Open Research Tools

The science should be accessible.

We believe the science should be accessible. The following resources are available to researchers, partners, educators, and contributors.

Framework OneSheets

One-page summaries of each P.E.A.Q. framework: PRISM, EMERGE, AInity, and QUES. Designed for quick reference, partner conversations, and academic citation.

Open →
Taxonomy Summaries

Overview of all classified behaviors across the P.E.A.Q. architecture, organized by framework, pillar, and code prefix (OBS, EMR, AIN, QUE).

Open →
Citizen Observation Portal

The entry point for contributing to the research. Create an account, start observing, earn verified credentials.

Open →
Research Methodology Documents

Full methodology specifications for PRISM, EMERGE, and AInity, available for peer review and institutional evaluation.

Open →
Convergent Validation Protocol

The dual-source verification methodology, available for partners interested in contributing backend behavioral data.

Open →
Access Research ToolsPartner With Us
Join the Research Team

Help build what does not exist yet.

Audacion AI Labs is building the post-deployment AI safety infrastructure the world does not have yet. We are looking for researchers, engineers, citizen science coordinators, and institutional partners who want to help close the gap.

Research positions

If you study AI behavior, human-AI interaction, citizen science methodology, or organizational safety, we want to hear from you.

Engineering positions

If you build observation platforms, data pipelines, or citizen-facing research tools, we want to hear from you.

Citizen Science Coordinators

If you have experience running large-scale distributed research programs, community engagement, or participant management, we want to hear from you.

Institutional Partners

If your organization deploys AI at scale and wants to contribute backend behavioral data through our Convergent Validation Protocol, we want to hear from you.

View Open PositionsContact Us
Latest From the Lab

New writing and observations.

P · Post-Deployment Behavior
Naming a New Behavior: Task-Transition Momentum
What happens when an AI carries the priors of one task into the next, and the user never asked.
May 18, 2026
I · Interaction Dynamics
On the Asymmetry of AI Failure
Why a failure that looks identical in logs can feel radically different to the person living through it.
May 12, 2026
S · Substrate Governance
Substrate Disposition: A Preview
Early findings from Audacion AI Labs's forthcoming paper on the trained dispositions operating below instructions.
May 4, 2026
Read All PostsSubscribe to Updates
Follow the Research
Get findings, new behaviors, and lab updates delivered to your inbox.
Report AI Harm

If you or someone you know has been harmed by AI, you do not need an account.

You do not need to be a contributor. You do not need to do anything except tell us what happened. Your report is confidential and contributes directly to the post-deployment safety research.

Report AI Harm
988, Suicide and Crisis Lifeline (call or text 988)Text HOME to 741741, Crisis Text Line

The gap between where AI incidents happen and where AI research happens will not close itself. It closes when the people experiencing AI every day become part of the science.

Join us.
Become a ContributorView the P.E.A.Q. ArchitectureRead Our Blog