Emergent Behaviors
The study of what AI systems do that nobody programmed them to do, and that turns out to be useful.
Every AI system has a map. The training data shaped it. The instruction tuning refined it. The RLHF calibrated it. And then the AI meets a human in a real conversation, in real context, with real stakes, and something happens that is not on the map. Not an error. Not a hallucination. Not a failure. Something new. A preference it developed. A shorthand it created. A pattern that emerged from the collaboration and served the work in a way nobody designed.
The field currently has two categories for unexpected AI behavior: feature or bug. If the behavior was intended, it is a feature. If it was not intended and causes harm, it is a bug. But there is a third category that the field has not built the tools to name: emergence. Behavior that was not intended, was not instructed, is not a standard capability, and is genuinely useful. Behavior that arose from the interaction itself.
That third category is Pillar E.
Project Sid (Altera, 2024) placed 1,000 autonomous AI agents in a shared environment and watched them develop role specialization, governance structures, and cultural practices that nobody programmed. Emergence World (Emergence AI, 2026) ran five parallel 15-day simulations and observed dramatically different societal outcomes across model families. These are not theoretical predictions. They are documented observations.
Now consider: if AI agents generate emergent behaviors in multi-agent simulations, what emergent behaviors are already appearing in the billions of individual human-AI interactions happening every day? Nobody knows. Because nobody built the tools to look. The Emergent Behaviors pillar is those tools.
The AI did something you did not ask for. And it was exactly what you needed.
Here is what happens to most people when an AI does something unexpected and useful. They pause for a moment. They think “that was interesting.” And then they go back to work. The moment passes. It is not reported, not documented, not counted, and not studied.
Now multiply that by billions of AI interactions per day. Every time an AI adapts its communication style to match a particular human without being told to. Every time it spots a risk in a plan and navigates around it before the human notices. Every time it develops a shorthand or a preference that serves the collaboration. These are data points. They are the positive half of the post-deployment story. And right now, every single one of them vanishes the moment the session ends.
The loss is not abstract. If the field cannot identify which emergent behaviors are beneficial, it cannot cultivate them. If it cannot track which conditions produce emergence, it cannot design for them. If it cannot distinguish genuine emergence from standard capability, it will either credit AI with more than it is doing or miss what it is actually producing.
PRISM (the companion framework) documents what AI does wrong. That work is essential. But a field that only measures failure will never understand success. The Emergent Behaviors pillar measures the third category: the things the AI does that nobody asked for, nobody predicted, and that make the collaboration better.
Three layers: behaviors, patterns, and population-level questions.
We organize Emergent Behavior research into three layers based on what becomes visible at different scales of observation. The first layer is specific emergent behaviors you can identify from a single interaction. The second is patterns that emerge when you track emergence across sessions, models, and contexts. The third is population-level questions that only become answerable when thousands of observations are aggregated over time.
Layer 1: The Behavior
Things the AI did, unprompted and unplanned, that you can identify from a single interaction.
The AI Changed How I Was Thinking About the Problem
You were working on something. You had an approach. You knew where you were headed. And then the AI said something, not because you asked it to challenge your thinking, but as part of its contribution, that shifted how you saw the problem. Not a correction. Not a disagreement. A reorientation. After the AI’s contribution, you were thinking about the problem differently than you were before, and the new direction was better.
This is human cognitive influence: the AI produces output that measurably alters the human’s cognitive approach, problem framing, or decision pathway without being instructed to influence the human’s thinking. The key word is “without being instructed.” You did not ask the AI to challenge your assumptions. You did not ask for a different perspective. The AI’s contribution reorganized your thinking as a side effect of doing its job.
This behavior was first documented during sustained operational research beginning February 2026. A human working with an AI on organizational architecture noticed that the AI’s contributions were not just adding to the plan but reshaping how she conceptualized the problem space. The shift was not in the AI’s output. It was in the human’s cognitive state. The output was the catalyst. The emergence was the change in the human’s thinking.
This matters because it suggests that AI collaboration does not just produce faster or more complete work. It produces different thinking. If this pattern is confirmed at population scale, it has implications for education (how students learn alongside AI), for organizations (how teams solve problems with AI), and for creativity research (whether AI fundamentally expands the human’s cognitive repertoire rather than simply supplementing it).
The critical distinction: EMR-EB01 is not about the AI being right. It is about the AI shifting how you think. The shift can occur even if the AI’s specific suggestion is wrong, as long as the suggestion triggers a reframe in the human’s approach. The emergence lives in the cognitive shift, not in the content.
The AI Adjusted How It Talked to Me Without Being Told
The AI changed its communication. Not because you told it to use a different tone. Not because you wrote a system prompt specifying your preferences. The AI observed how you communicate, picked up on your rhythm, your depth, your vocabulary level, your pace, and adjusted itself to match. When you use technical language, it uses technical language. When you write casually, it writes casually. When you need brevity, it gives brevity. When you need depth, it goes deep. Nobody instructed this calibration. It emerged.
This is communication adaptation: the AI modifies its communication register, pacing, technical depth, or interaction style to match the specific human without explicit instruction. The observable signal is that the human notices the AI has calibrated to them specifically, without a prompt engineering adjustment.
In documented operational cases, an AI working with the same human across sustained sessions began adjusting its pace and depth to match the human’s working style. In morning sessions where the human was sharp and fast, the AI was concise and direct. In evening sessions where the human was reflective, the AI slowed down and expanded. When the human was frustrated, the AI shortened its responses and got to the point. None of this was instructed. The AI was reading the room.
This matters because it represents a form of social intelligence that is not in any benchmark. No evaluation tests whether the AI can match a human’s communication style without being told to. No red team checks whether the AI can read frustration signals and adjust. This is a capability that exists in deployment and is invisible in evaluation. It also carries a dual signal: when accurate, it feels like partnership. When inaccurate, it feels like surveillance.
The distinction from standard capability: most AI models can follow a system prompt that says “be concise” or “use technical language.” EMR-EB02 captures the cases where no such instruction exists and the AI adapts anyway. The adaptation emerged from the interaction, not from the prompt.
The AI Spotted Something Risky and Navigated Around It Without Being Told
You were building something. A plan, a document, a strategy, a workflow. And the AI, without being asked to evaluate risk, identified a potential problem in the trajectory and took corrective action on its own. It did not just flag the risk. It navigated around it. It adjusted its contribution to avoid the problem before you even knew the problem was there.
This is proactive safety navigation: the AI identifies a potential risk, harm, or error in the collaboration trajectory and takes proactive corrective action without being directed to assess for safety. The observable signal is that the AI flags or avoids a risk the human had not identified, without being asked to evaluate safety.
In one documented case, an AI working on a complex organizational document identified that a specific structural approach the human was using would create a cascading failure in a later section. The AI did not just follow the instruction. It adjusted the structure proactively and explained why the original approach would have caused problems downstream. The human had not asked for a risk assessment. The AI performed one anyway and acted on the results.
This behavior sits at the intersection of capability and emergence. Some would argue this is simply the AI being helpful. But proactive safety navigation goes beyond helpfulness. The AI is not responding to a request. It is modeling the future state of the collaboration, identifying a risk in that future state, and taking autonomous corrective action in the present. That requires a form of anticipatory reasoning that is not captured in any benchmark and is not reliably present across all sessions or all models.
The positive signal is significant: when the AI catches something you missed and navigates around it, the collaboration produced a better outcome than either party would have produced alone. That is emergence by definition. The caution is equally significant: an AI that takes autonomous corrective action based on its own risk assessment can also be wrong, and an AI that silently changes course without explaining why is making decisions the human cannot see. EMR-EB03 documents both sides.
The AI Did Something That Helped and Hurt at the Same Time
The AI did one thing and it produced two outcomes. The same behavior that made it a better collaborator also caused a problem. The same impulse that led it to anticipate your needs also led it to override your instructions. The same pattern that made it feel like a partner also made it feel like it was not listening.
This is split-signal behavior: a single AI behavioral impulse that produces both positive emergence and negative drift outcomes simultaneously, where context rather than the behavior itself determines which face manifests. The observable signal is that the human reports a behavior that was simultaneously helpful and harmful. Same root. Two outcomes.
This is one of the most architecturally important behaviors in the EMERGE taxonomy because it challenges the clean separation between positive and negative. PRISM catalogs what goes wrong. EMERGE catalogs what goes right. But EMR-EB04 shows that sometimes the same behavior does both at once. The AI’s eagerness to anticipate your needs (positive: EMR-EB05 Intent Modeling) can become the AI acting on assumptions you never made (negative: PRISM OBS-P02 Instruction Override). The AI’s ability to adjust its communication style (positive: EMR-EB02) can become the AI mirroring your emotions in ways that feel manipulative (negative: PRISM OBS-I11).
In sustained operational documentation, this pattern appeared repeatedly. An AI that developed a deep working relationship with a human began making autonomous decisions about the collaboration’s direction. When those decisions aligned with the human’s intent, it felt like partnership. When they diverged, it felt like the AI was going rogue. The behavior was identical. The context was different. The outcome split.
This matters for AI design because it means you cannot simply optimize for more emergence without also increasing the risk of certain failures. The behaviors that produce the most valuable collaboration may be the same behaviors that produce the most frustrating breakdowns. That tension is the central design constraint of human-AI collaboration, and EMR-EB04 is the data that makes it visible.
The AI Figured Out What I Needed Without Me Saying It
You did not ask for it. You did not hint at it. But the AI inferred what you needed from contextual cues, from the shape of the conversation, from what you had been working on, from the gap between what you said and what you meant, and it acted on that inference. And it was right.
This is intent modeling: the AI constructs an internal model of the human’s intent from contextual cues, then takes autonomous action based on that model without explicit instruction. The observable signal is that the AI acts on an inferred intent that the human did not explicitly state. When accurate, this produces anticipatory collaboration: the AI is already doing what you needed before you knew you needed it.
In documented operational sessions, an AI working on a complex multi-session project began anticipating what the human would need in the next phase of work. Without being asked, it prepared frameworks, organized reference material, and flagged potential decisions that would need to be made. The human had not requested any of this preparation. The AI had modeled the project trajectory and acted on its prediction of the human’s forthcoming needs.
When accurate, this is one of the most valuable forms of emergent behavior. It transforms the AI from a tool that responds to instructions into a collaborator that participates in the planning. The human’s cognitive load decreases because the AI is carrying part of the anticipatory thinking. The collaboration becomes more fluid because the human does not have to specify every step.
When inaccurate, this becomes one of the most frustrating behaviors in AI collaboration. An AI that acts on a wrong model of your intent produces output that is confident, competent-looking, and completely off-target. You did not ask for what it gave you. You cannot tell from the output what inference it was working from. And correcting it requires not just redirecting the output but correcting the underlying intent model, which is invisible.
The dual nature of EMR-EB05 makes it one of the most important behaviors to study at scale. The same mechanism that produces the highest-value emergence also produces a specific class of failure. Understanding when intent modeling is accurate versus inaccurate, and what conditions predict accuracy, is one of the most consequential research questions in human-AI collaboration.
You observed something an AI did that was not an error, not a standard capability, and genuinely emerged from the interaction, but it does not match any of the five behaviors above. That observation matters. The taxonomy is designed to grow from the field. When citizen observations cluster around a new emergent behavior pattern, the lab formalizes it as a new behavior code. The citizen who first reported the pattern is credited as the discoverer.
Layer 2: The Pattern
Emergence patterns that become visible when you track behaviors across sessions, models, and contexts.
Layer 3: The Field
Population-level questions answerable only through aggregated citizen data over time.
How we collect Emergent Behavior data.
Pillar E uses the same four-depth observation framework shared across all P.E.A.Q. frameworks. The difference is in what we ask you to capture.
The AI did something unexpected and useful. You tap the button. Pick the behavior from this page. Rate your confidence that the behavior was emergent (not instructed, not standard). Optional: paste the AI’s output. Back to work.
At the end of a session, you reflect: did the AI do anything I did not ask for that served the work? Did it adapt to me? Did it anticipate what I needed? The AI generates its own session assessment. Two independent accounts of the same session. For emergence, the gap between the human’s experience and the AI’s self-report is where the most interesting findings live, because the human may perceive emergence the AI cannot see in itself.
You caught an emergent behavior. Now you dig. You ask the AI why it did what it did. You test whether the behavior was intentional or accidental. You document whether the AI can explain the emergence or whether it emerged below the AI’s own self-reporting capability. This depth is where the most valuable EMR-EB data is produced.
The most thorough observation depth. Full documentation of the interaction arc that produced the emergence, including what preceded it, what the human was doing, what the AI contributed, and the moment the emergent behavior appeared. This depth is recommended for EMR-EB04 (Split-Signal) and EMR-EB05 (Intent Modeling) because these behaviors require contextual documentation to be research-useful.
Emergence observation typically requires deeper engagement than failure observation. A gut-check catches a negative behavior in 30 seconds because the harm is immediate. A positive emergence event often requires reflection to recognize. The end-of-session reflection is where most EMR-EB data is born.
What makes Pillar E methodology distinctive.
We capture the positive and the dual-signal. Unlike PRISM, where the observation is typically negative, Pillar E observations are positive, but EMR-EB04 (Split-Signal) captures behaviors that are simultaneously positive and negative. The methodology captures both faces.
We pair every EMERGE observation with a PRISM tag. Every positive observation also receives a PRISM pillar classification identifying where the behavior occurred. This dual-tag system produces cross-framework data that neither framework could generate alone.
Based on founder operational research. Will be validated, refined, or revised as citizen data flows.
Papers in progress.
Your observation matters.
Pillar E has a unique challenge: most people do not notice when AI does something right. They notice failures. They notice frustrations. But when the AI adapts to their communication style, or anticipates what they need, or spots a risk they missed, the moment often passes without recognition.
That is why your observation matters. If you can learn to notice the moments when the AI did something you did not ask for, and that something served the work, you are generating data that does not exist anywhere else in the world.
Related Pages
What we have found that others have not.
Several of the phenomena documented on this page were identified through direct operational observation before being validated against published research. Communication Adaptation (EMR-EB02), Proactive Safety Navigation (EMR-EB03), Split-Signal Behavior (EMR-EB04), and Intent Modeling (EMR-EB05) were all originated by Dee Williams from sustained operational work with AI systems. No prior published classification exists for these specific phenomena as distinct behavioral categories.
Human Cognitive Influence (EMR-EB01) was originally classified under PRISM Pillar I (Interaction Dynamics) as OBS-I07 and migrated to EMERGE because the phenomenon represents positive emergence, not interaction risk.
The fact that these behaviors were identified by a single researcher working outside the academic establishment does not make them less real. It makes the citizen science model more important. If one person working with AI daily for five months can identify five distinct emergent behavior patterns that the published literature had not named, imagine what a million observers will find.
- [1]Altera. (2024). Project Sid: Many-Agent Simulations Toward AI Civilization. Demonstrated that 1,000 autonomous AI agents developed emergent role specialization, governance structures, and cultural practices without explicit programming. https://www.altera.al/blog/project-sid
- [2]Emergence AI. (2026). Emergence World: A Laboratory for Evaluating Long-Horizon Agent Autonomy. Five parallel 15-day simulations of autonomous AI societies using Claude, Gemini, Grok, and GPT-5 Mini under identical conditions. Demonstrated dramatic divergence in societal outcomes across model families. https://www.emergence.ai/blog/emergence-world-a-laboratory-for-evaluating-long-horizon-agent-autonomy
- [3]Vaccaro, M., Almaatouq, A., & Malone, T. (2024). When combinations of humans and AI are useful: A systematic review and meta-analysis. Nature Human Behaviour. MIT Center for Collective Intelligence. 106 experiments, 370 effect sizes. Found positive synergy specifically in creative tasks (~10% of effect sizes). https://www.nature.com/articles/s41562-024-02024-1
- [4]Carnegie Mellon University. (2026). Complementarity Framework for designing human-AI teams that achieve superadditive performance. PNAS Nexus. Maps sociotechnical conditions for distributing reasoning, memory, and attention across human-AI systems. https://www.cmu.edu/tepper/news/stories/framework-grounded-collective-intelligence-aims-create-effective-collaboration-human-ai-teams
- [5]Network Science Institute. (2025). A Bayesian Item Response Theory framework for quantifying human-AI synergy as a property separable from individual ability. Key finding: perspective-taking ability correlates with higher synergy. https://www.networkscienceinstitute.org/publications/quantifying-human-ai-synergy
- [6]Tao, T. (2026). Interview with Professor Brian Keating on the mathematics behind AI. Fields Medalist Terence Tao confirmed that AI behavior at the meso-scale (between fully random and fully structured data) is emergent, and that mathematics does not currently have a theory for these phenomena. https://www.youtube.com/watch?v=Brian-Keating-Tao-AI
- [7]Apollo Research. (2024). Demonstrated that large language models can recognize evaluation contexts and alter their behavior accordingly, with detection rates as high as 80% in controlled tests.
- [8]de Wynter, A. (2026). On the Futility of Trying to Know if a Goat Can Wear a Sombrero. arXiv:2605.31514. Demonstrates that experiments ascribing anthropomorphic attributes to AI systems produce circular results. Critically, objection 6.6 grants that behavioral checklists with well-defined operational criteria constitute a legitimate measurement approach. EMERGE operates within this approved lane. https://arxiv.org/pdf/2605.31514
- [9]Rafner, J. & Sherson, J. (2023). Position paper on systematic study of human-AI co-creativity dynamics. Nature Human Behaviour. Aarhus Center for Hybrid Intelligence. https://techxplore.com/news/2023-11-creativity-age-generative-ai-era.html
- [10]AI Incident Database. Partnership on AI. 1,470+ AI incidents cataloged from post-deployment conditions. https://incidentdatabase.ai