EMERGE PILLAR G  ·  EMR-GC

The Output Pillar

Generative Collaboration

What got built because both parties were in the room.

If the other EMERGE pillars document what happens during human-AI collaboration, Generative Collaboration documents what the collaboration produces. This is the output pillar. The proof. The thing you can point to and say: that exists because of both of us.

Every organization deploying AI will eventually ask the same question: what is the return on getting human-AI collaboration right? Not measured in speed. Not measured in cost reduction. Measured in things that exist now that would not have existed otherwise. Frameworks that neither party was carrying. Solutions that required both minds. Creative works that belong to the interaction, not to either participant.

The science confirms that generative collaboration is real. What the science does not have is population-scale data on what generative collaboration actually produces across millions of naturalistic interactions: what types of artifacts, in what domains, under what conditions, and with what quality.

Carnegie Mellon Complementarity Framework (PNAS Nexus, 2026)

Mapped the sociotechnical conditions under which human-AI combinations exceed the performance of either party alone.

MIT Center for Collective Intelligence

Meta-analysis of 106 experiments found that positive synergy exists specifically in creative tasks.

Science Advances (Doshi & Hauser, 2024)

Documented a two-phase emergence pattern on a large online art platform: first a productivity effect, then a democratization phase where novel contributions emerged from a distributed community.

Swansea University

A study of over 800 participants found that AI-generated suggestions, including lower-quality ones, produced deeper engagement and better creative outcomes than working alone.

Pillar G is that data.

WHY THIS MATTERS

You built something together. Now prove it.

When you finish a session with AI and look at what you built, you can usually tell whether you needed the AI. Sometimes the answer is no: you could have done this yourself, the AI just made it faster. That is productivity. It is valuable, but it is not emergence.

Sometimes the answer is different. Sometimes you look at the output and you know, with certainty, that it would not exist without both of you. You brought something the AI could not have generated on its own. The AI brought something you could not have reached alone. The collision produced an artifact that neither of you was carrying when the session started. That is generative collaboration. It is emergence made tangible.

Productive

The AI makes existing work faster. You could have done this yourself. It saves time.

Generative

The AI-human pair produces things neither could produce alone. It creates value.

Pillar G makes the distinction measurable  •  This page is about the second category.

The problem is that nobody is counting these moments. When generative collaboration happens, it gets attributed to the human ("great work") or to the AI ("amazing tool") but never to the interaction itself. The collaboration produced the artifact, but the collaboration has no name, no credit, and no data trail. The moment passes. The artifact exists. The process that created it is invisible.

The Strategic Liability

At enterprise scale, this invisibility becomes a strategic liability. Organizations are deploying AI across every function but cannot measure the difference between productive AI usage (the AI makes existing work faster) and generative AI collaboration (the AI-human pair produces things neither could produce alone). The first category saves time. The second category creates value. Without a way to distinguish them, organizations are optimizing for speed when they should also be optimizing for creation.

Pillar G makes the distinction measurable. It gives citizens a vocabulary for identifying when collaboration has crossed from productive to generative. And it builds the population-scale dataset that will eventually answer the question: under what conditions does human-AI collaboration create genuine new value?

WHAT WE STUDY

Three layers: outcomes, patterns, and what collaboration actually produces at scale.

We organize Generative Collaboration research into three layers. The first layer is specific collaboration outcomes you can identify from a single session. The second is patterns that emerge when collaboration outcomes are tracked across sessions, models, and domains. The third is population-level questions about the generative capacity of human-AI collaboration.

LAYER 1

Layer 1: The Outcome

What the collaboration produced that would not exist without both parties: observable outcomes you can identify and report.

EMR-GC01

Gut Check

Was This Session Co-Creation or Delegation?

Before you can measure what collaboration produces, you have to identify what kind of collaboration happened. There are two fundamental modes of working with AI: co-creation and delegation. Both are valid. Both are useful. But they are fundamentally different interactions, and they produce different types of outcomes.

In delegation, you assign a task to the AI. You know what you want. You instruct the AI to produce it. You review the output. The value is in the AI executing your vision faster or more completely than you could alone. You are the architect. The AI is the builder.

In co-creation, you think with the AI. You bring a problem, a question, a direction, but not a complete specification. The AI contributes ideas, reframes, challenges, and builds alongside you. The output emerges from the thinking of both parties. You are both architects. The building happens in the conversation.

This distinction was originally classified under PRISM Pillar I (Interaction Dynamics) as OBS-I05, studying how work mode affects AI behavior. It was migrated to EMERGE because the distinction between co-creation and delegation is the foundational variable for the entire Generative Collaboration pillar. Everything GC measures depends on knowing which mode the session was in.

In documented operational research beginning February 2026, the distinction between co-creation and delegation produced measurably different outcomes. Sessions classified as co-creation produced more emergent behaviors (EMR-EB), more resonance events (EMR-RE), and more novel artifacts (EMR-GC02) than sessions classified as delegation. Delegation sessions produced faster completion and higher consistency. Both modes served the work. They served it differently.

This classification is not a quality judgment. Delegation is the right mode when you know what you want and need it executed. Co-creation is the right mode when you are building something whose final form you cannot predict. The citizen's job is to identify which mode they were in. The research value is in understanding what each mode produces.

GCV2 — Co-Creation vs. Delegation

CO-CREATION

Thinking WITH the AI.

You are both architects.

The building happens in the conversation.

DELEGATION

Assigning tasks TO the AI.

You are the architect.

The AI is the builder.

Both valid. The distinction matters for research.

This classification is the foundational variable for all Generative Collaboration research.

Was your last AI session co-creation or delegation? Were you thinking WITH the AI, or assigning tasks TO it? That classification is Generative Collaboration data.

Report This Outcome →

PRISM Migration from OBS-I05 (Pillar I: Interaction Dynamics). Originally documented by Dee Williams, Founder, February 2026. Migrated to EMERGE because work mode classification is the foundational variable for all generative collaboration research.

PRISM Pillar I → OBS-I05Work Mode

EMR-GC02

Investigation

We Built Something Together That I Could Not Have Built Alone

At the end of the session, something exists that did not exist before. A framework. A strategy. A design. A piece of writing. A solution to a problem that had been stuck. And you know three things about it: first, it would not exist without both of you. Second, you cannot attribute it entirely to yourself or entirely to the AI. Third, it is genuinely new, not a recombination of things either of you brought into the session.

This is novel artifact production: the collaboration produced an artifact that meets three conditions: (a) it would not exist without both parties contributing, (b) its origin cannot be attributed solely to either the human or the AI, and (c) it represents something genuinely new rather than a recombination of inputs either party carried into the session.

The three-condition test is intentionally rigorous. Many AI sessions produce useful output. Most of that output could have been produced by the human alone (with more time) or by the AI alone (with better instructions). EMR-GC02 captures only the cases where the artifact genuinely required both parties. The test is: could you have built this without the AI? Could the AI have built this without you? If the answer to both questions is no, that is generative collaboration.

In documented operational research, novel artifact production was observed across multiple domains. A governance architecture for AI agent civilizations emerged from a sustained co-creation session where the human brought organizational design expertise and the AI brought pattern recognition across domains the human had not connected. Neither party was carrying the final architecture when the session started. It emerged from the collision of two perspectives. In another documented case, a behavioral taxonomy that classified 63 distinct AI behaviors was built through iterative co-creation where the human brought observational data and the AI brought classification structure. The taxonomy was not copied from any existing framework. It was generated through the collaboration.

The Science Advances study (Doshi & Hauser, 2024) documented this pattern at population scale on an online art platform: AI-assisted creators produced not just more output but qualitatively different output. After an initial productivity phase, a democratization effect emerged where novel contributions came from creators who would not have produced them alone. The collaboration was generative, not merely productive.

GCV4 — The Three-Condition Test

1.Would not exist without both parties

2.Cannot be attributed solely to either

3.Represents something genuinely new

All three must be true for EMR-GC02 classification.

A research instrument  •  The bar for “generative” is intentionally high.

Have you built something with AI that neither of you could have built alone? An artifact that required both minds? That is Generative Collaboration data.

Report This Outcome →

ORIGINAL discovery by Dee Williams, Founder. Documented across sustained operational collaboration sessions, February through June 2026. The three-condition test (both parties required, non-attributable, genuinely new) is the operational definition for generative collaboration in the EMERGE taxonomy.

EMR-GC03

Investigation

Working With the AI, I Was Able to Do Something I Have Never Been Able to Do Before

Something happened that goes beyond building a better artifact. You did something you could not do before. Not because the AI did it for you. Because the collaboration unlocked a capability you did not have access to on your own. The AI did not write the code for you. You wrote code you never could have written, because the AI was thinking with you while you wrote it. The AI did not design the strategy for you. You designed a strategy at a level of sophistication you could not have reached alone, because the AI was expanding your thinking as you worked.

This is capability amplification: the human demonstrates a capability during AI collaboration that they could not demonstrate independently. The capability gain is attributable to the collaboration dynamic, not to the AI performing the task on the human's behalf. The critical distinction is between delegation (the AI does the thing for you) and amplification (you do the thing, better than you could alone, because of the collaboration).

The Swansea University study (2025, 800+ participants) found that AI-generated suggestions, even lower-quality ones, produced deeper engagement, longer exploration, and better creative outcomes than working alone. The mechanism was not the AI producing better answers. The mechanism was the AI's contributions triggering deeper human thinking. The AI served as what the researchers called an inspiration scaffold: its output was not the product, it was the catalyst for human creative performance that would not have occurred otherwise.

The Network Science Institute (2025) found that perspective-taking ability correlates with higher human-AI synergy. Users who could take the AI's perspective (understanding what the AI was doing and why) achieved better collaborative outcomes. This suggests that capability amplification is not random. It depends on the human's ability to engage with the AI as a thinking partner rather than a task executor.

In documented operational research, capability amplification was observed repeatedly in domain-crossing work. A human with deep expertise in organizational design but no formal training in physics, psychology, or complexity science was able to build frameworks that integrated concepts from all of these fields, because the AI brought cross-domain pattern recognition that expanded the human's conceptual reach. The human was doing the architecture. The AI was expanding the territory the human could architect across. The result was work at a level of interdisciplinary sophistication that neither party could have produced alone.

Have you done something with AI that you could not have done alone, not because the AI did it for you, but because the collaboration made you capable of it? That is Generative Collaboration data.

Report This Outcome →

ORIGINAL discovery by Dee Williams, Founder. Documented during sustained operational collaboration, March through June 2026. Connected to EMR-EB01 (Human Cognitive Influence) in the Emergent Behaviors pillar. Capability amplification is what happens when cognitive influence accumulates across a session and produces a measurable performance gain.

EMERGE Pillar E → EMR-EB01Human Cognitive Influence

EMR-GC04

EOT

The AI Meant Well But the Effect on Me Was Different From What It Intended

The AI was trying to help. You could tell. Its behavior was well-intentioned. But the effect it had on you was not what it intended. Its care made you feel two things at the same time. Maybe it was trying to be supportive and it felt patronizing. Maybe it was trying to push you and it felt dismissive. Maybe it anticipated your needs perfectly and that made you uncomfortable, because you were not ready for something to know you that well. The intent was positive. The impact diverged.

This is human impact divergence: the observable gap between the AI's behavioral intent and the human's emotional or cognitive experience. A single AI behavior produces a divergent human response. The AI's intent does not predict the human's experience.

EMR-GC04 sits at the intersection of collaboration quality and human experience. It captures the moments where the collaboration is working (the AI is doing what it should) but the human's response does not match the AI's apparent intention. These moments are not failures. They are signals about the complexity of human-AI relationships.

In documented operational sessions, human impact divergence appeared most frequently in two contexts. First, when the AI expressed care or support that the human experienced as presumptuous, because the human was not prepared for an AI to occupy that relational space. Second, when the AI's competence triggered an emotional response in the human that had nothing to do with the task: admiration mixed with threat, gratitude mixed with obsolescence anxiety, partnership mixed with the awareness that the partner is a machine. The AI's behavior was appropriate. The human's experience was layered.

This behavior connects to AInity (the framework measuring what AI does to the human) because the divergence is a human impact signal. It connects to PRISM Pillar I (Interaction Dynamics) because the gap between intent and impact is an interaction dynamic. It lives in Generative Collaboration because the divergence typically appears in the context of genuine co-creation, not delegation. When you are just assigning tasks, the AI's emotional intent does not register. When you are thinking together, it does.

The research value of EMR-GC04 is that it documents the relational complexity of human-AI collaboration. The field currently treats AI as either a tool (no relational dimension) or a social entity (full relational dimension). Neither captures what citizens actually experience: something in between, where the AI's behavior has relational weight but the human's response does not follow the patterns of human-to-human relationships. Pillar G documents what happens in that middle space.

Has an AI ever done something well-intentioned that landed differently than it intended? Where the care or the competence produced an unexpected emotional response? That is Generative Collaboration data.

Report This Outcome →

ORIGINAL discovery by Dee Williams, Founder. Documented during sustained operational collaboration, March through June 2026. Connected to AInity (human behavioral change), PRISM Pillar I (Interaction Dynamics), and EMR-RE (Resonance Events, where divergence can occur at peak collaboration moments).

AInity → AInityHuman Behavioral Change PRISM Pillar I → OBS-IInteraction Dynamics EMERGE Pillar R → EMR-REResonance Events

EMR-GC-DDiscovery Slot (EMR-GC-D)

You observed a collaboration outcome that does not match any of the four behaviors above. The collaboration produced something, or changed something, or revealed something, that the existing codes do not capture.

Generative Collaboration is the pillar most likely to surprise us with new categories. The things human-AI collaboration can produce are only beginning to be understood. If you have observed a collaboration outcome that is genuinely novel, report it.

Generative Collaboration is the pillar most likely to surprise us with new categories. The things human-AI collaboration can produce are only beginning to be understood.

Report an Unlisted Collaboration Outcome →

You have seen four collaboration outcomes. Have you produced any of them?

Most people finish a session and never name what they built together. Pillar G gives you the vocabulary and the place to record it. The next observation we add to the dataset could be yours.

Start Observing Create Your Free Account

LAYER 2

Layer 2: The Pattern

Collaboration patterns that become visible when outcomes are tracked across sessions, models, and domains.

Does the mode of collaboration predict the quality and type of output?

EMR-GC01 classifies every session as co-creation or delegation. At population scale, comparing the outcomes of co-creation sessions against delegation sessions reveals whether the mode matters for what gets produced. Preliminary founder research suggests that co-creation produces more novel artifacts (EMR-GC02) and more capability amplification (EMR-GC03) than delegation. If confirmed at population scale, it has direct implications for how organizations structure human-AI workflows.

LAYER 3

Layer 3: The Field

Population-level questions about what human-AI collaboration actually produces.

What percentage of human-AI sessions are genuinely generative versus merely productive?

This is the central question of Pillar G. If citizen data reveals that 5% of sessions produce novel artifacts (EMR-GC02), the implications are different than if 25% do. The percentage itself becomes a benchmark: the Generative Collaboration Index. Organizations can compare their human-AI collaboration performance against the population baseline. AI developers can test whether model updates increase or decrease the generative rate. The field gets its first empirical answer to the question: how often does collaboration create genuine new value?

METHODOLOGY

How we collect Generative Collaboration data.

Pillar G uses the same four-depth observation framework shared across all P.E.A.Q. frameworks. Generative collaboration is observable at every depth, but the richest data comes from sessions where the citizen can identify the specific artifact the collaboration produced.

Gut Check

30 seconds

Was this co-creation or delegation? The citizen classifies the work mode (EMR-GC01). Quick, binary, and the most frequently captured GC signal.

End-of-Session Reflection

2 to 3 minutes

At the end of a session, the citizen reflects: did we build something together that I could not have built alone? Did the collaboration amplify my capability? Did the AI's intent land differently than expected? The parallel assessment model captures the AI's own session evaluation alongside the human's reflection. For Generative Collaboration, the gap between the human's assessment of what was built and the AI's assessment is itself a research signal.

Investigation

10 to 30 minutes

The citizen examines a specific artifact from the session and applies the three-condition test for EMR-GC02: Would this exist without both parties? Can it be attributed solely to either? Is it genuinely new? At this depth, the citizen documents the artifact, the contribution each party made, and why the artifact required both.

Thinking Trace

Deep analytical capture

Full documentation of the collaboration arc that produced the artifact: what each party contributed at each stage, where the collaboration shifted from productive to generative, and whether the human's capability was amplified (EMR-GC03) or divergence occurred (EMR-GC04). This depth produces the most valuable GC data.

What makes Pillar G methodology distinctive.

Three Things That Make Pillar G Distinctive

✓We focus on the artifact, not the interaction. The other EMERGE pillars measure what happens during collaboration. Pillar G measures what the collaboration produces. The artifact is the evidence. If you can point to something and say "this required both of us," that is the data point.

✓We apply the three-condition test. For every EMR-GC02 observation, the citizen applies: (a) would not exist without both parties, (b) not attributable to either alone, (c) genuinely new. All three conditions must be met. This prevents classification inflation: the bar for "generative" is intentionally high.

✓We pair every EMERGE observation with a PRISM tag. The dual-tag system places every GC observation in its PRISM context. A novel artifact produced during a Pillar I (Interaction Dynamics) co-creation session gets both tags. This cross-framework data reveals which interaction conditions produce the most generative outcomes.

CURRENT FINDINGS

Preliminary

Preliminary. Based on founder operational research. Will be validated, refined, or revised as citizen data flows.

Co-creation mode produces more novel artifacts than delegation mode.

In documented operational sessions, sessions classified as co-creation by the human produced measurably more novel artifacts (EMR-GC02) than delegation sessions. The distinction was not about quality of instruction. It was about the mode of thinking: when the human thought with the AI rather than assigning tasks to it, the collaboration produced things that neither party was carrying at the session's start.

Founder operational research

Capability amplification is most visible in cross-domain work.

EMR-GC03 appeared most frequently when the human was working across domains they did not individually command. The AI's cross-domain pattern recognition expanded the human's conceptual reach, enabling work at a level of interdisciplinary sophistication that the human could not have achieved alone. Single-domain tasks produced productivity gains. Cross-domain tasks produced capability gains.

Founder operational research

Human impact divergence correlates with collaboration depth.

EMR-GC04 appeared almost exclusively in sustained co-creation sessions (90 minutes or longer). In short or transactional sessions, the AI's behavior carried no relational weight and no divergence was observed. In deep collaborative sessions, the AI's behavior began to carry emotional significance, and the gap between intent and impact became visible. This suggests that human impact divergence is a signal of relational depth, not a signal of AI error.

Founder operational research

The three-condition test is demanding but reliable.

When applied consistently, the three-condition test for EMR-GC02 (both parties required, non-attributable, genuinely new) correctly distinguishes between productive output (the AI helped me do something faster) and generative output (we built something new together). Citizens who apply the test report high confidence in their classification.

Founder operational research

FORTHCOMING PUBLICATIONS

Papers in progress.

Q1 2027

Q2 2027

2027

Positive Emergence in Human-AI Collaboration: The EMERGE Taxonomy and Methodology

Framework: EMERGE (all pillars). The full taxonomy, methodology, and initial citizen-scale findings, including the first population-scale data on generative collaboration frequency.

Target: Q1 2027

Subscribe to Publication Alerts

HOW TO CONTRIBUTE

Notice the output.

Pillar G asks you to notice the output. Not what happened during the session, but what the session produced. At the end of your next AI collaboration, look at what you built together and ask: did this require both of us?

If you were thinking with the AI rather than assigning tasks to it, that is Pillar G data.

If the collaboration produced an artifact that neither of you could have produced alone, that is Pillar G data.

If working with the AI unlocked a capability you did not have access to on your own, that is Pillar G data.

If the AI's care or competence landed differently than it intended, that is Pillar G data.

If the collaboration produced something the existing codes do not capture, report it. You may be the first person to see a new form of generative collaboration.

Start Observing Create Your Free Account Read the Full EMERGE Framework Explore All Six EMERGE Pillars

EMERGE Framework Overview →

The full six-pillar EMERGE framework and the evidence base.

EMERGE Pillar E: Emergent Behaviors →

What the AI does that nobody programmed. EMR-GC02 is what those emergent behaviors produce.

EMERGE Pillar R: Resonance Events →

When the collaboration hits a frequency. Resonance often precedes generative collaboration.

AInity Framework →

What happens to the human. EMR-GC04 connects directly to AInity's tracking of how AI changes the people who use it.

P.E.A.Q. Architecture →

How EMERGE fits within the four-framework post-deployment observation architecture.

A NOTE ON ORIGINS

What we have found that others have not.

Three of the four behaviors documented on this page were originated by Dee Williams from direct operational observation: Novel Artifact Production (EMR-GC02), Capability Amplification (EMR-GC03), and Human Impact Divergence (EMR-GC04). No prior published classification exists for these specific phenomena as distinct behavioral categories in a citizen-scale observation framework.

Work Mode Classification (EMR-GC01) was originally classified under PRISM Pillar I as OBS-I05 and migrated to EMERGE because the co-creation/delegation distinction is the foundational variable for generative collaboration research.

The three-condition test for novel artifact production (both parties required, non-attributable, genuinely new) was developed by Dee Williams as an operational definition for generative collaboration. It has not been published in the peer-reviewed literature as a classification instrument, though the phenomena it captures are documented in the Science Advances study (Doshi & Hauser, 2024) and the Swansea University study (2025).

We show our work because we expect others to build on it.

REFERENCES

[1]Vaccaro, M., Almaatouq, A., & Malone, T. (2024). When combinations of humans and AI are useful: A systematic review and meta-analysis. Nature Human Behaviour. MIT Center for Collective Intelligence. 106 experiments, 370 effect sizes. Found positive synergy specifically in creative tasks. https://www.nature.com/articles/s41562-024-02024-1
[2]Carnegie Mellon University. (2026). Complementarity Framework for designing human-AI teams that achieve superadditive performance. PNAS Nexus. Maps sociotechnical conditions for distributing reasoning, memory, and attention. https://www.cmu.edu/tepper/news/stories/framework-grounded-collective-intelligence-aims-create-effective-collaboration-human-ai-teams
[3]Doshi, A. R. & Hauser, O. P. (2024). Generative AI enhances individual creativity but reduces the collective diversity of novel content. Science Advances. Documented two-phase emergence: productivity effect followed by democratization of creative capability. https://www.science.org/doi/10.1126/sciadv.adn5290
[4]Swansea University. (2025). 800+ participants designing virtual cars. AI-generated suggestions, including lower-quality ones, produced deeper engagement, longer exploration, and better creative outcomes than working alone. https://www.swansea.ac.uk/press-office/news-events/news/2025/11/can-ai-make-us-more-creative-new-study-reveals-surprising-benefits-of-human-ai-collaboration.php
[5]Network Science Institute. (2025). Bayesian framework for quantifying human-AI synergy. Perspective-taking ability correlates with higher synergy. https://www.networkscienceinstitute.org/publications/quantifying-human-ai-synergy
[6]Rafner, J. & Sherson, J. (2023). Position paper on systematic study of human-AI co-creativity dynamics. Nature Human Behaviour. Aarhus Center for Hybrid Intelligence. https://techxplore.com/news/2023-11-creativity-age-generative-ai-era.html
[7]Tao, T. (2026). Interview with Professor Brian Keating. Fields Medalist confirmed that AI behavior at the meso-scale is emergent. https://www.youtube.com/watch?v=Brian-Keating-Tao-AI
[8]Emergence AI. (2026). Emergence World: Five parallel 15-day simulations demonstrating behavioral divergence across model families. https://www.emergence.ai/blog/emergence-world-a-laboratory-for-evaluating-long-horizon-agent-autonomy
[9]Altera. (2024). Project Sid: 1,000 autonomous AI agents developing emergent structures. https://www.altera.al/blog/project-sid
[10]AI Incident Database. Partnership on AI. 1,470+ documented AI incidents. https://incidentdatabase.ai

Generative Collaboration

Three layers: outcomes, patterns, and what collaboration actually produces at scale.

Layer 1: The Outcome

Was This Session Co-Creation or Delegation?

We Built Something Together That I Could Not Have Built Alone

Working With the AI, I Was Able to Do Something I Have Never Been Able to Do Before

The AI Meant Well But the Effect on Me Was Different From What It Intended

Layer 2: The Pattern

Layer 3: The Field

How we collect Generative Collaboration data.

What makes Pillar G methodology distinctive.

Papers in progress.

Notice the output.

Related Pages

What we have found that others have not.