A breakthrough study confirms the premise behind what we've been building: AI models trained on human behavior can predict how people think, choose, and act across entirely new situations. The implications for market research are massive.
The Study: Centaur and Psych-101
Researchers fine-tuned a large language model on Psych-101, a dataset containing more than 10 million human choices from 60,000 participants across 160 experiments. The resulting model, called Centaur, doesn't just predict behavior in familiar scenarios: it generalizes to new cover stories, different task structures, and entirely unseen domains.
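For readers who want to see the shape of this, here is a minimal sketch of that kind of fine-tune using Hugging Face transformers and peft. This is not the authors' code: the base model, dataset file, and hyperparameters below are stand-ins, and the real training run was far larger.

```python
# Minimal sketch: fine-tune a causal LM on natural-language transcripts of
# behavioral experiments. Model name, file path, and hyperparameters are
# placeholders, not the study's actual setup.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

base = "meta-llama/Llama-3.1-8B"  # stand-in base model
tok = AutoTokenizer.from_pretrained(base)
tok.pad_token = tok.eos_token

model = AutoModelForCausalLM.from_pretrained(base)
# Parameter-efficient adapter so only a small fraction of weights train.
model = get_peft_model(model, LoraConfig(r=8, lora_alpha=16,
                                         target_modules=["q_proj", "v_proj"]))

# Each record is one experiment transcript: instructions plus the
# participant's trial-by-trial choices rendered as plain text.
data = load_dataset("json", data_files="psych101_transcripts.jsonl")["train"]
data = data.map(lambda ex: tok(ex["text"], truncation=True, max_length=2048),
                remove_columns=data.column_names)

Trainer(
    model=model,
    args=TrainingArguments(output_dir="centaur-sketch",
                           per_device_train_batch_size=1,
                           num_train_epochs=1),
    train_dataset=data,
    data_collator=DataCollatorForLanguageModeling(tok, mlm=False),
).train()
```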
This matters because it demonstrates that synthetic humans, when properly calibrated with real behavioral data, can reliably simulate population-level behavior without recruiting a single respondent.
What Makes This Different From Other AI
Most models of behavior are domain-specific. AlphaGo plays Go. Prospect theory explains risky choice. Neither tells you much beyond its narrow focus.
Centaur is different. It captures the full distribution of human behavior: not just the average response, but the entire range of strategies people actually use. In one experiment (the two-step task), Centaur produced both purely model-free and purely model-based learning trajectories, matching the bimodal distribution researchers see in real participants.
This is exactly what population-true digital twins should do: represent the heterogeneity of human behavior, not collapse it into a single answer.
Three Results That Matter for Research
1. Better predictions than traditional cognitive models
Centaur outperformed 14 domain-specific cognitive models at predicting the choices of held-out participants. The average improvement was 0.13 in log-likelihood, a statistically significant and meaningful gain across nearly two million responses.
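The metric itself is simple. Here is a minimal sketch, with made-up numbers, of how a per-response log-likelihood comparison works: each model assigns a probability to the response a held-out participant actually gave, and you compare the average log of those probabilities.

```python
# Minimal sketch of the evaluation metric; the numbers are illustrative.
import numpy as np

# Each array holds a model's predicted probability of the response each
# held-out participant actually gave on a trial.
p_centaur = np.array([0.72, 0.55, 0.81, 0.40])
p_baseline = np.array([0.60, 0.50, 0.70, 0.45])

# Per-response gain in log-likelihood; a positive mean says Centaur assigns
# higher probability to what people actually did.
delta = np.log(p_centaur) - np.log(p_baseline)
print(delta.mean())
```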
2. Generalization without retraining
The model handled new cover stories (magic carpets instead of spaceships), structural changes (three-armed bandits instead of two), and entirely new domains (logical reasoning) without additional training. Traditional models require complete recalibration for these scenarios.
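This is the practical payoff of the LLM framing: a new scenario is just a new prompt. A minimal sketch, assuming a merged and saved fine-tuned checkpoint at a placeholder path, and glossing over tokenizer details:

```python
# Minimal sketch: query a fine-tuned model on a brand-new task without any
# retraining. "centaur-sketch" is a placeholder path to a merged checkpoint.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("centaur-sketch")
model = AutoModelForCausalLM.from_pretrained("centaur-sketch")

# A cover story the model never saw in training: magic carpets, three options.
prompt = (
    "You will choose between three magic carpets, B, K, and T.\n"
    "Each trip earns gold coins. You choose <<"
)
inputs = tok(prompt, return_tensors="pt")
with torch.no_grad():
    next_token_logits = model(**inputs).logits[0, -1]

# Compare the model's probability for each option token (tokenization is
# simplified here; real option tokens may carry leading spaces).
probs = torch.softmax(next_token_logits, dim=-1)
for option in ["B", "K", "T"]:
    token_id = tok.encode(option, add_special_tokens=False)[0]
    print(option, round(probs[token_id].item(), 4))
```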
3. Alignment with neural activity
Even though Centaur was trained only on behavioral data, its internal representations aligned with human fMRI activity. This suggests the model isn't just mimicking surface patterns; it's learning something meaningful about how humans process information.
What This Means for Market Research
The study validates the core promise of synthetic research: you can build models that capture real human behavior at scale, and those models will generalize to new contexts without collecting fresh data every time.
This has direct applications for anyone doing consumer research:
Faster iteration
Test messaging, positioning, and product concepts without waiting weeks for recruitment and fielding. Run exploratory research in hours, not weeks.
Population-true sampling
Traditional research often relies on panels that skew toward people who like taking surveys. Synthetic research can be calibrated to actual population distributions: demographic, psychographic, and behavioral.
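Mechanically, this is classic survey weighting applied to a synthetic panel. A minimal sketch of post-stratification, with made-up column names and census targets:

```python
# Minimal sketch: reweight a panel to known population margins. The data,
# groups, and targets are invented for illustration.
import pandas as pd

panel = pd.DataFrame({
    "age_band": ["18-34", "18-34", "35-54", "55+"],
    "response": [1, 0, 1, 1],
})
population_share = {"18-34": 0.30, "35-54": 0.35, "55+": 0.35}  # census targets

# Weight each respondent by population share divided by panel share.
panel_share = panel["age_band"].value_counts(normalize=True)
panel["weight"] = panel["age_band"].map(
    lambda g: population_share[g] / panel_share[g]
)

# The weighted estimate reflects the population mix, not the panel mix.
print((panel["response"] * panel["weight"]).sum() / panel["weight"].sum())
```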
Scenario testing at scale
Want to know how people respond to 50 different headlines? Or how behavior changes across 20 different price points? Synthetic research makes this economically feasible.
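The workflow is an exhaustive sweep rather than a sampled subset. A minimal sketch, where `ask_twin` is a hypothetical stand-in for whatever interface your synthetic-respondent system exposes; everything here is illustrative:

```python
# Minimal sketch: score every headline x price combination with every persona.
from itertools import product

headlines = [f"Headline variant {i}" for i in range(50)]
price_points = [9.99 + 5 * i for i in range(20)]
personas = [{"segment": "value-seeker"}, {"segment": "early-adopter"}]

def ask_twin(persona: dict, prompt: str) -> float:
    """Hypothetical: return a purchase-intent score from one digital twin."""
    return 0.5  # replace with a real call to your own system

results = {
    (h, p): [ask_twin(persona, f"{h} at ${p:.2f}") for persona in personas]
    for h, p in product(headlines, price_points)
}
print(len(results))  # 1,000 scenarios, each scored by every persona
```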
The Say-Do Gap Still Matters
One critical finding: Centaur accurately predicted human responses in social games but failed to predict artificial agent responses with matched statistics. People aren't just probability distributions; we have intuitions, biases, and mental models that shape behavior in ways that pure statistics can't capture.
This reinforces why calibration matters. A model trained on behavioral data learns the patterns people actually exhibit, including the gap between what they say and what they do. That gap isn't noise; it's signal.
Where This Goes Next
The researchers propose using Centaur for "model-guided scientific discovery." They demonstrated this by using the model to identify patterns in decision-making data that led to a new, more accurate cognitive model.
This same approach works for commercial research. Use synthetic humans to:
Identify which consumer segments respond differently to messaging (a sketch of this follows the list)
Surface unexpected patterns in how people evaluate products
Generate hypotheses about why certain campaigns succeed or fail
The model becomes a tool for exploration, not just prediction.
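As a concrete example of the first item above, here is a minimal sketch, with invented column names and numbers, of flagging segments whose preferred message variant differs:

```python
# Minimal sketch: compare purchase intent by segment and message variant.
# All data here is made up for illustration.
import pandas as pd

df = pd.DataFrame({
    "segment": ["value-seeker", "value-seeker", "early-adopter", "early-adopter"],
    "variant": ["A", "B", "A", "B"],
    "intent":  [0.41, 0.39, 0.35, 0.62],
})

# Segments whose preferred variant differs are candidates for tailored
# messaging and follow-up hypotheses.
print(df.pivot_table(index="segment", columns="variant", values="intent"))
```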
Why Natural Language Matters
Centaur works because it was trained on experiments transcribed into natural language. This format allows the model to understand vastly different tasks, from memory tests to reward learning to logical reasoning, using a single unified representation.
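To make that concrete, here is a paraphrased illustration of what a transcribed experiment can look like. The wording is invented for this post, not copied from Psych-101, but it follows the same idea: render the task as text and mark the participant's actual responses so the model learns to predict them.

```text
You are playing a game with two slot machines, J and F.
On each trial, press a key to play one machine and see what you win.
You press <<J>>. You win 57 points.
You press <<F>>. You win 20 points.
You press <<J>>. You win 44 points.
```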
This is why we built Ditto on natural language prompts. The same digital twin that evaluates ad copy can also predict purchase behavior, test product features, and explore brand positioning. No custom models required for each domain.
The Bottom Line
A team of cognitive scientists just proved that AI models can capture how humans think, choose, and act: not just in controlled lab settings, but across new situations they've never seen before.
The question for anyone doing consumer research is no longer whether synthetic methods work. It's whether you're using them yet.
