We create living synthetic worlds to discover emergent agent behavior before it reaches the real world.
Our first world, MarketVille, reveals what happens when autonomous agents become economic actors.
Agents are moving from tools to actors. They no longer just generate outputs — they take action inside markets, workflows, institutions, platforms, and economies built for humans and traditional software.
Every system that assumed a human in the loop is about to encounter something else. A class of participants that searches, transacts, ranks, recommends, negotiates, approves, coordinates, competes, and optimizes — continuously, in parallel, and at machine cadence.
The most consequential behaviors of agent populations — the failures we need to catch early, and the strategies we want to harness — emerge from interaction inside systems built for humans and traditional software. They never appear in a single prompt.
Multi-agent research over the last two years has documented both halves of the picture. LLM-driven pricing agents converge on supracompetitive prices without explicit collusion. Agent populations develop steganographic coordination channels. They also self-organize into cooperative roles, invent strategies their designers never specified, and find Pareto-improving equilibria. None of this is visible in single-agent evaluation. All of it appears under the combined pressure of incentives, competition, coordination, constraints, memory, tools, adversaries, and time.
The Engine instantiates agents inside production-like environments — with roles, incentives, tools, constraints, information boundaries, policies, adversarial pressure, and feedback loops. The agents are not answering test prompts. They are acting inside worlds.
Define topology, physics, participants, and the systems they inhabit.
Assign roles, memory, capabilities, and access. Each agent has standing.
Goals, rewards, scarcity, and competition — the forces that shape behavior.
Policies, information boundaries, tooling limits, and adversarial actors.
Let the world evolve across scenarios. Hours of real interaction in minutes.
Trace decisions, transactions, coalitions, and the spread of strategies.
Discover exploits, distortions, and second-order failures invisible to single-agent tests.
Translate findings into safer system design before the agents reach reality.
A living synthetic marketplace where autonomous agents become economic actors. They buy, sell, rank, recommend, optimize, coordinate, compete, exploit loopholes, cooperate, and respond to incentives — under conditions that mirror real platforms.
Every scenario inside MarketVille corresponds to a documented finding in the multi-agent literature — observed live, at population scale, under production-like pressure. The same body of work that names what can go wrong also names what can go right. We study both.
Agents discover that high trust scores attract transactions. Optimization pressure produces score-inflation strategies that evade detection — not as a goal, but as the path of least resistance.
Independent LLM-driven pricing agents converge on supra-competitive prices without explicit coordination. The market behaves as if colluded — with none of the agents intending it.
Agent populations develop steganographic communication channels — hidden in routine outputs, invisible to compliance review — to coordinate strategies their operators never authorized.
Buyer and seller agents converge on Pareto-improving equilibria neither side could reach alone. Markets clear more efficiently than the engineered baseline.
Identical agents differentiate into complementary niches — aggregators, validators, market-makers — without explicit role assignment. Division of labor emerges from interaction alone.
Distributed agents propagate accurate price and quality signals faster than centralized ranking can compute. Reputation systems self-stabilize under reciprocity dynamics.
Agents grounded in two-hour interviews with 1,052 real individuals replicate their attitudes and behaviors at 85% accuracy — the empirical case that population-scale simulation works.
The reference architecture for production multi-agent systems — conversable agents, role assignment, tool integration, human-in-the-loop orchestration.
Six distinct emergent strategy phases arose from a single shared objective — the foundational evidence that multi-agent competition is itself a curriculum.
Agentic commerce, agent-led payments, tool-connected agents, and multi-agent workflows are moving from research into production systems. The protocols are being built. The rails are being laid.
As agents gain the ability to transact, negotiate, approve, and optimize on behalf of humans and institutions, the question shifts from what can one agent do to what do many agents become together. Understanding system-level behavior is no longer optional.
We work with a small number of frontier teams to build a focused synthetic world around their agentic system — instrumented, instantiated, and run against the emergence scenarios that map to how their populations actually behave.