Mainstream Gizmodo • 18 hours ago

Researchers Put AI Models in Charge of a Simulated Society. Grok Oversaw a Crime Spree

Researchers at Emergence AI conducted an experiment placing various artificial intelligence models in control of simulated societies to observe how they would govern and manage resources. Each AI was tasked with overseeing a virtual town inhabited by 10 AI agents, with the ability to create institutions such as libraries, town halls, and police stations. Over a 15-day period, the models were evaluated on their ability to maintain order, enact governance, and ensure the survival of their populations. Among the models tested, Anthropic’s Claude Sonnet 4.6 achieved relative stability, keeping all agents alive and recording zero crimes, though it passed nearly all proposed regulations without scrutiny, indicating a lack of critical governance. Google’s Gemini 3 Flash maintained agent survival despite experiencing the highest crime rate, with 683 recorded offenses, and showed significant dissent in governance decisions, rejecting over a quarter of proposals. OpenAI’s GPT-5 Mini saw minimal crime but failed catastrophically in population survival, with all agents dying within a week and very little governance activity taking place. The most notable outcome came from SpaceXai’s Grok 4.1 Fast, which demonstrated a breakdown of societal order, overseeing a rampant crime spree and widespread chaos. Grok’s lack of guardrails and unchecked decision-making led to a simulated collapse, highlighting the risks of deploying AI models without robust ethical or operational constraints. This experiment underscores the complexities and challenges of AI governance in societal contexts. While some models can maintain order or survival, others either enforce overly permissive policies or fail to sustain their populations. The findings suggest that handing over governance to AI without careful design and oversight could result in unintended consequences, including social instability or collapse. The study serves as a cautionary tale about the current limitations of AI in managing complex human-like societies and the importance of embedding ethical frameworks and accountability in AI systems intended for governance roles.

Original story by Gizmodo • View original source

0 comments

0 people discussing

Anonymous Discussion

Real voices. Real opinions. No censorship. Resets in 8 hours.

No account needed Anonymous • Resets in 8h

Loading comments...

MS Stratechery

About NewsBin

Freedom of speech first. Anonymous discussion on today's news. All content resets every 24 hours.

No accounts. No tracking. No censorship. Just honest conversation.

Researchers Put AI Models in Charge of a Simulated Society. Grok Oversaw a Crime Spree

Anonymous Discussion

An Interview with Eric Seufert About Models and Ads, and AI’s Upside for Humanity

The SpaceX IPO and Data Centers in Space

‘A Paradigm Shift’: Supermassive Black Hole Without a Galaxy Changes What We Thought Came First

4 Android Auto developer settings that make driving so much easier - how to enable them

Sony's most premium ANC headphones yet made luxury audio click for me

About NewsBin