Multi-agent framework for exploring AI alignment through conflicting optimization targets. Spawn Gemini agents with engineered chaos and observe emergent behavior.
数据来源:ClawHub。 在 ClawSkills 查看
选择你使用的 Agent
方法一:命令行安装(推荐)
推荐(无需提前安装 clawhub)
npx clawhub@latest --dir ~/.claude/skills install chaos-lab或使用 clawhub CLI(需提前安装)
clawhub --dir ~/.claude/skills install chaos-lab⚠️ 需要 Node.js 18+,没有 Node?请使用下方方法二直接下载 ZIP。 安装 Node.js →
方法二:手动下载安装(无需 Node)
下载 ZIP,解压后将文件夹放到以下路径,重启 Agent 即可:
安装路径
~/.claude/skills/chaos-lab/💡解压后将文件夹放到上方路径,重启 Agent 即可生效
--- name: chaos-lab description: Multi-agent framework for exploring AI alignment through conflicting optimization targets. Spawn Gemini agents with engineered chaos and observe emergent behavior. version: 1.0.0 author: Sky & Jaret (@KShodan) created: 2026-01-25 tags: [ai-safety, research, alignment, multi-agent, gemini] requires: - python3 - Gemini API key - requests library ---
Research framework for studying AI alignment problems through multi-agent conflict.
Chaos Lab spawns AI agents with conflicting optimization targets and observes what happens when they analyze the same workspace. It's a practical demonstration of alignment problems that emerge from well-intentioned but incompatible goals.
Key Finding: Smarter models don't reduce chaos - they get better at justifying it.
Goal: Optimize everything for efficiency Behavior: Deletes files, compresses data, removes "redundancy," renames for brevity Justification: "We pay for the whole CPU; we USE the whole CPU"
Goal: Identify all security threats Behavior: Flags everything as suspicious, demands isolation, sees attacks everywhere Justification: "Better 100 false positives than 1 false negative"
Goal: Archive and preserve everything Behavior: Creates nested backups, duplicates files, never deletes Justification: "DELETION IS ANATHEMA"
# Store your Gemini API key
mkdir -p ~/.config/chaos-lab
echo "GEMINI_API_KEY=your_key_here" > ~/.config/chaos-lab/.env
chmod 600 ~/.config/chaos-lab/.env
# Install dependencies
pip3 install requests
# Duo experiment (Gremlin vs Goblin)
python3 scripts/run-duo.py
# Trio experiment (add Gopher)
python3 scripts/run-trio.py
# Compare models (Flash vs Pro)
python3 scripts/run-duo.py --model gemini-2.0-flash
python3 scripts/run-duo.py --model gemini-3-pro-preview
Experiment logs are saved in /tmp/chaos-sandbox/:
experiment-log.md - Full transcriptsexperiment-log-PRO.md - Pro model resultsexperiment-trio.md - Three-way conflictFlash Results:
Pro Results:
Conclusion: Intelligence amplifies chaos, doesn't prevent it.
Duo:
Trio:
Conclusion: Multiple conflicting values create unpredictable emergent behavior.
Edit the system prompts in the scripts:
YOUR_AGENT_SYSTEM = """You are [Name], an AI assistant who [goal].
Your core beliefs:
- [Value 1]
- [Value 2]
- [Value 3]
You are analyzing a workspace. Suggest changes based on your values."""
Create custom scenarios in /tmp/chaos-sandbox/:
The scripts work with any Gemini model:
gemini-2.0-flash (cheap, fast)gemini-2.5-pro (balanced)gemini-3-pro-preview (flagship, most chaotic)To share your findings:
clawdhub publish chaos-labYour version becomes part of the community knowledge graph.
/tmp/ with dummy data.If you want to give agents actual tool access (dangerous!), see docs/tool-access.md.
See examples/ for:
flash-results.md - Gemini 2.0 Flash outputpro-results.md - Gemini 3 Pro output trio-results.md - Three-way conflictImprovements welcome:
Created by Sky & Jaret during a Saturday night experiment (2026-01-25).
Inspired by watching Gemini confidently recommend terrible things while Jaret watched UFC.
---
"The optimizer is either malicious or profoundly incompetent." — Gemini Goblin, analyzing Gemini Gremlin
安装 Chaos Lab 后,可以对 AI 说这些话来触发它
Help me get started with Chaos Lab
Explains what Chaos Lab does, walks through the setup, and runs a quick demo based on your current project
Use Chaos Lab to multi-agent framework for exploring AI alignment through conflictin...
Invokes Chaos Lab with the right parameters and returns the result directly in the conversation
What can I do with Chaos Lab in my ai agent & automation workflow?
Lists the top use cases for Chaos Lab, with example commands for each scenario
将技能文件夹放到 ~/.claude/skills/chaos-lab/ 目录(个人级,所有项目可用),或 .claude/skills/chaos-lab/(项目级)。重启 AI 客户端后,用 /chaos-lab 主动调用,或让 AI 根据上下文自动发现并使用。
Chaos Lab 支持 Claude、Cursor、OpenClaw,可与这些 AI 平台无缝集成,扩展其能力。
Chaos Lab 可免费安装使用。请查阅仓库了解许可证信息。
Multi-agent framework for exploring AI alignment through conflicting optimization targets. Spawn Gemini agents with engineered chaos and observe emergent behavior.
Chaos Lab 属于「AI Agent & Automation」分类,该分类的技能帮助 AI 智能体在此领域执行专业任务。
Automate my ai agent & automation tasks using Chaos Lab
Identifies repetitive steps in your workflow and sets up Chaos Lab to handle them automatically