Headless browser automation CLI for AI agents. Use when interacting with websites — navigating pages, filling forms, clicking buttons, taking screenshots, ex...
数据来源:ClawHub。 在 ClawSkills 查看
选择你使用的 Agent
方法一:命令行安装(推荐)
推荐(无需提前安装 clawhub)
npx clawhub@latest --dir ~/.claude/skills install openclaw-agent-browser或使用 clawhub CLI(需提前安装)
clawhub --dir ~/.claude/skills install openclaw-agent-browser⚠️ 需要 Node.js 18+,没有 Node?请使用下方方法二直接下载 ZIP。 安装 Node.js →
方法二:手动下载安装(无需 Node)
下载 ZIP,解压后将文件夹放到以下路径,重启 Agent 即可:
安装路径
~/.claude/skills/openclaw-agent-browser/💡解压后将文件夹放到上方路径,重启 Agent 即可生效
--- name: agent-browser description: Headless browser automation CLI for AI agents. Use when interacting with websites — navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, scraping, testing web apps, downloading files, or automating any browser task. Triggers on requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data", "test this web app", "login to a site", "monitor a page", or any task requiring programmatic web interaction. ---
Run scripts/setup.sh to install agent-browser and Chromium. Requires Node.js.
Every browser automation follows this pattern:
agent-browser open agent-browser snapshot -i (get element refs like @e1, @e2)agent-browser open https://example.com/form
agent-browser snapshot -i
# Output: @e1 [input type="email"], @e2 [input type="password"], @e3 [button] "Submit"
agent-browser fill @e1 "[email protected]"
agent-browser fill @e2 "password123"
agent-browser click @e3
agent-browser wait --load networkidle
agent-browser snapshot -i # Check result
Chain with && when you don't need intermediate output:
agent-browser open https://example.com && agent-browser wait --load networkidle && agent-browser snapshot -i
Run separately when you need to parse output first (e.g., snapshot to discover refs).
# Navigate
agent-browser open <url>
agent-browser close
# See the page (always do this first)
agent-browser snapshot -i # Interactive elements with refs
agent-browser snapshot -i -C # Include onclick divs
# Interact using @refs
agent-browser click @e1
agent-browser fill @e2 "text"
agent-browser select @e1 "option"
agent-browser press Enter
agent-browser scroll down 500
# Get info
agent-browser get text @e1
agent-browser get url
agent-browser get title
# Wait
agent-browser wait @e1 # For element
agent-browser wait --load networkidle # For network idle
# Capture
agent-browser screenshot page.png
agent-browser screenshot --full # Full page
agent-browser pdf output.pdf
For the full command reference, see references/commands.md.
Refs (@e1, @e2) are invalidated when the page changes. Always re-snapshot after:
agent-browser open https://example.com/signup
agent-browser snapshot -i
agent-browser fill @e1 "Jane Doe"
agent-browser fill @e2 "[email protected]"
agent-browser select @e3 "California"
agent-browser click @e5
agent-browser wait --load networkidle
agent-browser open https://app.example.com/login
agent-browser snapshot -i
agent-browser fill @e1 "$USERNAME" && agent-browser fill @e2 "$PASSWORD"
agent-browser click @e3
agent-browser wait --url "**/dashboard"
agent-browser state save auth.json
# Reuse later
agent-browser state load auth.json
agent-browser open https://app.example.com/dashboard
agent-browser open https://example.com/products
agent-browser snapshot -i
agent-browser get text @e5
agent-browser get text body > page.txt
agent-browser screenshot baseline.png
# ... changes happen ...
agent-browser diff screenshot --baseline baseline.png
agent-browser --session site1 open https://site-a.com
agent-browser --session site2 open https://site-b.com
agent-browser session list
export AGENT_BROWSER_CONTENT_BOUNDARIES=1 # Wrap output for AI safety
export AGENT_BROWSER_ALLOWED_DOMAINS="example.com" # Domain allowlist
export AGENT_BROWSER_MAX_OUTPUT=50000 # Prevent context flooding
Always close sessions when done: agent-browser close
安装 Agent Browser 后,可以对 AI 说这些话来触发它
Help me get started with Agent Browser
Explains what Agent Browser does, walks through the setup, and runs a quick demo based on your current project
Use Agent Browser to headless browser automation CLI for AI agents
Invokes Agent Browser with the right parameters and returns the result directly in the conversation
What can I do with Agent Browser in my ai agent & automation workflow?
Lists the top use cases for Agent Browser, with example commands for each scenario
将技能文件夹放到 ~/.claude/skills/openclaw-agent-browser/ 目录(个人级,所有项目可用),或 .claude/skills/openclaw-agent-browser/(项目级)。重启 AI 客户端后,用 /openclaw-agent-browser 主动调用,或让 AI 根据上下文自动发现并使用。
Agent Browser 支持 Claude、Cursor、OpenClaw,可与这些 AI 平台无缝集成,扩展其能力。
Agent Browser 可免费安装使用。请查阅仓库了解许可证信息。
Headless browser automation CLI for AI agents. Use when interacting with websites — navigating pages, filling forms, clicking buttons, taking screenshots, ex...
Agent Browser 属于「AI Agent & Automation」分类,该分类的技能帮助 AI 智能体在此领域执行专业任务。
Automate my ai agent & automation tasks using Agent Browser
Identifies repetitive steps in your workflow and sets up Agent Browser to handle them automatically