Automate browser actions locally via browser-use CLI/Python: open pages, click/type, screenshot, extract HTML/links, debug sessions, and capture login QR codes.
Source: ClawHub.
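Method 1: Install from the command line (recommended)
Recommended (clawhub does not need to be installed first):
npx clawhub@latest --dir ~/.claude/skills install browser-use-local
Or use the clawhub CLI (must be installed first):
clawhub --dir ~/.claude/skills install browser-use-local
⚠️ Requires Node.js 18+. Without Node, use Method 2 below to download the ZIP directly.
Method 2: Manual download and install (no Node required)
Download the ZIP, extract it, and place the folder at the path below; it takes effect after you restart the Agent.
Install path:
~/.claude/skills/browser-use-local/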
---
name: browser-use-local
description: Use when you need browser automation via the browser-use CLI or Python code in this OpenClaw container/host: open pages, click/type, take screenshots, extract HTML/links, or run an Agent with an OpenAI-compatible LLM (e.g. Moonshot/Kimi) using a custom base_url. Also use for debugging browser-use sessions (state empty, page readiness timeouts), and for extracting login QR codes from demo/login pages via screenshots or HTML data:image.
---
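Prefer browser-use over the OpenClaw browser tool here; the OpenClaw browser tool may fail if no supported system browser is present.
Pass --session with a session name (demo in the examples below) so that successive commands act on the same browser session.
1) Open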
browser-use --session demo open https://example.com
2) Inspect (sometimes state returns 0 elements on heavy/JS sites)
browser-use --session demo --json state | jq '.data | {url,title,elements:(.elements|length)}'
3) Screenshot (always works; best debugging primitive)
browser-use --session demo screenshot /home/node/.openclaw/workspace/page.png
4) HTML for link discovery (works even when state is empty)
browser-use --session demo --json get html > /tmp/page_html.json
python3 - <<'PY'
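import json
import re
# Load the HTML captured by `get html` above.
with open('/tmp/page_html.json') as f:
    html = json.load(f).get('data', {}).get('html', '')
urls = set(re.findall(r"https?://[^\s\"'<>]+", html))
keywords = ['demo', 'login', 'console', 'qr', 'qrcode']
# Print up to 200 URLs that look login- or QR-related.
for u in sorted(u for u in urls if any(k in u for k in keywords))[:200]:
    print(u)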
PY
5) Lightweight DOM queries via JS (useful when state is empty)
browser-use --session demo --json eval "location.href"
browser-use --session demo --json eval "document.title"
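The jq filter in step 2 and the fallbacks in steps 3 to 5 can be combined into a small decision helper. The sketch below is a minimal illustration, assuming the `--json state` output has the shape the jq filter implies (a top-level `data` object with `url`, `title`, and an `elements` list). `summarize_state` and `needs_fallback` are hypothetical names, not part of browser-use.

```python
import json


def summarize_state(raw: str) -> dict:
    """Mirror the jq filter: keep url, title, and the element count."""
    data = json.loads(raw).get("data") or {}
    return {
        "url": data.get("url"),
        "title": data.get("title"),
        "elements": len(data.get("elements") or []),
    }


def needs_fallback(summary: dict) -> bool:
    """True when state is empty, so screenshot / get html / eval are the better tools."""
    return summary["elements"] == 0


if __name__ == "__main__":
    # Sample payload with the assumed shape, standing in for real CLI output.
    sample = '{"data": {"url": "https://example.com", "title": "Example Domain", "elements": []}}'
    summary = summarize_state(sample)
    print(summary, "fallback:", needs_fallback(summary))
```

In practice the `raw` string would come from the stdout of `browser-use --session demo --json state`.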
Use Python for Agent runs when the CLI run path requires Browser-Use cloud keys or when you need strict control over LLM parameters.
Create .env (or export env vars) with:
OPENAI_API_KEY=...
OPENAI_BASE_URL=https://api.moonshot.cn/v1
Then run the bundled script:
source /home/node/.openclaw/workspace/.venv-browser-use/bin/activate
python /home/node/.openclaw/workspace/skills/browser-use-local/scripts/run_agent_kimi.py
Kimi/Moonshot quirks observed in practice (fixes):
- temperature must be 1 for kimi-k2.5.
- frequency_penalty must be 0 for kimi-k2.5.
- remove_defaults_from_schema=True
- remove_min_items_from_schema=True
If you get a 400 error mentioning response_format.json_schema ... keyword 'default' is not allowed or min_items unsupported, those two flags are the first thing to set.
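The settings above can be gathered in one place so they are not scattered across a script. This is a minimal sketch, not the contents of run_agent_kimi.py (which is not shown here). The key names `model`, `api_key`, and `base_url` follow common OpenAI-compatible client conventions and are assumptions, as is the idea that the chat-model class you use accepts all of these keyword arguments. `kimi_llm_kwargs` and `looks_like_schema_rejection` are hypothetical helper names.

```python
import os

# Model name taken from the quirks listed above.
KIMI_MODEL = "kimi-k2.5"


def kimi_llm_kwargs(env=None) -> dict:
    """Collect the Kimi/Moonshot settings described above into one dict."""
    env = os.environ if env is None else env
    return {
        "model": KIMI_MODEL,
        "api_key": env.get("OPENAI_API_KEY", ""),
        "base_url": env.get("OPENAI_BASE_URL", "https://api.moonshot.cn/v1"),
        "temperature": 1,            # must be 1 for kimi-k2.5
        "frequency_penalty": 0,      # must be 0 for kimi-k2.5
        "remove_defaults_from_schema": True,
        "remove_min_items_from_schema": True,
    }


def looks_like_schema_rejection(message: str) -> bool:
    """Heuristic match for the 400 errors quoted above (an assumption, not an API)."""
    text = message.lower()
    return "keyword 'default' is not allowed" in text or "min_items" in text


if __name__ == "__main__":
    for key, value in kimi_llm_kwargs({"OPENAI_API_KEY": "sk-placeholder"}).items():
        print(f"{key} = {value!r}")
```

Keeping the values in a single function makes it easy to see which ones are Kimi-specific workarounds and to drop them when switching to another model.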
Two ways to extract a login QR code:
1) Screenshot the page and crop candidate regions (fast, robust).
2) If the HTML contains data:image/png;base64,..., extract and decode it.
Use scripts/crop_candidates.py to generate multiple likely QR crops from a screenshot.
source /home/node/.openclaw/workspace/.venv-browser-use/bin/activate
python skills/browser-use-local/scripts/crop_candidates.py \
--in /home/node/.openclaw/workspace/login.png \
--outdir /home/node/.openclaw/workspace/qr_crops
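To make the idea of "multiple likely QR crops" concrete, here is a minimal sketch of how candidate regions could be computed from the screenshot size alone. It is not the logic of scripts/crop_candidates.py, which is not shown here. The heuristics (centered squares at a few scales, plus left and right halves) and the name `candidate_boxes` are assumptions chosen for illustration.

```python
def candidate_boxes(width: int, height: int) -> list[tuple[int, int, int, int]]:
    """Return (left, top, right, bottom) boxes that might contain a QR code."""
    side = min(width, height)
    boxes = []
    # Assumed heuristic: centered squares at a few scales.
    for scale in (1.0, 0.75, 0.5):
        s = int(side * scale)
        left = (width - s) // 2
        top = (height - s) // 2
        boxes.append((left, top, left + s, top + s))
    # Assumed heuristic: left and right halves, for split-layout login pages.
    boxes.append((0, 0, width // 2, height))
    boxes.append((width // 2, 0, width, height))
    return boxes


if __name__ == "__main__":
    for box in candidate_boxes(1000, 800):
        print(box)
```

Each box uses the (left, upper, right, lower) order that Pillow's `Image.crop` expects, so the tuples can be passed to it directly if Pillow is installed.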
source /home/node/.openclaw/workspace/.venv-browser-use/bin/activate
browser-use --session demo --json get html > /tmp/page_html.json
python skills/browser-use-local/scripts/extract_data_images.py \
--in /tmp/page_html.json \
--outdir /home/node/.openclaw/workspace/data_imgs
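The decoding step behind this approach is small enough to sketch directly. The code below is an illustration of extracting base64 `data:image` URIs from an HTML string, not the source of scripts/extract_data_images.py, which is not shown here. The function name and the choice to skip malformed payloads silently are assumptions.

```python
import base64
import re

# Matches data URIs such as data:image/png;base64,iVBORw0...
DATA_IMAGE_RE = re.compile(r"data:image/([a-zA-Z0-9.+-]+);base64,([A-Za-z0-9+/=]+)")


def extract_data_images(html: str) -> list[tuple[str, bytes]]:
    """Return (subtype, decoded_bytes) for every base64 data:image URI in html."""
    images = []
    for subtype, payload in DATA_IMAGE_RE.findall(html):
        try:
            images.append((subtype, base64.b64decode(payload, validate=True)))
        except ValueError:
            # Skip truncated or malformed payloads instead of failing the run.
            continue
    return images


if __name__ == "__main__":
    demo_html = '<img src="data:image/png;base64,aGVsbG8=">'
    for subtype, blob in extract_data_images(demo_html):
        print(subtype, len(blob), "bytes")
```

With the CLI output used elsewhere on this page, the `html` argument would be the `data.html` field of /tmp/page_html.json, and each decoded blob could be written to a file named after its subtype.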
- state shows elements: 0: use get html + regex discovery, plus screenshots; use eval to query the DOM.
- Flags such as --browser and --json go before the subcommand, not after it:
- ✅ browser-use --browser chromium --json open https://...
- ❌ browser-use open https://... --browser chromium
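When browser-use is driven from Python, the flag-ordering rule shown by the ✅/❌ pair can be enforced by building the argument list in one place. This is a minimal sketch: `build_cmd` is a hypothetical helper, and it only uses the flags that appear on this page (`--browser`, `--session`, `--json`).

```python
import shlex


def build_cmd(subcommand, *args, session=None, browser=None, as_json=False):
    """Build a browser-use argv list with global flags ahead of the subcommand."""
    cmd = ["browser-use"]
    if browser:
        cmd += ["--browser", browser]
    if session:
        cmd += ["--session", session]
    if as_json:
        cmd.append("--json")
    # The subcommand and its own arguments always come last.
    cmd.append(subcommand)
    cmd.extend(args)
    return cmd


if __name__ == "__main__":
    # Print only; pass the list to subprocess.run to actually execute it.
    print(shlex.join(build_cmd("open", "https://example.com", browser="chromium", as_json=True)))
```

Returning a list rather than a string avoids shell quoting issues when the result is handed to `subprocess.run`.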
After installing Browser Use Local, you can say things like the following to the AI to trigger it:
Help me get started with Browser Use Local
Explains what Browser Use Local does, walks through the setup, and runs a quick demo based on your current project
Use Browser Use Local to automate browser actions locally via browser-use CLI/Python: open p...
Invokes Browser Use Local with the right parameters and returns the result directly in the conversation
What can I do with Browser Use Local in my developer & devops workflow?
Lists the top use cases for Browser Use Local, with example commands for each scenario
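Place the skill folder in ~/.claude/skills/browser-use-local/ (personal level, available to all projects) or in .claude/skills/browser-use-local/ (project level). After restarting the AI client, invoke it explicitly with /browser-use-local, or let the AI discover and use it automatically from context.
Browser Use Local supports Claude, Cursor, and OpenClaw, and integrates with these AI platforms to extend their capabilities.
Browser Use Local is free to install and use. See the repository for license information.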
Browser Use Local belongs to the "Developer & DevOps" category; skills in this category help AI agents carry out specialized tasks in this domain.
Automate my developer & devops tasks using Browser Use Local
Identifies repetitive steps in your workflow and sets up Browser Use Local to handle them automatically