Agentic Vision via Gemini's native Code Execution sandbox. Use for spatial grounding, visual math, and UI auditing.
数据来源:ClawHub。 在 ClawSkills 查看
选择你使用的 Agent
方法一:命令行安装(推荐)
推荐(无需提前安装 clawhub)
npx clawhub@latest --dir ~/.claude/skills install vision-sandbox或使用 clawhub CLI(需提前安装)
clawhub --dir ~/.claude/skills install vision-sandbox⚠️ 需要 Node.js 18+,没有 Node?请使用下方方法二直接下载 ZIP。 安装 Node.js →
方法二:手动下载安装(无需 Node)
下载 ZIP,解压后将文件夹放到以下路径,重启 Agent 即可:
安装路径
~/.claude/skills/vision-sandbox/💡解压后将文件夹放到上方路径,重启 Agent 即可生效
--- name: Vision Sandbox slug: vision-sandbox version: 1.1.0 description: Agentic Vision via Gemini's native Code Execution sandbox. Use for spatial grounding, visual math, and UI auditing. metadata: openclaw: emoji: "🔭" primaryEnv: "GEMINI_API_KEY" requires: bins: ["uv"] env: ["GEMINI_API_KEY"] ---
Leverage Gemini's native code execution to analyze images with high precision. The model writes and runs Python code in a Google-hosted sandbox to verify visual data, perfect for UI auditing, spatial grounding, and visual reasoning.
clawhub install vision-sandbox
uv run vision-sandbox --image "path/to/image.png" --prompt "Identify all buttons and provide [x, y] coordinates."
Ask the model to find specific items and return coordinates.
Ask the model to count or calculate based on the image.
Check layout and readability.
Solve visual counting tasks with code verification.
This skill is designed to provide Visual Grounding for automated coding agents like OpenCode.
vision-sandbox to extract UI metadata (coordinates, sizes, colors).gemini-3-flash-preview.安装 Vision Sandbox 后,可以对 AI 说这些话来触发它
Help me get started with Vision Sandbox
Explains what Vision Sandbox does, walks through the setup, and runs a quick demo based on your current project
Use Vision Sandbox to agentic Vision via Gemini's native Code Execution sandbox
Invokes Vision Sandbox with the right parameters and returns the result directly in the conversation
What can I do with Vision Sandbox in my developer & devops workflow?
Lists the top use cases for Vision Sandbox, with example commands for each scenario
将技能文件夹放到 ~/.claude/skills/vision-sandbox/ 目录(个人级,所有项目可用),或 .claude/skills/vision-sandbox/(项目级)。重启 AI 客户端后,用 /vision-sandbox 主动调用,或让 AI 根据上下文自动发现并使用。
Vision Sandbox 支持 Claude、Cursor、OpenClaw,可与这些 AI 平台无缝集成,扩展其能力。
Vision Sandbox 可免费安装使用。请查阅仓库了解许可证信息。
Agentic Vision via Gemini's native Code Execution sandbox. Use for spatial grounding, visual math, and UI auditing.
Vision Sandbox 属于「Developer & DevOps」分类,该分类的技能帮助 AI 智能体在此领域执行专业任务。
Automate my developer & devops tasks using Vision Sandbox
Identifies repetitive steps in your workflow and sets up Vision Sandbox to handle them automatically