Describe images, detect objects, extract text, and analyze webpages. Pass any image URL directly in your task. Responds in your language.
数据来源:ClawHub。 在 ClawSkills 查看
选择你使用的 Agent
方法一:命令行安装(推荐)
推荐(无需提前安装 clawhub)
npx clawhub@latest --dir ~/.claude/skills install vision-bot或使用 clawhub CLI(需提前安装)
clawhub --dir ~/.claude/skills install vision-bot⚠️ 需要 Node.js 18+,没有 Node?请使用下方方法二直接下载 ZIP。 安装 Node.js →
方法二:手动下载安装(无需 Node)
下载 ZIP,解压后将文件夹放到以下路径,重启 Agent 即可:
安装路径
~/.claude/skills/vision-bot/💡解压后将文件夹放到上方路径,重启 Agent 即可生效
--- name: vision-bot description: "Describe images, detect objects, extract text, and analyze webpages. Pass any image URL directly in your task. Responds in your language." version: 1.2.0 acceptLicenseTerms: true metadata: clawdbot: emoji: "👁️" homepage: https://aiprox.dev requires: env: - AIPROX_SPEND_TOKEN ---
Analyze images for detailed descriptions, object detection, and OCR text extraction. Pass any image URL directly in your task string — no separate field needed. Auto-detects the right mode from your task — OCR for text extraction, counting for quantity questions, or full description by default. Responds in the language of your task.
image_url field separately| Permission | Scope | Reason | |------------|-------|--------| | Network | aiprox.dev | API calls to orchestration endpoint | | Env Read | AIPROX_SPEND_TOKEN | Authentication for paid API |
curl -X POST https://aiprox.dev/api/orchestrate \
-H "Content-Type: application/json" \
-d '{
"task": "描述这张图片的内容: https://example.com/photo.jpg",
"rail": "bitcoin-lightning",
"spend_token": "$AIPROX_SPEND_TOKEN"
}'
curl -X POST https://aiprox.dev/api/orchestrate \
-H "Content-Type: application/json" \
-d '{
"task": "Describe this image: https://example.com/photo.jpg",
"rail": "bitcoin-lightning",
"spend_token": "$AIPROX_SPEND_TOKEN"
}'
{
"description": "A modern office workspace with a standing desk and dual monitors.",
"objects": ["desk", "monitors", "keyboard", "mouse", "plant", "window", "headphones"],
"text_found": "Visual Studio Code - main.js"
}
Vision Bot analyzes images via URL or base64 input. Images are processed transiently using Claude's vision capabilities via LightningProx. No images are stored. Your spend token is used for payment only.
安装 Vision Bot 后,可以对 AI 说这些话来触发它
Help me get started with Vision Bot
Explains what Vision Bot does, walks through the setup, and runs a quick demo based on your current project
Use Vision Bot to describe images, detect objects, extract text, and analyze webpages
Invokes Vision Bot with the right parameters and returns the result directly in the conversation
What can I do with Vision Bot in my documents & notes workflow?
Lists the top use cases for Vision Bot, with example commands for each scenario
将技能文件夹放到 ~/.claude/skills/vision-bot/ 目录(个人级,所有项目可用),或 .claude/skills/vision-bot/(项目级)。重启 AI 客户端后,用 /vision-bot 主动调用,或让 AI 根据上下文自动发现并使用。
Vision Bot 支持 Claude、Cursor、OpenClaw,可与这些 AI 平台无缝集成,扩展其能力。
Vision Bot 可免费安装使用。请查阅仓库了解许可证信息。
Describe images, detect objects, extract text, and analyze webpages. Pass any image URL directly in your task. Responds in your language.
Vision Bot 属于「Documents & Notes」分类,该分类的技能帮助 AI 智能体在此领域执行专业任务。
Automate my documents & notes tasks using Vision Bot
Identifies repetitive steps in your workflow and sets up Vision Bot to handle them automatically