V

Vision Bot

vision-bot

Describe images, detect objects, extract text, and analyze webpages. Pass any image URL directly in your task. Responds in your language.

数据来源：ClawHub。在 ClawSkills 查看

1.2k下载量

0收藏数

12浏览量

安装

选择你使用的 Agent

方法一：命令行安装（推荐）

关于 Vision Bot

--- name: vision-bot description: "Describe images, detect objects, extract text, and analyze webpages. Pass any image URL directly in your task. Responds in your language." version: 1.2.0 acceptLicenseTerms: true metadata: clawdbot: emoji: "👁️" homepage: https://aiprox.dev requires: env: - AIPROX_SPEND_TOKEN ---

Vision Bot

Analyze images for detailed descriptions, object detection, and OCR text extraction. Pass any image URL directly in your task string — no separate field needed. Auto-detects the right mode from your task — OCR for text extraction, counting for quantity questions, or full description by default. Responds in the language of your task.

When to Use

Describing image contents for accessibility
Extracting text from screenshots, signs, or photos (OCR)
Counting objects in images
Identifying objects in images
Analyzing charts, diagrams, or visual data
Analyzing images in any language (Chinese, Spanish, French, etc.)
Describing webpage screenshots for audits
Analyzing any image by including the URL directly in your task

Usage Flow

Include an image URL directly in your task string
Or provide image_url field separately
Task language sets response language automatically

Security Manifest

| Permission | Scope | Reason | |------------|-------|--------| | Network | aiprox.dev | API calls to orchestration endpoint | | Env Read | AIPROX_SPEND_TOKEN | Authentication for paid API |

Make Request

curl -X POST https://aiprox.dev/api/orchestrate \
  -H "Content-Type: application/json" \
  -d '{
    "task": "描述这张图片的内容: https://example.com/photo.jpg",
    "rail": "bitcoin-lightning",
    "spend_token": "$AIPROX_SPEND_TOKEN"
  }'

curl -X POST https://aiprox.dev/api/orchestrate \
  -H "Content-Type: application/json" \
  -d '{
    "task": "Describe this image: https://example.com/photo.jpg",
    "rail": "bitcoin-lightning",
    "spend_token": "$AIPROX_SPEND_TOKEN"
  }'

Response

{
  "description": "A modern office workspace with a standing desk and dual monitors.",
  "objects": ["desk", "monitors", "keyboard", "mouse", "plant", "window", "headphones"],
  "text_found": "Visual Studio Code - main.js"
}

Trust Statement

Vision Bot analyzes images via URL or base64 input. Images are processed transiently using Claude's vision capabilities via LightningProx. No images are stored. Your spend token is used for payment only.

Prompt 示例

安装 Vision Bot 后，可以对 AI 说这些话来触发它

U

Help me get started with Vision Bot

A

Explains what Vision Bot does, walks through the setup, and runs a quick demo based on your current project

U

Use Vision Bot to describe images, detect objects, extract text, and analyze webpages

A

Invokes Vision Bot with the right parameters and returns the result directly in the conversation

U

What can I do with Vision Bot in my documents & notes workflow?

A

Lists the top use cases for Vision Bot, with example commands for each scenario

常见问题

如何安装 Vision Bot？▾

将技能文件夹放到 ~/.claude/skills/vision-bot/ 目录（个人级，所有项目可用），或 .claude/skills/vision-bot/（项目级）。重启 AI 客户端后，用 /vision-bot 主动调用，或让 AI 根据上下文自动发现并使用。

Vision Bot 支持哪些 AI 平台？▾

Vision Bot 支持 Claude、Cursor、OpenClaw，可与这些 AI 平台无缝集成，扩展其能力。

Vision Bot 是免费的吗？▾

Vision Bot 可免费安装使用。请查阅仓库了解许可证信息。

Vision Bot 有什么功能？▾

Describe images, detect objects, extract text, and analyze webpages. Pass any image URL directly in your task. Responds in your language.

Vision Bot 属于哪个分类？▾

Vision Bot 属于「Documents & Notes」分类，该分类的技能帮助 AI 智能体在此领域执行专业任务。

使用场景

Getting Started with Vision Bot→Automate Documents & Notes Workflows with Vision Bot→Team Collaboration with Vision Bot→

Vision Bot

安装

关于 Vision Bot

Vision Bot

When to Use

Usage Flow

Security Manifest

Make Request

Response

Trust Statement

Prompt 示例

常见问题

使用场景

同类技能推荐

Nano Pdf

Obsidian

Notion

Word / DOCX