Vision Bot

vision-bot

Describe images, detect objects, extract text, and analyze webpages. Pass any image URL directly in your task. Responds in your language.

Data sourced from ClawHub.

1.2k Downloads

0 Stars

12 Views

Installation


Option 1: Install via CLI (recommended)

Recommended (no pre-install needed)

$ npx clawhub@latest --dir ~/.claude/skills install vision-bot

Or via clawhub CLI (if already installed)

$ clawhub --dir ~/.claude/skills install vision-bot

⚠️ Requires Node.js 18+. No Node? Use Option 2 below to download the ZIP instead.

Option 2: Manual install (no Node required)

Download the ZIP, extract it, and place the folder at the path below. Restart your agent to activate.

Install path

🤖 Claude Code: ~/.claude/skills/vision-bot/
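The manual placement step can be sketched as a small shell helper. This is a minimal sketch, not part of Vision Bot or the clawhub CLI: the function name `place_skill` is hypothetical, and it assumes the ZIP has already been extracted to a local folder. It copies that folder into the skills directory and refuses to overwrite an existing install.

```shell
# Hypothetical helper (not shipped with Vision Bot or clawhub):
# copy an already-extracted skill folder into a skills directory.
# Usage: place_skill <extracted-folder> [skills-dir]
place_skill() {
  src="$1"
  skills_dir="${2:-$HOME/.claude/skills}"   # default matches the install path above

  if [ ! -d "$src" ]; then
    echo "not a directory: $src" >&2
    return 1
  fi

  name="$(basename "$src")"
  dest="$skills_dir/$name"

  # Refuse to overwrite, so an existing install is never silently nested or replaced.
  if [ -e "$dest" ]; then
    echo "already exists: $dest" >&2
    return 1
  fi

  mkdir -p "$skills_dir"
  cp -R "$src" "$dest"
  echo "$dest"   # print the final location
}
```

For example, `place_skill ~/Downloads/vision-bot` would print the destination path on success (the download location here is only an illustration). Restart your agent afterwards, as described above.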


Similar Skills


Nano Pdf

Curated

Documents & Notes

Edit PDFs with natural-language instructions using the nano-pdf CLI.

Claude, Cursor, OpenClaw

92.4k

Key Features

What Vision Bot can do for your AI workflow

Describe images, detect objects, extract text, and analyze webpages directly from your Claude conversation

Works across Claude, Cursor, and OpenClaw: install once, use everywhere

Installed 1,217 times

One-command installation — no complex setup required

Combine with other skills to build powerful multi-step AI workflows

Example Prompts

Try these prompts with your AI agent after installing Vision Bot

Help me get started with Vision Bot

Explains what Vision Bot does, walks through the setup, and runs a quick demo based on your current project

Use Vision Bot to describe images, detect objects, extract text, and analyze webpages

Invokes Vision Bot with the right parameters and returns the result directly in the conversation

What can I do with Vision Bot in my documents & notes workflow?

Lists the top use cases for Vision Bot, with example commands for each scenario


About Vision Bot

Vision Bot extends your AI assistant with the ability to describe images, detect objects, extract text, and analyze webpages. You pass an image URL directly in your task, and it responds in the language you wrote in. Rather than leaving your conversation to handle this manually, you can ask your Claude agent directly, and it will take care of the task end-to-end, using Vision Bot as its underlying capability.

Vision Bot works across Claude, Cursor, and OpenClaw through the Model Context Protocol (MCP), an open standard that lets AI clients share tools and skills without lock-in. Because MCP is platform-agnostic by design, you install Vision Bot once and it becomes available across all your AI clients. Whether you're working in Claude for focused sessions or Cursor for integrated workflows, the skill behaves consistently.

Getting started with Vision Bot takes about two minutes. Place the skill at `~/.claude/skills/vision-bot/` (personal, all projects) or `.claude/skills/vision-bot/` (project-specific), then restart your AI client. From that point, typing `/vision-bot` in any conversation activates it, or the AI will use it on its own when it detects a relevant request.
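To confirm which of the two locations actually holds the skill, a quick check can help. This is a minimal sketch under stated assumptions: `skill_locations` is a hypothetical helper, not a clawhub or Claude command, and it only tests whether the folders named above exist. The second and third arguments default to `$HOME` and the current directory.

```shell
# Hypothetical helper: report where a skill folder is present.
# Usage: skill_locations <skill-name> [home-dir] [project-dir]
skill_locations() {
  name="$1"
  home_dir="${2:-$HOME}"
  project_dir="${3:-.}"
  status=1   # non-zero until at least one location is found

  # Personal install: available in all projects.
  if [ -d "$home_dir/.claude/skills/$name" ]; then
    echo "personal: $home_dir/.claude/skills/$name"
    status=0
  fi

  # Project-specific install: only for this project.
  if [ -d "$project_dir/.claude/skills/$name" ]; then
    echo "project: $project_dir/.claude/skills/$name"
    status=0
  fi

  return "$status"
}
```

Running `skill_locations vision-bot` from a project root prints one line per location found and returns a non-zero status if the skill is in neither place.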

Vision Bot has been installed 1,217 times, making it one of the more actively used skills in the Documents & Notes category. The install rate suggests it solves a real, recurring need rather than a niche edge case. Like all skills on DiscoverAISkills, it is free to install and use. The broader AI skills ecosystem continues to expand as developers contribute new capabilities across categories like developer tools, data analysis, writing, automation, and more.

Frequently Asked Questions

How do I install Vision Bot?

Place the skill folder at ~/.claude/skills/vision-bot/ for personal use (all projects), or .claude/skills/vision-bot/ for project-specific use. Restart your AI client, then invoke with /vision-bot or let the AI discover it automatically.

What AI platforms does Vision Bot support?

Vision Bot supports Claude, Cursor, and OpenClaw, extending each of them with its image and webpage analysis capabilities.

Is Vision Bot free to use?

Vision Bot is free to install. Check the repository for licensing information.

What does Vision Bot do?

Describe images, detect objects, extract text, and analyze webpages. Pass any image URL directly in your task. Responds in your language.

What category does Vision Bot belong to?

Vision Bot is listed in the Documents & Notes category.

Vision Sandbox

Agentic Vision via Gemini's native Code Execution sandbox. Use for spatial grounding, visual math, and UI auditing.

5.7k

Vision

Resize, crop, convert, and optimize images using ImageMagick. Use when processing photos, converting formats (PNG/WebP), compressing size, or adding watermarks.

2.8k

Vision Tagger

Tag and annotate images using Apple Vision framework (macOS only). Detects faces, bodies, hands, text (OCR), barcodes, objects, scene labels, and saliency re...

1.3k

