Analyze and summarize videos from 1000+ sites using Google Gemini AI, providing transcripts, descriptions, summaries, and answers to questions.
数据来源:ClawHub。 在 ClawSkills 查看
选择你使用的 Agent
方法一:命令行安装(推荐)
推荐(无需提前安装 clawhub)
npx clawhub@latest --dir ~/.claude/skills install video-understanding或使用 clawhub CLI(需提前安装)
clawhub --dir ~/.claude/skills install video-understanding⚠️ 需要 Node.js 18+,没有 Node?请使用下方方法二直接下载 ZIP。 安装 Node.js →
方法二:手动下载安装(无需 Node)
下载 ZIP,解压后将文件夹放到以下路径,重启 Agent 即可:
安装路径
~/.claude/skills/video-understanding/💡解压后将文件夹放到上方路径,重启 Agent 即可生效
--- name: video-understanding description: > Analyze videos with Google Gemini multimodal AI. Download from any URL (Loom, YouTube, TikTok, Vimeo, Twitter/X, Instagram, 1000+ sites) and get transcripts, descriptions, and answers to questions. Use when asked to watch, analyze, summarize, or transcribe a video, or answer questions about video content. Triggers on video URLs or requests involving video understanding. compatibility: "Requires yt-dlp, ffmpeg, and GEMINI_API_KEY environment variable. Python 3.10+ with uv." metadata: openclaw: emoji: "🎬" requires: bins: ["yt-dlp", "ffmpeg"] env: ["GEMINI_API_KEY"] primaryEnv: "GEMINI_API_KEY" install: - id: "yt-dlp-brew" kind: "brew" formula: "yt-dlp" bins: ["yt-dlp"] label: "Install yt-dlp (brew)" - id: "ffmpeg-brew" kind: "brew" formula: "ffmpeg" bins: ["ffmpeg"] label: "Install ffmpeg (brew)" ---
Analyze videos using Google Gemini's multimodal video understanding. Supports 1000+ video sources via yt-dlp.
yt-dlp — brew install yt-dlp / pip install yt-dlpffmpeg — brew install ffmpeg (for merging video+audio streams)GEMINI_API_KEY environment variableReturns structured JSON:
[MM:SS] timestampsuv run {baseDir}/scripts/analyze_video.py "<video-url>"
uv run {baseDir}/scripts/analyze_video.py "<video-url>" -q "What product is shown?"
uv run {baseDir}/scripts/analyze_video.py "<video-url>" -p "Custom prompt" --raw
uv run {baseDir}/scripts/analyze_video.py "<video-url>" --download-only -o video.mp4
| Flag | Description | Default | |------|-------------|---------| | -q / --question | Question to answer (added to default fields) | none | | -p / --prompt | Override entire prompt (ignores -q) | structured JSON | | -m / --model | Gemini model | gemini-2.5-flash | | -o / --output | Save output to file | stdout | | --keep | Keep downloaded video file | false | | --download-only | Download only, skip analysis | false | | --max-size | Max file size in MB | 500 | | --raw | Raw text output instead of JSON | false |
Any URL supported by yt-dlp: Loom, YouTube, TikTok, Vimeo, Twitter/X, Instagram, Dailymotion, Twitch, and 1000+ more.
-q for targeted questions on top of the full analysisuv安装 Video Understanding 后,可以对 AI 说这些话来触发它
Help me get started with Video Understanding
Explains what Video Understanding does, walks through the setup, and runs a quick demo based on your current project
Use Video Understanding to analyze and summarize videos from 1000+ sites using Google Gemini A...
Invokes Video Understanding with the right parameters and returns the result directly in the conversation
What can I do with Video Understanding in my design & creative workflow?
Lists the top use cases for Video Understanding, with example commands for each scenario
将技能文件夹放到 ~/.claude/skills/video-understanding/ 目录(个人级,所有项目可用),或 .claude/skills/video-understanding/(项目级)。重启 AI 客户端后,用 /video-understanding 主动调用,或让 AI 根据上下文自动发现并使用。
Video Understanding 支持 Claude、Cursor、OpenClaw,可与这些 AI 平台无缝集成,扩展其能力。
Video Understanding 可免费安装使用。请查阅仓库了解许可证信息。
Analyze and summarize videos from 1000+ sites using Google Gemini AI, providing transcripts, descriptions, summaries, and answers to questions.
Video Understanding 属于「Design & Creative」分类,该分类的技能帮助 AI 智能体在此领域执行专业任务。
Automate my design & creative tasks using Video Understanding
Identifies repetitive steps in your workflow and sets up Video Understanding to handle them automatically