Generate photorealistic images, videos, talking heads, and natural TTS audio using GPU-accelerated AI models and scripts on a remote server.
数据来源:ClawHub。 在 ClawSkills 查看
选择你使用的 Agent
方法一:命令行安装(推荐)
推荐(无需提前安装 clawhub)
npx clawhub@latest --dir ~/.claude/skills install ai-media或使用 clawhub CLI(需提前安装)
clawhub --dir ~/.claude/skills install ai-media⚠️ 需要 Node.js 18+,没有 Node?请使用下方方法二直接下载 ZIP。 安装 Node.js →
方法二:手动下载安装(无需 Node)
下载 ZIP,解压后将文件夹放到以下路径,重启 Agent 即可:
安装路径
~/.claude/skills/ai-media/💡解压后将文件夹放到上方路径,重启 Agent 即可生效
Full-stack AI media generation powered by GPU server (RTX 3090/3080/2070S).
${GPU_USER}@${GPU_HOST}~/.ssh/id_ed25519_gpu/data/ai-stack/comfyui/ComfyUI/ (port 8188)/data/ai-stack/sadtalker//data/ai-stack/whisper//data/ai-stack/output/./scripts/image.sh "lady on beach at sunset" realistic
./scripts/image.sh "cyberpunk cityscape" artistic
Arguments:
$1: Prompt text$2: Style (realistic|artistic) — optional, default: realisticOutput: Path to generated image (e.g., /data/ai-stack/output/image_001.png)
./scripts/video.sh "waves crashing on shore" animatediff 4
./scripts/video.sh "city traffic timelapse" ltx2 8
Arguments:
$1: Prompt text$2: Model (animatediff|ltx2) — optional, default: animatediff$3: Duration in seconds — optional, default: 4Output: Path to generated video (e.g., /data/ai-stack/output/video_001.mp4)
./scripts/talking-head.sh "Hello, I'm Agent" gentle input.jpg
./scripts/talking-head.sh "Welcome to the future" neutral photo.png
Arguments:
$1: Speech text$2: Voice style (gentle|neutral|energetic) — optional, default: gentle$3: Avatar image path — optional, generates default if not providedOutput: Path to talking head video (e.g., /data/ai-stack/output/talking_001.mp4)
./scripts/audio.sh "This is a test message" en male
./scripts/audio.sh "Bonjour le monde" fr female
Arguments:
$1: Text to speak$2: Language code (en|fr|es|etc) — optional, default: en$3: Voice gender (male|female) — optional, default: maleOutput: Path to audio file (e.g., /data/ai-stack/output/audio_001.wav)
All dependencies are pre-installed on GPU server:
Scripts will:
---
Status: Active development Maintainer: Agent GPU Server: ${GPU_USER}@${GPU_HOST}
安装 Ai Media 后,可以对 AI 说这些话来触发它
Help me get started with Ai Media
Explains what Ai Media does, walks through the setup, and runs a quick demo based on your current project
Use Ai Media to generate photorealistic images, videos, talking heads, and natural ...
Invokes Ai Media with the right parameters and returns the result directly in the conversation
What can I do with Ai Media in my design & creative workflow?
Lists the top use cases for Ai Media, with example commands for each scenario
将技能文件夹放到 ~/.claude/skills/ai-media/ 目录(个人级,所有项目可用),或 .claude/skills/ai-media/(项目级)。重启 AI 客户端后,用 /ai-media 主动调用,或让 AI 根据上下文自动发现并使用。
Ai Media 支持 Claude、Cursor、OpenClaw,可与这些 AI 平台无缝集成,扩展其能力。
Ai Media 可免费安装使用。请查阅仓库了解许可证信息。
Generate photorealistic images, videos, talking heads, and natural TTS audio using GPU-accelerated AI models and scripts on a remote server.
Ai Media 属于「Design & Creative」分类,该分类的技能帮助 AI 智能体在此领域执行专业任务。
Automate my design & creative tasks using Ai Media
Identifies repetitive steps in your workflow and sets up Ai Media to handle them automatically