AI image generation with OpenAI, Google, DashScope and Replicate APIs. Supports text-to-image, reference images, aspect ratios. Sequential by default; parall...
Data source: ClawHub. View on ClawSkills.
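Method 1: Command-line install (recommended)
Recommended (no need to install clawhub first):
npx clawhub@latest --dir ~/.claude/skills install baoyu-image-gen
Or use the clawhub CLI (must be installed beforehand):
clawhub --dir ~/.claude/skills install baoyu-image-gen
⚠️ Requires Node.js 18+. No Node? Use Method 2 below to download the ZIP directly.
Method 2: Manual download (no Node required)
Download the ZIP, unzip it, place the folder at the path below, then restart the agent:
Install path
~/.claude/skills/baoyu-image-gen/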
---
name: baoyu-image-gen
description: AI image generation with OpenAI, Google, DashScope and Replicate APIs. Supports text-to-image, reference images, aspect ratios. Sequential by default; parallel generation available on request. Use when user asks to generate, create, or draw images.
---
Official API-based image generation. Supports OpenAI, Google, DashScope (阿里通义万象) and Replicate providers.
Agent Execution:
SKILL_DIR = this SKILL.md file's directory
Script path = ${SKILL_DIR}/scripts/main.ts
CRITICAL: This step MUST complete BEFORE any image generation. Do NOT skip or defer.
Check EXTEND.md existence (priority: project → user):
test -f .baoyu-skills/baoyu-image-gen/EXTEND.md && echo "project"
test -f "$HOME/.baoyu-skills/baoyu-image-gen/EXTEND.md" && echo "user"
| Result | Action |
|--------|--------|
| Found | Load, parse, apply settings. If default_model.[provider] is null → ask model only (Flow 2) |
| Not found | ⛔ Run first-time setup (references/config/first-time-setup.md) → Save EXTEND.md → Then continue |
CRITICAL: If not found, complete the full setup (provider + model + quality + save location) using AskUserQuestion BEFORE generating any images. Generation is BLOCKED until EXTEND.md is created.
| Path | Location |
|------|----------|
| .baoyu-skills/baoyu-image-gen/EXTEND.md | Project directory |
| $HOME/.baoyu-skills/baoyu-image-gen/EXTEND.md | User home |
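The lookup order above (project first, then user home) can be sketched as a small pure function. This is a minimal illustration, not the skill's actual code: `findExtendMd` and the injected `exists` callback are hypothetical names, and a real script would likely back `exists` with a filesystem check.

```typescript
// Sketch: decide which EXTEND.md applies, project directory before user home.
// `exists` is injected so the function stays testable without touching disk.
type ExtendLocation = { scope: "project" | "user"; path: string };

const EXTEND_REL_PATH = ".baoyu-skills/baoyu-image-gen/EXTEND.md";

function findExtendMd(
  projectDir: string,
  homeDir: string,
  exists: (path: string) => boolean
): ExtendLocation | null {
  const candidates: ExtendLocation[] = [
    { scope: "project", path: `${projectDir}/${EXTEND_REL_PATH}` },
    { scope: "user", path: `${homeDir}/${EXTEND_REL_PATH}` },
  ];
  for (const candidate of candidates) {
    if (exists(candidate.path)) return candidate;
  }
  // null means first-time setup has to run before any image is generated.
  return null;
}
```

A `null` result corresponds to the "Not found" row: generation stays blocked until setup has written EXTEND.md.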
EXTEND.md Supports: Default provider | Default quality | Default aspect ratio | Default image size | Default models
Schema: references/config/preferences-schema.md
# Basic
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image cat.png
# With aspect ratio
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A landscape" --image out.png --ar 16:9
# High quality
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --quality 2k
# From prompt files
npx -y bun ${SKILL_DIR}/scripts/main.ts --promptfiles system.md content.md --image out.png
# With reference images (Google multimodal or OpenAI edits)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make blue" --image out.png --ref source.png
# With reference images (explicit provider/model)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make blue" --image out.png --provider google --model gemini-3-pro-image-preview --ref source.png
# Specific provider
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider openai
# DashScope (阿里通义万象)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "一只可爱的猫" --image out.png --provider dashscope
# Replicate (google/nano-banana-pro)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate
# Replicate with specific model
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate --model google/nano-banana
| Option | Description |
|--------|-------------|
| --prompt, -p | Prompt text |
| --promptfiles | Read prompt from files (concatenated) |
| --image | Output image path (required) |
| --provider google\|openai\|dashscope\|replicate | Force provider (default: google) |
| --model, -m | Model ID (Google: gemini-3-pro-image-preview, gemini-3.1-flash-image-preview; OpenAI: gpt-image-1.5) |
| --ar | Aspect ratio (e.g., 16:9, 1:1, 4:3) |
| --size | Size (e.g., 1024x1024) |
| --quality normal\|2k | Quality preset (default: 2k) |
| --imageSize 1K\|2K\|4K | Image size for Google (default: from quality) |
| --ref | Reference images. Supported by Google multimodal (gemini-3-pro-image-preview, gemini-3-flash-preview, gemini-3.1-flash-image-preview) and OpenAI edits (GPT Image models). If provider omitted: Google first, then OpenAI |
| --n | Number of images |
| --json | JSON output |
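When the script is driven from another program, the options in the table can be assembled into an argument list. The sketch below is illustrative only: `GenOptions` and `buildArgs` are hypothetical names, and it assumes that several reference images follow a single `--ref` flag, which the table does not state explicitly.

```typescript
// Hypothetical option bag; field names mirror the flags in the options table.
interface GenOptions {
  prompt: string;
  image: string; // output image path (required)
  provider?: "google" | "openai" | "dashscope" | "replicate";
  model?: string;
  ar?: string;
  quality?: "normal" | "2k";
  ref?: string[];
  n?: number;
  json?: boolean;
}

function buildArgs(opts: GenOptions): string[] {
  const args: string[] = ["--prompt", opts.prompt, "--image", opts.image];
  if (opts.provider) args.push("--provider", opts.provider);
  if (opts.model) args.push("--model", opts.model);
  if (opts.ar) args.push("--ar", opts.ar);
  if (opts.quality) args.push("--quality", opts.quality);
  // Assumption: multiple reference paths are listed after one --ref flag.
  if (opts.ref && opts.ref.length > 0) args.push("--ref", ...opts.ref);
  if (opts.n !== undefined) args.push("--n", String(opts.n));
  if (opts.json) args.push("--json");
  return args;
}
```

Returning an array rather than a joined string means a prompt containing spaces or quotes can be passed to a process-spawning API without shell escaping.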
| Variable | Description |
|----------|-------------|
| OPENAI_API_KEY | OpenAI API key |
| GOOGLE_API_KEY | Google API key |
| DASHSCOPE_API_KEY | DashScope API key (阿里云) |
| REPLICATE_API_TOKEN | Replicate API token |
| OPENAI_IMAGE_MODEL | OpenAI model override |
| GOOGLE_IMAGE_MODEL | Google model override |
| DASHSCOPE_IMAGE_MODEL | DashScope model override (default: z-image-turbo) |
| REPLICATE_IMAGE_MODEL | Replicate model override (default: google/nano-banana-pro) |
| OPENAI_BASE_URL | Custom OpenAI endpoint |
| GOOGLE_BASE_URL | Custom Google endpoint |
| DASHSCOPE_BASE_URL | Custom DashScope endpoint |
| REPLICATE_BASE_URL | Custom Replicate endpoint |
Load Priority: CLI args > EXTEND.md > env vars > > ~/.baoyu-skills/.env
When using --provider replicate, the model can be configured in the following ways (highest priority first):
1. --model (CLI flag)
2. default_model.replicate (EXTEND.md)
3. REPLICATE_IMAGE_MODEL (environment variable)
4. google/nano-banana-pro (built-in default)
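The priority order above amounts to "first non-empty value wins, else the built-in default". A minimal sketch, assuming the three sources have already been read into plain values (`resolveReplicateModel` and its parameter names are hypothetical):

```typescript
const REPLICATE_DEFAULT_MODEL = "google/nano-banana-pro";

function resolveReplicateModel(sources: {
  cliModel?: string; // value of --model
  extendDefault?: string | null; // default_model.replicate from EXTEND.md
  envModel?: string; // REPLICATE_IMAGE_MODEL
}): string {
  // Highest priority first, matching the list above.
  const candidates = [sources.cliModel, sources.extendDefault, sources.envModel];
  for (const value of candidates) {
    if (value && value.trim() !== "") return value.trim();
  }
  return REPLICATE_DEFAULT_MODEL;
}
```

Treating empty strings as unset is a choice made for this sketch; the real script may handle them differently.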
Supported model formats:
owner/name (recommended for official models), e.g. google/nano-banana-pro
owner/name:version (community models by version), e.g. stability-ai/sdxl:
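Both formats can be told apart with one regular expression. The following is a sketch under the assumption that owner, name, and version contain no slashes, colons, or whitespace; `parseReplicateModel` is a hypothetical helper, not part of the skill or of the Replicate client.

```typescript
type ReplicateModelRef = { owner: string; name: string; version?: string };

function parseReplicateModel(ref: string): ReplicateModelRef {
  // owner/name, optionally followed by :version
  const match = /^([^\/:\s]+)\/([^\/:\s]+)(?::([^\/:\s]+))?$/.exec(ref.trim());
  if (!match) {
    throw new Error(`Invalid Replicate model reference: ${ref}`);
  }
  const owner = String(match[1]);
  const name = String(match[2]);
  const version = match[3];
  // Only include `version` when the reference actually carries one.
  return version ? { owner, name, version } : { owner, name };
}
```

A reference with a trailing colon but no version string is rejected here rather than silently treated as `owner/name`.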
Examples:
# Use Replicate default model
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate
# Override model explicitly
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate --model google/nano-banana
--ref provided + no --provider → auto-select Google first, then OpenAI, then Replicate
--provider specified → use it (if --ref, must be google, openai, or replicate)
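The two rules above can be sketched as a single selection function. This is an illustration, not the script's logic: `selectProvider` is a hypothetical name, and the `available` list is an assumption standing in for "providers that have an API key configured".

```typescript
type Provider = "google" | "openai" | "dashscope" | "replicate";

// Order used for automatic selection when --ref is given without --provider.
const REF_PROVIDER_ORDER: Provider[] = ["google", "openai", "replicate"];

function selectProvider(opts: {
  provider?: Provider;
  hasRef: boolean;
  available: Provider[]; // assumption: providers with credentials configured
}): Provider {
  if (opts.provider) {
    if (opts.hasRef && !REF_PROVIDER_ORDER.includes(opts.provider)) {
      throw new Error(`--ref is not supported with provider ${opts.provider}`);
    }
    return opts.provider;
  }
  if (opts.hasRef) {
    const match = REF_PROVIDER_ORDER.find((p) => opts.available.includes(p));
    if (!match) throw new Error("No configured provider supports --ref");
    return match;
  }
  return "google"; // documented default when --provider is omitted
}
```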
| Preset | Google imageSize | OpenAI Size | Use Case |
|--------|------------------|-------------|----------|
| normal | 1K | 1024px | Quick previews |
| 2k (default) | 2K | 2048px | Covers, illustrations, infographics |
Google imageSize: Can be overridden with --imageSize 1K|2K|4K
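The preset table and the `--imageSize` override can be expressed as a lookup with a fallback. A minimal sketch with hypothetical names (`QUALITY_PRESETS`, `resolveGoogleImageSize`); the values are taken from the table above.

```typescript
type Quality = "normal" | "2k";
type ImageSize = "1K" | "2K" | "4K";

const QUALITY_PRESETS: Record<Quality, { googleImageSize: ImageSize; openaiPx: number }> = {
  normal: { googleImageSize: "1K", openaiPx: 1024 },
  "2k": { googleImageSize: "2K", openaiPx: 2048 },
};

// An explicit --imageSize wins; otherwise the size follows the quality preset.
function resolveGoogleImageSize(quality: Quality = "2k", imageSize?: ImageSize): ImageSize {
  return imageSize ?? QUALITY_PRESETS[quality].googleImageSize;
}
```

For example, `resolveGoogleImageSize("normal", "4K")` yields `"4K"`, since the override takes precedence over the preset.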
Supported: 1:1, 16:9, 9:16, 4:3, 3:4, 2.35:1
imageConfig.aspectRatio
aspectRatio parameter
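A caller may want to validate `--ar` against the supported list before invoking the script. The sketch below is a hypothetical helper (`aspectRatioValue`), not something the skill is known to expose; it also returns the numeric width-to-height ratio, which is handy for layout calculations.

```typescript
const SUPPORTED_RATIOS: string[] = ["1:1", "16:9", "9:16", "4:3", "3:4", "2.35:1"];

function aspectRatioValue(ar: string): number {
  if (!SUPPORTED_RATIOS.includes(ar)) {
    throw new Error(`Unsupported aspect ratio: ${ar}`);
  }
  // Every supported entry has the form "width:height".
  const parts = ar.split(":");
  return Number(parts[0]) / Number(parts[1]);
}
```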
Default: Sequential generation (one image at a time). This ensures stable output and easier debugging.
Parallel Generation: Only use when user explicitly requests parallel/concurrent generation.
| Mode | When to Use |
|------|-------------|
| Sequential (default) | Normal usage, single images, small batches |
| Parallel | User explicitly requests, large batches (10+) |
Parallel Settings (when requested):
| Setting | Value |
|---------|-------|
| Recommended concurrency | 4 subagents |
| Max concurrency | 8 subagents |
| Use case | Large batch generation when user requests parallel |
Agent Implementation (parallel mode only):
# Launch multiple generations in parallel using Task tool
# Each Task runs as background subagent with run_in_background=true
# Collect results via TaskOutput when all complete
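The concurrency limits above can be illustrated by splitting a batch into waves of bounded size. This is a simplified sketch and not the Task tool mechanism itself: `planWaves` is a hypothetical helper, and it assumes each wave finishes before the next one starts.

```typescript
const RECOMMENDED_CONCURRENCY = 4;
const MAX_CONCURRENCY = 8;

// Split jobs into waves; each wave holds at most `concurrency` jobs,
// and the requested concurrency is clamped to the range 1..MAX_CONCURRENCY.
function planWaves<T>(jobs: T[], concurrency: number = RECOMMENDED_CONCURRENCY): T[][] {
  const width = Math.min(Math.max(1, Math.floor(concurrency)), MAX_CONCURRENCY);
  const waves: T[][] = [];
  for (let i = 0; i < jobs.length; i += width) {
    waves.push(jobs.slice(i, i + width));
  }
  return waves;
}
```

With ten prompts and the recommended setting, this produces waves of 4, 4, and 2 jobs; a request for more than 8 at once is capped at 8.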
gemini-3-pro-image-preview, gemini-3.1-flash-image-preview; or OpenAI GPT Image edits)
...
After installing Baoyu Image Gen, you can trigger it by saying things like these to the AI:
Help me get started with Baoyu Image Gen
Explains what Baoyu Image Gen does, walks through the setup, and runs a quick demo based on your current project
Use Baoyu Image Gen to generate AI images with the OpenAI, Google, DashScope and Replicate APIs
Invokes Baoyu Image Gen with the right parameters and returns the result directly in the conversation
What can I do with Baoyu Image Gen in my design & creative workflow?
Lists the top use cases for Baoyu Image Gen, with example commands for each scenario
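Place the skill folder in ~/.claude/skills/baoyu-image-gen/ (personal level, available to all projects) or .claude/skills/baoyu-image-gen/ (project level). After restarting the AI client, invoke it explicitly with /baoyu-image-gen, or let the AI discover and use it automatically based on context.
Baoyu Image Gen supports Claude, Cursor, and OpenClaw, and integrates with these AI platforms to extend their capabilities.
Baoyu Image Gen is free to install and use. Check the repository for license information.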
Baoyu Image Gen belongs to the "Design & Creative" category, whose skills help AI agents carry out specialized tasks in this domain.
Automate my design & creative tasks using Baoyu Image Gen
Identifies repetitive steps in your workflow and sets up Baoyu Image Gen to handle them automatically