Image Gen

image-gen

Generate images using multiple AI models — Midjourney (via Legnext.ai), Flux, Nano Banana Pro (Gemini), Ideogram, Recraft, and more via fal.ai. Intelligently...

Source: ClawHub. View on ClawSkills.

3.4k downloads

4 favorites

45 views

About Image Gen

---
name: image-gen
description: Generate images using multiple AI models — Midjourney (via Legnext.ai), Flux, Nano Banana Pro (Gemini), Ideogram, Recraft, and more via fal.ai. Intelligently routes to the best model based on use case.
homepage: https://legnext.ai
metadata: {"openclaw":{"emoji":"🎨","primaryEnv":"FAL_KEY","requires":{"env":["FAL_KEY","LEGNEXT_KEY"]},"install":[{"id":"node","kind":"node","package":"@fal-ai/client","label":"Install fal.ai client (npm)"}]}}
---
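
The metadata lists FAL_KEY and LEGNEXT_KEY as required environment variables. A minimal sketch of a pre-flight check follows; the helper name `missingEnvVars` is hypothetical and is not part of the skill or of generate.js.

```javascript
// Required variable names come from the skill metadata ("requires.env").
const REQUIRED_ENV = ["FAL_KEY", "LEGNEXT_KEY"];

// Hypothetical helper: returns the names of required variables that are
// unset or blank in the given environment object.
function missingEnvVars(env) {
  return REQUIRED_ENV.filter((name) => {
    const value = env[name];
    return typeof value !== "string" || value.trim() === "";
  });
}

const missingNow = missingEnvVars(process.env);
if (missingNow.length > 0) {
  console.error(`Missing environment variables: ${missingNow.join(", ")}`);
}
```

Running this check before the first generation surfaces a missing key immediately, rather than after a timeout.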

Image Generation Skill

This skill generates images using the best AI model for each use case. Model selection is the most important decision — read the dispatch logic carefully before generating.

---

🧠 Intelligent Dispatch Logic

Always select the model based on the user's actual need, not just the request surface.

Decision Tree

Does the request involve MULTIPLE images that share characters, scenes, or story continuity?
  ├─ YES → Use NANO BANANA (Gemini)
  │         Reason: Gemini understands context holistically; supports reference_images
  │         for character/scene consistency across a series (storyboard, comic, sequence)
  │
  └─ NO → Is it a SINGLE standalone image?
            ├─ Artistic / cinematic / painterly / highly detailed?
            │   → Use MIDJOURNEY
            │
            ├─ Photorealistic / portrait / product photo?
            │   → Use FLUX PRO
            │
            ├─ Contains TEXT (logo, poster, sign, infographic)?
            │   → Use IDEOGRAM
            │
            ├─ Vector / icon / flat design / brand asset?
            │   → Use RECRAFT
            │
            ├─ Quick draft / fast iteration (speed priority)?
            │   → Use FLUX SCHNELL (<2s)
            │
            └─ General purpose / balanced?
                → Use FLUX DEV
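
The decision tree above can be sketched as a small function. This is an illustration only: the function name `selectModel` and the boolean flags on the request object are hypothetical, and classifying a real request into these flags is left to the agent. The returned strings are the model IDs used by --model.

```javascript
// Hypothetical request shape: each flag mirrors one branch of the tree.
// The checks run in the same order as the decision tree.
function selectModel(req) {
  // Multiple images sharing characters, scenes, or story continuity.
  if (req.isSeries) return "nano-banana";
  // Single standalone image from here on.
  if (req.artistic) return "midjourney";
  if (req.photorealistic) return "flux-pro";
  if (req.containsText) return "ideogram";
  if (req.vector) return "recraft";
  if (req.speedPriority) return "flux-schnell";
  // General purpose / balanced.
  return "flux-dev";
}

console.log(selectModel({ isSeries: true, artistic: true }));
console.log(selectModel({ containsText: true }));
console.log(selectModel({}));
```

Because the series check comes first, a storyboard request that is also artistic still routes to Nano Banana, matching the top-level branch of the tree.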

Model Capability Matrix

| Model | ID | Artistic | Photorealism | Text | Context Continuity | Speed | Cost |
|---|---|---|---|---|---|---|---|
| Midjourney | midjourney | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐ | ❌ (no context) | ~30s | ~$0.05 |
| Nano Banana Pro | nano-banana | ⭐⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ~20s | $0.15 |
| Flux Pro | flux-pro | ⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐⭐ | ❌ | ~5s | ~$0.05 |
| Flux Dev | flux-dev | ⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐ | ❌ | ~8s | ~$0.03 |
| Flux Schnell | flux-schnell | ⭐⭐ | ⭐⭐⭐ | ⭐⭐ | ❌ | <2s | ~$0.003 |
| Ideogram v3 | ideogram | ⭐⭐⭐ | ⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ❌ | ~10s | ~$0.08 |
| Recraft v3 | recraft | ⭐⭐⭐⭐ | ⭐⭐ | ⭐⭐⭐⭐ | ❌ | ~8s | ~$0.04 |
| SDXL Lightning | sdxl | ⭐⭐⭐ | ⭐⭐⭐ | ⭐⭐ | ❌ | ~3s | ~$0.01 |

When to Use Nano Banana (Critical)

Use Nano Banana whenever the user's request involves:

Storyboard / 分镜图: Multiple frames that tell a story with the same characters
Comic strip / 漫画: Sequential panels with consistent characters
Character series: Multiple images of the same person/character in different poses or scenes
Scene continuation: "Now show the same girl in the forest" (referencing a previous image)
Style consistency: A set of images that must share the same visual style/world

Nano Banana uses Google's Gemini 3 Pro multimodal architecture, which understands context holistically rather than keyword-matching. It supports up to 14 reference images for maintaining character and scene consistency.

---

How to Use This Skill

1. Analyze the request: Is it a single image or a series? Does it need context continuity?
2. Select the model: Use the decision tree above.
3. Enhance the prompt: Add style, lighting, and quality descriptors appropriate for the model.
4. Inform the user: Tell them which model you're using and why, and that generation has started.
5. Run the script: Use the exec tool with a sufficient timeout.
6. Deliver the result: Send the image URL(s) to the user.

---

Calling the Generation Script

node {baseDir}/generate.js \
  --model <model_id> \
  --prompt "<enhanced prompt>" \
  [--aspect-ratio <ratio>] \
  [--num-images <1-4>] \
  [--negative-prompt "<negative prompt>"] \
  [--reference-images "<url1,url2,...>"] \
  [--mode <turbo|fast|relax>]

Parameters:

--model: One of midjourney, flux-pro, flux-dev, flux-schnell, sdxl, nano-banana, ideogram, recraft
--prompt: The image generation prompt (required)
--aspect-ratio: e.g. 16:9, 1:1, 9:16, 4:3, 3:4 (default: 1:1)
--num-images: 1-4 (default: 1; Midjourney always returns 4 regardless)
--negative-prompt: Things to avoid (not supported by Midjourney)
--reference-images: Comma-separated image URLs for context/character consistency (Nano Banana only)
--mode: Midjourney speed: turbo (default, ~20-40s), fast (~30-60s), relax (free but slow)

exec timeout: Set at least 120 seconds for Midjourney and Nano Banana; 30 seconds is sufficient for Flux Schnell.
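
The parameter rules above can be checked before the script is run. The sketch below assembles an argument list for generate.js and picks an exec timeout. The helper names `buildArgs` and `execTimeoutSeconds` and the option names are hypothetical. The 120-second fallback for models with no stated timeout is an assumption, not something the skill specifies.

```javascript
// Model IDs and flags are taken from the parameter list above.
const MODEL_IDS = [
  "midjourney", "flux-pro", "flux-dev", "flux-schnell",
  "sdxl", "nano-banana", "ideogram", "recraft",
];
const MIDJOURNEY_MODES = ["turbo", "fast", "relax"];

// Hypothetical helper: validates options and returns the argv array
// to pass after "node {baseDir}/generate.js".
function buildArgs(opts) {
  if (!MODEL_IDS.includes(opts.model)) {
    throw new Error(`Unknown model: ${opts.model}`);
  }
  if (typeof opts.prompt !== "string" || opts.prompt.length === 0) {
    throw new Error("--prompt is required");
  }
  const args = ["--model", opts.model, "--prompt", opts.prompt];
  if (opts.aspectRatio) args.push("--aspect-ratio", opts.aspectRatio);
  if (opts.numImages !== undefined) {
    const n = opts.numImages;
    if (!Number.isInteger(n) || n < 1 || n > 4) {
      throw new Error("--num-images must be an integer from 1 to 4");
    }
    args.push("--num-images", String(n));
  }
  if (opts.negativePrompt) {
    if (opts.model === "midjourney") {
      throw new Error("--negative-prompt is not supported by Midjourney");
    }
    args.push("--negative-prompt", opts.negativePrompt);
  }
  if (opts.referenceImages && opts.referenceImages.length > 0) {
    if (opts.model !== "nano-banana") {
      throw new Error("--reference-images is Nano Banana only");
    }
    args.push("--reference-images", opts.referenceImages.join(","));
  }
  if (opts.mode) {
    if (!MIDJOURNEY_MODES.includes(opts.mode)) {
      throw new Error("--mode must be turbo, fast, or relax");
    }
    args.push("--mode", opts.mode);
  }
  return args;
}

// 30 s is stated as sufficient for Flux Schnell; Midjourney and Nano Banana
// need at least 120 s. Assumption: every other model also gets 120 s.
function execTimeoutSeconds(model) {
  return model === "flux-schnell" ? 30 : 120;
}

const demo = buildArgs({ model: "flux-schnell", prompt: "a red bicycle", aspectRatio: "16:9" });
console.log(demo.join(" "), "| timeout:", execTimeoutSeconds("flux-schnell"));
```

Returning an array instead of a single command string lets the caller hand the arguments to a process-spawning API without shell quoting concerns.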

---

⚡ Midjourney Workflow (Sync Mode — No --async)

Always use sync mode (no --async). The script waits internally until complete.

node {baseDir}/generate.js \
  --model midjourney \
  --prompt "<enhanced prompt>" \
  --aspect-ratio 16:9

Understanding Midjourney Output

{
  "success": true,
  "model": "midjourney",
  "jobId": "xxxxxxxx-...",
  "imageUrl": "https://cdn.legnext.ai/temp/....png",
  "imageUrls": [
    "https://cdn.legnext.ai/mj/xxxx_0.png",
    "https://cdn.legnext.ai/mj/xxxx_1.png",
    "https://cdn.legnext.ai/mj/xxxx_2.png",
    "https://cdn.legnext.ai/mj/xxxx_3.png"
  ]
}

CRITICAL — image field meanings:

| Field | What it is | When to use |
|---|---|---|
| imageUrl | A 2×2 grid composite of all 4 images | Send as preview so user can see all options |
| imageUrls[0] | Image 1 (top-left) | Send when user wants image 1 |
| imageUrls[1] | Image 2 (top-right) | Send when user wants image 2 |
| imageUrls[2] | Image 3 (bottom-left) | Send when user wants image 3 |
| imageUrls[3] | Image 4 (bottom-right) | Send when user wants image 4 |

When the user says "放大第N张" ("upscale image N"), "要第N张" ("I want image N"), or "give me image N", send imageUrls[N-1] directly. Do NOT call generate.js again.
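
The mapping from a user's 1-based pick to the 0-based imageUrls array can be sketched as follows. The helper name `pickImage` is hypothetical, and the example.com URLs are placeholders standing in for real output.

```javascript
// Hypothetical helper: returns the single-image URL for a 1-based pick
// from an already completed Midjourney result. No new generation happens.
function pickImage(result, n) {
  const urls = result.imageUrls || [];
  if (!Number.isInteger(n) || n < 1 || n > urls.length) {
    throw new RangeError(`Image number must be between 1 and ${urls.length}`);
  }
  return urls[n - 1];
}

// Placeholder result with the same shape as the output shown above.
const sampleResult = {
  success: true,
  model: "midjourney",
  imageUrl: "https://example.com/grid.png",
  imageUrls: [
    "https://example.com/img_0.png",
    "https://example.com/img_1.png",
    "https://example.com/img_2.png",
    "https://example.com/img_3.png",
  ],
};

console.log(pickImage(sampleResult, 2)); // second image (top-right)
```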

Midjourney Interaction Flow

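After generation:

> 🎨 Generation complete! Here is a preview of all 4 images:
> [preview image]
> Which one do you like? Reply 1, 2, 3, or 4 and I'll send you that image on its own in full resolution.

When the user picks image N:

> Here is image N on its own in full resolution:
> [image N]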

---

🤖 Nano Banana (Gemini) Workflow

Use for storyboards, character series, and any context-dependent multi-image generation.

Single image (no reference)

node {baseDir}/generate.js \
  --model nano-banana \
  --prompt "<detailed scene description>" \
  --aspect-ratio 16:9

With reference images (character/scene consistency)

node {baseDir}/generate.js \
  --model nano-banana \
  --prompt "<scene description, referencing the character/style from the reference images>" \
  --aspect-ratio 16:9 \
  --reference-images "https://url-of-previous-image-1.png,https://url-of-previous-image-2.png"

How to build a storyboard series:

1. Generate the first frame without reference images (this establishes the character/scene).
2. Use the first frame's URL as --reference-images for the second frame.
3. For subsequent frames, use the most recent 1-3 images as references to maintain consistency.
4. Keep the character description consistent across all prompts.
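
The reference-selection rule in the steps above (no references for the first frame, then the most recent 1-3 images) can be sketched as below. The helper names are hypothetical, and the cap of 3 follows the guidance in the steps, not a limit enforced by the script.

```javascript
// Hypothetical helper: given the URLs of all frames generated so far
// (oldest first), return the ones to pass as references for the next frame.
function referencesForNextFrame(previousUrls, maxRefs = 3) {
  // Clamp to the 1-3 range recommended in the steps above.
  const count = Math.min(Math.max(maxRefs, 1), 3);
  return previousUrls.slice(-count);
}

// Builds the optional flag pair; the first frame gets no flag at all.
function referenceFlag(previousUrls) {
  const refs = referencesForNextFrame(previousUrls);
  return refs.length > 0 ? ["--reference-images", refs.join(",")] : [];
}

console.log(referenceFlag([]));
console.log(referenceFlag(["f1.png", "f2.png", "f3.png", "f4.png"]));
```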

Example storyboard workflow:

Frame 1: node generate.js --model nano-banana --prompt "A young girl with red hair, wearing a blue dress, sitting under a magical treehouse in an enchanted forest, warm golden light, storybook illustration style" --aspect-ratio 16:9

Frame 2: node generate.js --model nano-banana --prompt "The same red-haired girl in blue dress climbing the rope ladder up to the treehouse, excited expression, enchanted forest background, same storybook illustration style" --aspect-ratio 16:9 --reference-images "<frame1_url>"

...

Prompt examples

After installing Image Gen, you can say things like the following to the AI to trigger it:

User: Help me get started with Image Gen

A: Explains what Image Gen does, walks through the setup, and runs a quick demo based on your current project

User: Use Image Gen to generate images using multiple AI models — Midjourney (via Legnext

A: Invokes Image Gen with the right parameters and returns the result directly in the conversation

User: What can I do with Image Gen in my design & creative workflow?

A: Lists the top use cases for Image Gen, with example commands for each scenario

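FAQ

How do I install Image Gen?

Place the skill folder in ~/.claude/skills/image-gen/ (personal level, available to all projects) or in .claude/skills/image-gen/ (project level). After restarting the AI client, invoke it explicitly with /image-gen, or let the AI discover and use it automatically based on context.

Which AI platforms does Image Gen support?

Image Gen supports Claude, Cursor, and OpenClaw, and integrates with these platforms to extend their capabilities.

Is Image Gen free?

Image Gen is free to install and use. See the repository for license information.

What does Image Gen do?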

Generate images using multiple AI models — Midjourney (via Legnext.ai), Flux, Nano Banana Pro (Gemini), Ideogram, Recraft, and more via fal.ai. Intelligently...

Which category does Image Gen belong to?

Image Gen belongs to the "Design & Creative" category, whose skills help AI agents carry out specialized tasks in this domain.

Use cases

Getting Started with Image Gen
Automate Design & Creative Workflows with Image Gen
Team Collaboration with Image Gen