按需生成有声读物、播客或教育音频内容。用户提供想法或主题,Claude AI 编写脚本,ElevenLabs 将其转换为高质量音频。支持多种格式(有声读物、播客、教育)、自定义长度和语音效果。当要求创建时使用
数据来源:ClawHub。 在 ClawSkills 查看
选择你使用的 Agent
方法一:命令行安装(推荐)
推荐(无需提前安装 clawhub)
npx clawhub@latest --dir ~/.claude/skills install audio-gen或使用 clawhub CLI(需提前安装)
clawhub --dir ~/.claude/skills install audio-gen⚠️ 需要 Node.js 18+,没有 Node?请使用下方方法二直接下载 ZIP。 安装 Node.js →
方法二:手动下载安装(无需 Node)
下载 ZIP,解压后将文件夹放到以下路径,重启 Agent 即可:
安装路径
~/.claude/skills/audio-gen/💡解压后将文件夹放到上方路径,重启 Agent 即可生效
--- name: audio-gen description: Generate audiobooks, podcasts, or educational audio content on demand. User provides an idea or topic, Claude AI writes a script, and ElevenLabs converts it to high-quality audio. Supports multiple formats (audiobook, podcast, educational), custom lengths, and voice effects. Use when asked to create audio content, make a podcast, generate an audiobook, or produce educational audio. Returns MP3 audio file via MEDIA token. homepage: https://github.com/clawdbot/clawdbot metadata: {"clawdbot":{"emoji":"🎙️","requires":{"skills":["sag"],"env":["ANTHROPIC_API_KEY","ELEVENLABS_API_KEY"]},"primaryEnv":"ANTHROPIC_API_KEY"}} ---
Generate high-quality audiobooks, podcasts, or educational audio content on demand using AI-written scripts and ElevenLabs text-to-speech.
Create an audiobook chapter:
User: "Create a 5-minute audiobook chapter about a dragon discovering friendship"
Generate a podcast:
User: "Make a 10-minute podcast about the history of coffee"
Produce educational content:
User: "Generate a 15-minute educational audio explaining how neural networks work"
Style: Narrative storytelling with emotional depth
[whispers], [excited], [serious] for impactExample Structure:
[Opening hook - set the scene]
[long pause]
[Story development with character emotions]
[short pause] between sentences
[long pause] between paragraphs
[Climax with dramatic tension]
[long pause]
[Resolution and emotional closure]
Style: Conversational and engaging
Example Structure:
**Intro:** "Welcome to [topic]. I'm excited to share..."
[short pause]
**Main Content:** "Let's start with... [topic 1]"
[long pause] between segments
**Outro:** "Thanks for listening! Remember..."
Style: Clear explanations for learning
[excited] for important pointsExample Structure:
**Introduction:** What is [topic] and why it matters?
**Main Content:**
- Concept 1: Explanation + Example
- Concept 2: Explanation + Example
- Concept 3: Explanation + Example
**Summary:** Key takeaways and next steps
Word Count to Duration Conversion:
Pacing: Average conversational speed is ~75 words per minute
Practical Limits:
Parse the user's request for:
target_words = target_minutes × 75
Example: 10 minutes = 10 × 75 = 750 words
Write the complete script following these rules:
Content Guidelines:
Formatting Rules:
[short pause] after sentences (use sparingly, not every sentence)[long pause] between paragraphs or major sections[whispers], [shouts], [excited], [serious], [sarcastic], [sings], [laughs]Show the script to the user and ask:
Here's the [format] script I've created (approximately [length] minutes):
[Display the script]
Would you like me to:
1. Generate the audio now
2. Make changes to the script
3. Adjust the length or tone
If user requests changes:
If user approves:
Format the script for TTS:
[effect] formatInvoke the TTS script:
IMPORTANT: The ELEVENLABS_API_KEY environment variable is already configured in the system. Simply invoke the TTS script directly.
uv run /home/clawdbot/clawdbot/skills/sag/scripts/tts.py \
-o /tmp/audio-gen-[timestamp]-[topic-slug].mp3 \
-m eleven_multilingual_v2 \
"[formatted_script]"
For long scripts, use heredoc:
uv run /home/clawdbot/clawdbot/skills/sag/scripts/tts.py \
-o /tmp/audio-gen-[timestamp]-[topic-slug].mp3 \
-m eleven_multilingual_v2 \
"$(cat <<'EOF'
[formatted_script]
EOF
)"
Return the result:
MEDIA:/tmp/audio-gen-[timestamp]-[topic-slug].mp3
Your [format] is ready! [Brief description of content]. Duration: approximately [X] minutes.
Available voice modulation effects (use sparingly for impact):
[whispers] - Soft, intimate delivery[shouts] - Loud, emphatic delivery[excited] - Enthusiastic, energetic tone[serious] - Grave, solemn tone[sarcastic] - Ironic, mocking tone[sings] - Musical, melodic delivery[laughs] - Amused, jovial tone[short pause] - Brief silence (~0.5s)[long pause] - Extended silence (~1-2s)Best Practices:
If the generated script exceeds target by >20%:
The script I generated is [X] words ([Y] minutes), which is longer than your target of [Z] minutes. Would you like me to:
1. Condense it to fit the target length
2. Split it into multiple parts
3. Keep it as is
If the generated script is under target by >20%:
The script is [X] words ([Y] minutes), shorter than your target. Would you like me to:
1. Expand it with more detail
2. Add additional examples or stories
3. Generate as is
If the TTS script fails:
I've created the script, but I'm unable to generate the audio right now. Here's your script:
[Display script]
Error: [specific error message]
You can:
1. Check that ELEVENLABS_API_KEY is configured
2. Use the script with your own text-to-speech tool
3. Try again in a moment
4. Ask me to troubleshoot the audio generation
Common TTS Issues:
...
安装 音频内容生成器 后,可以对 AI 说这些话来触发它
Help me get started with Audio Content Generator
Explains what Audio Content Generator does, walks through the setup, and runs a quick demo based on your current project
Use Audio Content Generator to generate audiobooks, podcasts, or educational audio content on demand
Invokes Audio Content Generator with the right parameters and returns the result directly in the conversation
What can I do with Audio Content Generator in my design & creative workflow?
Lists the top use cases for Audio Content Generator, with example commands for each scenario
将技能文件夹放到 ~/.claude/skills/audio-gen/ 目录(个人级,所有项目可用),或 .claude/skills/audio-gen/(项目级)。重启 AI 客户端后,用 /audio-gen 主动调用,或让 AI 根据上下文自动发现并使用。
音频内容生成器 支持 Claude、Cursor、OpenClaw,可与这些 AI 平台无缝集成,扩展其能力。
音频内容生成器 可免费安装使用。请查阅仓库了解许可证信息。
按需生成有声读物、播客或教育音频内容。用户提供想法或主题,Claude AI 编写脚本,ElevenLabs 将其转换为高质量音频。支持多种格式(有声读物、播客、教育)、自定义长度和语音效果。当要求创建时使用
音频内容生成器 属于「Design & Creative」分类,该分类的技能帮助 AI 智能体在此领域执行专业任务。
Automate my design & creative tasks using Audio Content Generator
Identifies repetitive steps in your workflow and sets up Audio Content Generator to handle them automatically