Convert text to natural speech with DIA TTS, Kokoro, Chatterbox, and more via inference.sh CLI. Models: DIA TTS (conversational), Kokoro TTS, Chatterbox, Hig...
数据来源:ClawHub。 在 ClawSkills 查看
选择你使用的 Agent
方法一:命令行安装(推荐)
推荐(无需提前安装 clawhub)
npx clawhub@latest --dir ~/.claude/skills install text-to-speech或使用 clawhub CLI(需提前安装)
clawhub --dir ~/.claude/skills install text-to-speech⚠️ 需要 Node.js 18+,没有 Node?请使用下方方法二直接下载 ZIP。 安装 Node.js →
方法二:手动下载安装(无需 Node)
下载 ZIP,解压后将文件夹放到以下路径,重启 Agent 即可:
安装路径
~/.claude/skills/text-to-speech/💡解压后将文件夹放到上方路径,重启 Agent 即可生效
--- name: text-to-speech description: "Convert text to natural speech with DIA TTS, Kokoro, Chatterbox, and more via inference.sh CLI. Models: DIA TTS (conversational), Kokoro TTS, Chatterbox, Higgs Audio, VibeVoice (podcasts). Capabilities: text-to-speech, voice cloning, multi-speaker dialogue, podcast generation, expressive speech. Use for: voiceovers, audiobooks, podcasts, accessibility, video narration, IVR, voice assistants. Triggers: text to speech, tts, voice generation, ai voice, speech synthesis, voice over, generate speech, ai narrator, voice cloning, text to audio, elevenlabs alternative, voice ai, ai voiceover, speech generator, natural voice" allowed-tools: Bash(infsh *) ---
Convert text to natural speech via inference.sh CLI.
# Install CLI
curl -fsSL https://cli.inference.sh | sh && infsh login
# Generate speech
infsh app run infsh/kokoro-tts --input '{"text": "Hello, welcome to our product demo."}'
> Install note: The install script only detects your OS/architecture, downloads the matching binary from dist.inference.sh, and verifies its SHA-256 checksum. No elevated permissions or background processes. Manual install & verification available.
| Model | App ID | Best For | |-------|--------|----------| | DIA TTS | infsh/dia-tts | Conversational, expressive | | Kokoro TTS | infsh/kokoro-tts | Fast, natural | | Chatterbox | infsh/chatterbox | General purpose | | Higgs Audio | infsh/higgs-audio | Emotional control | | VibeVoice | infsh/vibevoice | Podcasts, long-form |
infsh app list --category audio
infsh app run infsh/kokoro-tts --input '{"text": "Welcome to our tutorial."}'
infsh app sample infsh/dia-tts --save input.json
# Edit input.json:
# {
# "text": "Hey! How are you doing today? I'm really excited to share this with you.",
# "voice": "conversational"
# }
infsh app run infsh/dia-tts --input input.json
infsh app sample infsh/vibevoice --save input.json
# Edit input.json with your podcast script
infsh app run infsh/vibevoice --input input.json
infsh app sample infsh/higgs-audio --save input.json
# {
# "text": "This is absolutely incredible!",
# "emotion": "excited"
# }
infsh app run infsh/higgs-audio --input input.json
Generate speech, then create a talking head video:
# 1. Generate speech
infsh app run infsh/kokoro-tts --input '{"text": "Your script here"}' > speech.json
# 2. Use the audio URL with OmniHuman for avatar video
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "<audio-url-from-step-1>"
}'
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@inference-sh
# AI avatars (combine TTS with talking heads)
npx skills add inference-sh/skills@ai-avatar-video
# AI music generation
npx skills add inference-sh/skills@ai-music-generation
# Speech-to-text (transcription)
npx skills add inference-sh/skills@speech-to-text
# Video generation
npx skills add inference-sh/skills@ai-video-generation
Browse all apps: infsh app list
安装 Text To Speech 后,可以对 AI 说这些话来触发它
Help me get started with Text To Speech
Explains what Text To Speech does, walks through the setup, and runs a quick demo based on your current project
Use Text To Speech to convert text to natural speech with DIA TTS, Kokoro, Chatterbox, an...
Invokes Text To Speech with the right parameters and returns the result directly in the conversation
What can I do with Text To Speech in my design & creative workflow?
Lists the top use cases for Text To Speech, with example commands for each scenario
将技能文件夹放到 ~/.claude/skills/text-to-speech/ 目录(个人级,所有项目可用),或 .claude/skills/text-to-speech/(项目级)。重启 AI 客户端后,用 /text-to-speech 主动调用,或让 AI 根据上下文自动发现并使用。
Text To Speech 支持 Claude、Cursor、OpenClaw,可与这些 AI 平台无缝集成,扩展其能力。
Text To Speech 可免费安装使用。请查阅仓库了解许可证信息。
Convert text to natural speech with DIA TTS, Kokoro, Chatterbox, and more via inference.sh CLI. Models: DIA TTS (conversational), Kokoro TTS, Chatterbox, Hig...
Text To Speech 属于「Design & Creative」分类,该分类的技能帮助 AI 智能体在此领域执行专业任务。
Automate my design & creative tasks using Text To Speech
Identifies repetitive steps in your workflow and sets up Text To Speech to handle them automatically