Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
数据来源:ClawHub。 在 ClawSkills 查看
选择你使用的 Agent
方法一:命令行安装(推荐)
推荐(无需提前安装 clawhub)
npx clawhub@latest --dir ~/.claude/skills install edge-tts或使用 clawhub CLI(需提前安装)
clawhub --dir ~/.claude/skills install edge-tts⚠️ 需要 Node.js 18+,没有 Node?请使用下方方法二直接下载 ZIP。 安装 Node.js →
方法二:手动下载安装(无需 Node)
下载 ZIP,解压后将文件夹放到以下路径,重启 Agent 即可:
安装路径
~/.claude/skills/edge-tts/💡解压后将文件夹放到上方路径,重启 Agent 即可生效
--- name: edge-tts description: | Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output. ---
Generate high-quality text-to-speech audio using Microsoft Edge's neural TTS service via the node-edge-tts npm package. Supports multiple languages, voices, adjustable speed/pitch, and subtitle generation.
When you detect TTS intent from triggers or user request:
// Example: Built-in tts tool usage
tts("Your text to convert to speech")
// Returns: MEDIA: /path/to/audio.mp3
Recognize "tts" keyword as TTS requests. The skill automatically filters out TTS-related keywords from text before conversion to avoid converting the trigger words themselves to audio.
For more control, use the bundled scripts directly:
cd scripts
npm install
node tts-converter.js "Your text" --voice en-US-AriaNeural --rate +10% --output output.mp3
Options:
--voice, -v: Voice name (default: en-US-AriaNeural)--lang, -l: Language code (e.g., en-US, es-ES)--format, -o: Output format (default: audio-24khz-48kbitrate-mono-mp3)--pitch: Pitch adjustment (e.g., +10%, -20%, default)--rate, -r: Rate adjustment (e.g., +10%, -20%, default)--volume: Volume adjustment (e.g., +0%, -10%, default)--save-subtitles, -s: Save subtitles as JSON file--output, -f: Output file path (default: tts_output.mp3)--proxy, -p: Proxy URL (e.g., http://localhost:7890)--timeout: Request timeout in milliseconds (default: 10000)--list-voices, -L: List available voicescd scripts
npm install
node config-manager.js --set-voice en-US-AriaNeural
node config-manager.js --set-rate +10%
node config-manager.js --get
node config-manager.js --reset
Common voices (use --list-voices for full list):
English:
en-US-MichelleNeural (female, natural, default)en-US-AriaNeural (female, natural)en-US-GuyNeural (male, natural)en-GB-SoniaNeural (female, British)en-GB-RyanNeural (male, British)Other Languages:
es-ES-ElviraNeural (Spanish, Spain)fr-FR-DeniseNeural (French)de-DE-KatjaNeural (German)ja-JP-NanamiNeural (Japanese)zh-CN-XiaoxiaoNeural (Chinese)ar-SA-ZariyahNeural (Arabic)Rate values use percentage format:
"default": Normal speed"-20%" to "-10%": Slow, clear (tutorials, stories, accessibility)"+10%" to "+20%": Slightly fast (summaries)"+30%" to "+50%": Fast (news, efficiency)Choose audio quality based on use case:
audio-24khz-48kbitrate-mono-mp3: Standard quality (voice notes, messages)audio-24khz-96kbitrate-mono-mp3: High quality (presentations, content)audio-48khz-96kbitrate-stereo-mp3: Highest quality (professional audio, music)Main TTS conversion script using node-edge-tts. Generates audio files with customizable voice, rate, volume, pitch, and format. Supports subtitle generation and voice listing.
Manages persistent user preferences for TTS settings (voice, language, format, pitch, rate, volume). Stores config in ~/.tts-config.json.
NPM package configuration with node-edge-tts dependency.
Complete documentation for node-edge-tts npm package including:
Test different voices and preview audio quality at: https://tts.travisvn.com/
Refer to this when you need specific voice details or advanced features.
To use the bundled scripts:
cd /home/user/clawd/skills/public/tts-skill/scripts
npm install
This installs:
node-edge-tts - TTS librarycommander - CLI argument parsingtts tool for simple requests, or scripts/tts-converter.js for customizationRun the test script to verify TTS functionality:
cd /home/user/clawd/skills/public/edge-tts/scripts
npm test
This generates a test audio file and verifies the TTS service is working.
Test different voices and preview audio quality at: https://tts.travisvn.com/
Use the built-in tts tool for quick testing:
// Example: Test TTS with default settings
tts("This is a test of the TTS functionality.")
Verify configuration persistence:
cd /home/user/clawd/skills/public/edge-tts/scripts
node config-manager.js --get
node config-manager.js --set-voice en-US-GuyNeural
node config-manager.js --get
npm test to check if TTS service is accessiblenode tts-converter.js --list-voices to see available voicesnode tts-converter.js "test" --proxy http://localhost:7890test-output.mp3 in the scripts directory/tmp/edge-tts-temp/ on Unix, C:\Users\\AppData\Local\Temp\edge-tts-temp\ on Windows) with unique filenames (e.g., tts_1234567890_abc123.mp3). Files are not automatically deleted - the calling application (Clawdbot) should handle cleanup after use. You can specify a custom output path with the --output option if permanent storage is needed.config-manager.js to set defaultsen-US-MichelleNeural (female, natural)Neural) provide higher quality than Standard voices安装 Edge TTS 后,可以对 AI 说这些话来触发它
Help me get started with Edge TTS
Explains what Edge TTS does, walks through the setup, and runs a quick demo based on your current project
Use Edge TTS to text-to-speech conversion using node-edge-tts npm package for gener...
Invokes Edge TTS with the right parameters and returns the result directly in the conversation
What can I do with Edge TTS in my design & creative workflow?
Lists the top use cases for Edge TTS, with example commands for each scenario
将技能文件夹放到 ~/.claude/skills/edge-tts/ 目录(个人级,所有项目可用),或 .claude/skills/edge-tts/(项目级)。重启 AI 客户端后,用 /edge-tts 主动调用,或让 AI 根据上下文自动发现并使用。
Edge TTS 支持 Claude、Cursor、OpenClaw,可与这些 AI 平台无缝集成,扩展其能力。
Edge TTS 可免费安装使用。请查阅仓库了解许可证信息。
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Edge TTS 属于「Design & Creative」分类,该分类的技能帮助 AI 智能体在此领域执行专业任务。
Automate my design & creative tasks using Edge TTS
Identifies repetitive steps in your workflow and sets up Edge TTS to handle them automatically