Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.
数据来源:ClawHub。 在 ClawSkills 查看
选择你使用的 Agent
方法一:命令行安装(推荐)
推荐(无需提前安装 clawhub)
npx clawhub@latest --dir ~/.claude/skills install asr或使用 clawhub CLI(需提前安装)
clawhub --dir ~/.claude/skills install asr⚠️ 需要 Node.js 18+,没有 Node?请使用下方方法二直接下载 ZIP。 安装 Node.js →
方法二:手动下载安装(无需 Node)
下载 ZIP,解压后将文件夹放到以下路径,重启 Agent 即可:
安装路径
~/.claude/skills/asr/💡解压后将文件夹放到上方路径,重启 Agent 即可生效
Fast, accurate, and incredibly inexpensive automatic speech-to-text transcription service.
Sign up at speechischeap.com. Use code CH5 for $5 off.
This skill looks for your API key in the SIC_API_KEY environment variable.
Add this to your .env or agent config:
SIC_API_KEY=your_key_here
When this skill is installed, you can transcribe any URL from an OpenClaw session and get the JSON results immediately by running: ./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3"
# Basic transcription
./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3"
# Advanced transcription with options
./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3" \
--speakers --words --labels \
--language "en" \
--format "srt" \
--private
Perfect for processing audio already on your disk. This handles the upload automatically.
# Upload and transcribe local media
./skills/asr/scripts/asr.sh transcribe --file "./local-audio.wav"
# Upload with webhook callback
./skills/asr/scripts/asr.sh transcribe --file "./local-audio.wav" --webhook "https://mysite.com/callback"
# Note: For local files, the skill handles the multi-part upload to
# https://upload.speechischeap.com before starting the transcription.
--speakers: Enable speaker diarization--words: Enable word-level timestamps--labels: Enable audio labeling (music, noise, etc.)--stream: Enable streaming output--private: Do not store audio/transcript (privacy mode)--language : ISO language code (e.g., 'en', 'es')--confidence : Minimum confidence threshold (default 0.5)--format : Output format (json, srt, vtt, webvtt)--webhook : URL to receive job completion payload--segment-duration : Segment duration in seconds (default 30)./skills/asr/scripts/asr.sh status "job-id-here"
The asr.sh command-line tool returns JSON by default when successful, making it easy to pipe into other tools or parse directly.
If the SIC_API_KEY is missing, the tool will provide a clear error message and a direct link to the signup page.
安装 Speech is Cheap Transcribe 后,可以对 AI 说这些话来触发它
Help me get started with Speech is Cheap Transcribe
Explains what Speech is Cheap Transcribe does, walks through the setup, and runs a quick demo based on your current project
Use Speech is Cheap Transcribe to fast, affordable automatic speech-to-text transcription supporting ...
Invokes Speech is Cheap Transcribe with the right parameters and returns the result directly in the conversation
What can I do with Speech is Cheap Transcribe in my design & creative workflow?
Lists the top use cases for Speech is Cheap Transcribe, with example commands for each scenario
将技能文件夹放到 ~/.claude/skills/asr/ 目录(个人级,所有项目可用),或 .claude/skills/asr/(项目级)。重启 AI 客户端后,用 /asr 主动调用,或让 AI 根据上下文自动发现并使用。
Speech is Cheap Transcribe 支持 Claude、Cursor、OpenClaw,可与这些 AI 平台无缝集成,扩展其能力。
Speech is Cheap Transcribe 可免费安装使用。请查阅仓库了解许可证信息。
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.
Speech is Cheap Transcribe 属于「Design & Creative」分类,该分类的技能帮助 AI 智能体在此领域执行专业任务。
Automate my design & creative tasks using Speech is Cheap Transcribe
Identifies repetitive steps in your workflow and sets up Speech is Cheap Transcribe to handle them automatically