Use Case
Stop doing repetitive design & creative tasks manually. speech-recognition lets your AI agent handle them automatically through natural conversation. Universal speech recognition skill. Supports multiple audio formats (ogg/mp3/wav/m4a) and uses the silicon-based mobile SenseVoice API for speech-to-text. Triggered when the user sends a voice message, audio file, or audio needs to be transcribed. This guide shows practical examples of using speech-recognition to automate common design & creative workflows and save hours every week.
Install speech-recognition: npx clawhub@latest --dir ~/.claude/skills install speech-recognition
Identify the repetitive design & creative tasks you want to automate
Describe the task to your AI in plain English
speech-recognition will execute the task and return results directly in the chat
Chain multiple tasks: ask your AI to run a sequence of operations
Copy these prompts and use them with your AI agent after installing speech-recognition
Automate my design & creative tasks using speech-recognition
What repetitive tasks can speech-recognition handle for me?
Set up a workflow that runs speech-recognition every morning
Select your agent
Option 1: Install via CLI (recommended)
Recommended (no pre-install needed)
npx clawhub@latest --dir ~/.claude/skills install speech-recognitionOr via clawhub CLI (if already installed)
clawhub --dir ~/.claude/skills install speech-recognitionβ οΈ Requires Node.js 18+. No Node? Use Option 2 below to download the ZIP instead. Install Node.js β
Option 2: Manual install (no Node required)
Download the ZIP, extract it, and place the folder at the path below. Restart your agent to activate.
Install path
~/.claude/skills/speech-recognition/π‘Extract and place the folder at the path above, then restart your agent.