DiscoverAISkills

S

Use Case

How to Get Started with speech-recognition in Your AI Workflow

speech-recognition is a powerful AI agent skill that extends your assistant with new capabilities. Universal speech recognition skill. Supports multiple audio formats (ogg/mp3/wav/m4a) and uses the silicon-based mobile SenseVoice API for speech-to-text. Triggered when the user sends a voice message, audio file, or audio needs to be transcribed. This guide walks you through installing speech-recognition, configuring it for your setup, and running your first commands — so you can start getting value in minutes.

Step-by-Step Guide

1
Install speech-recognition: npx clawhub@latest --dir ~/.claude/skills install speech-recognition
2
Restart your AI client (Claude Code, Cursor, Gemini CLI, or OpenClaw)
3
Type a natural language request related to design & creative to trigger speech-recognition
4
Review the output and refine your prompt for better results
5
Combine speech-recognition with other skills to build multi-step workflows

Example Prompts

Copy these prompts and use them with your AI agent after installing speech-recognition

Help me get started with speech-recognition

What can speech-recognition do for my design & creative workflow?

Show me an example of using speech-recognition

Installation

Select your agent

Option 1: Install via CLI (recommended)

Recommended (no pre-install needed)

$npx clawhub@latest --dir ~/.claude/skills install speech-recognition

Or via clawhub CLI (if already installed)

$clawhub --dir ~/.claude/skills install speech-recognition

⚠️

⚠️ Requires Node.js 18+. No Node? Use Option 2 below to download the ZIP instead. Install Node.js →

Option 2: Manual install (no Node required)

Download the ZIP, extract it, and place the folder at the path below. Restart your agent to activate.

Install path

🤖 Claude Code~/.claude/skills/speech-recognition/

View on ClawHub

💡Extract and place the folder at the path above, then restart your agent.

More Use Cases for speech-recognition

Automate Design & Creative Workflows with speech-recognition Team Collaboration with speech-recognition

Back to speech-recognition