Transcribe audio and video files to text with speaker detection, timestamps, and format conversion.
Data sourced from ClawHub. View on ClawSkills
Select your agent
Option 1: Install via CLI (recommended)
Recommended (no pre-install needed)
npx clawhub@latest --dir ~/.claude/skills install speech-to-text-transcriptionOr via clawhub CLI (if already installed)
clawhub --dir ~/.claude/skills install speech-to-text-transcriptionβ οΈ Requires Node.js 18+. No Node? Use Option 2 below to download the ZIP instead. Install Node.js β
Option 2: Manual install (no Node required)
Download the ZIP, extract it, and place the folder at the path below. Restart your agent to activate.
Install path
~/.claude/skills/speech-to-text-transcription/π‘Extract and place the folder at the path above, then restart your agent.
Category
π¨Design & CreativePlatforms
What Speech to Text Transcription can do for your AI workflow
Transcribe audio and video directly from your Claude conversation
Works across Claude, Cursor, OpenClaw β install once, use everywhere
One-command installation β no complex setup required
Combine with other skills to build powerful multi-step AI workflows
Try these prompts with your AI agent after installing Speech to Text Transcription
Help me get started with Speech to Text Transcription
Explains what Speech to Text Transcription does, walks through the setup, and runs a quick demo based on your current project
Use Speech to Text Transcription to transcribe audio and video files to text with speaker detection, ti...
Invokes Speech to Text Transcription with the right parameters and returns the result directly in the conversation
What can I do with Speech to Text Transcription in my design & creative workflow?
Lists the top use cases for Speech to Text Transcription, with example commands for each scenario
Guides & tutorials for AI skills
The 7 AI Skills Every Software Developer Should Have Installed in 2026
After testing dozens of developer-focused AI skills, these are the seven that have proven genuinely useful across different tech stacks and workflows β not just impressive demos, but tools that hold up under daily use.
MCP Skills vs Native Claude Tools: What's the Difference and When to Use Each
Claude comes with built-in capabilities, but MCP skills extend it in ways the base model can't. Here's a clear breakdown of what each type of tool is good for, with real examples of when to reach for a skill versus relying on Claude's native abilities.
Speech to Text Transcription extends your AI assistant with the ability to transcribe audio and video files to text with speaker detection, timestamps, and format conversion. Rather than leaving your conversation to handle this manually, you can ask your Claude agent directly β and it will take care of the task end-to-end, using Speech to Text Transcription as its underlying capability.
Speech to Text Transcription works across Claude, Cursor, OpenClaw through the Model Context Protocol (MCP) β an open standard that lets AI clients share tools and skills without lock-in. Because MCP is platform-agnostic by design, you install Speech to Text Transcription once and it becomes available across all your AI clients. Whether you're working in Claude for focused sessions or Cursor for integrated workflows, the skill behaves consistently.
Getting started with Speech to Text Transcription takes about two minutes. Place the skill at `~/.claude/skills/speech-to-text-transcription/` (personal, all projects) or `.claude/skills/speech-to-text-transcription/` (project-specific), then restart your AI client. From that point, typing `/speech-to-text-transcription` in any conversation activates it, or the AI will use it on its own when it detects a relevant request.
Speech to Text Transcription has 739 installs and is part of the growing Design & Creative skill ecosystem on DiscoverAISkills. Like all skills on DiscoverAISkills, it is free to install and use. The broader AI skills ecosystem continues to expand as developers contribute new capabilities across categories like developer tools, data analysis, writing, automation, and more.
Place the skill folder at ~/.claude/skills/speech-to-text-transcription/ for personal use (all projects), or .claude/skills/speech-to-text-transcription/ for project-specific use. Restart your AI client, then invoke with /speech-to-text-transcription or let the AI discover it automatically.
Speech to Text Transcription supports Claude, Cursor, OpenClaw. It integrates seamlessly with these AI platforms to extend their capabilities.
Speech to Text Transcription is free to install. Check the repository for licensing information.
Transcribe audio and video files to text with speaker detection, timestamps, and format conversion.
speech-recognition
Universal speech recognition skill. Supports multiple audio formats (ogg/mp3/wav/m4a) and uses the silicon-based mobile SenseVoice API for speech-to-text. Triggered when the user sends a voice message, audio file, or audio needs to be transcribed.
Speech is Cheap Transcribe
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.
Speech To Text
Transcribe audio to text with Whisper models via inference.sh CLI. Models: Fast Whisper Large V3, Whisper V3 Large. Capabilities: transcription, translation,...
Automate my design & creative tasks using Speech to Text Transcription
Identifies repetitive steps in your workflow and sets up Speech to Text Transcription to handle them automatically
Speech to Text Transcription is categorized under Design & Creative. These skills help AI agents perform specialized tasks in this domain.