V

Voice Agent

voice-agent

🌐 English

Local Voice Input/Output for Agents using the AI Voice Agent API.

数据来源：ClawHub。在 ClawSkills 查看

3.6k下载量

0收藏数

27浏览量

安装

选择你使用的 Agent

方法一：命令行安装（推荐）

关于 Voice Agent

--- name: voice-agent display-name: AI Voice Agent Backend version: 1.1.0 description: Local Voice Input/Output for Agents using the AI Voice Agent API. author: trevisanricardo homepage: https://github.com/ricardotrevisan/ai-conversational-skill user-invocable: true disable-model-invocation: false ---

Voice Agent

This skill allows you to speak and listen to the user using a local Voice Agent API. It is client-only and does not start containers or services. It uses local Whisper for Speech-to-Text transcription and AWS Polly for Text-to-Speech generation.

Prerequisite

Requires a running backend API at http://localhost:8000. Backend setup instructions are in this repository:

README.md
walkthrough.md
DOCKER_README.md

Behavior Guidelines

Audio First: When the user communicates via audio (files), your PRIMARY mode of response is Audio File.
Silent Delivery: When sending an audio response, DO NOT send a text explanation like "I sent an audio". Just send the audio file.
Workflow:

1. User sends audio. 2. Use transcribe to read it. 3. You think of a response. 4. Use synthesize to generate the audio file. 5. You send the file. 6. STOP. Do not add text commentary.

Failure Handling: If health fails or connection errors occur, do not attempt service management from this skill. Ask the user to start or fix the backend using the repository docs.

Tools

Transcribe File

To transcribe an audio file with local Whisper STT, run the client script with the transcribe command.

python3 {baseDir}/scripts/client.py transcribe "/path/to/audio/file.ogg"

Synthesize to File

To generate audio from text with AWS Polly TTS and save it to a file, run the client script with the synthesize command.

python3 {baseDir}/scripts/client.py synthesize "Text to speak" --output "/path/to/output.mp3"

Health Check

To check if the voice agent API is running and healthy:

python3 {baseDir}/scripts/client.py health

Prompt 示例

安装 Voice Agent 后，可以对 AI 说这些话来触发它

U

Help me get started with Voice Agent

A

Explains what Voice Agent does, walks through the setup, and runs a quick demo based on your current project

U

Use Voice Agent to local Voice Input/Output for Agents using the AI Voice Agent API

A

Invokes Voice Agent with the right parameters and returns the result directly in the conversation

U

What can I do with Voice Agent in my design & creative workflow?

A

Lists the top use cases for Voice Agent, with example commands for each scenario

常见问题

如何安装 Voice Agent？▾

将技能文件夹放到 ~/.claude/skills/voice-agent/ 目录（个人级，所有项目可用），或 .claude/skills/voice-agent/（项目级）。重启 AI 客户端后，用 /voice-agent 主动调用，或让 AI 根据上下文自动发现并使用。

Voice Agent 支持哪些 AI 平台？▾

Voice Agent 支持 Claude、Cursor、OpenClaw，可与这些 AI 平台无缝集成，扩展其能力。

Voice Agent 是免费的吗？▾

Voice Agent 可免费安装使用。请查阅仓库了解许可证信息。

Voice Agent 有什么功能？▾

Local Voice Input/Output for Agents using the AI Voice Agent API.

Voice Agent 属于哪个分类？▾

Voice Agent 属于「Design & Creative」分类，该分类的技能帮助 AI 智能体在此领域执行专业任务。

使用场景

Getting Started with Voice Agent→Automate Design & Creative Workflows with Voice Agent→Team Collaboration with Voice Agent→

Voice Agent

安装

关于 Voice Agent

Voice Agent

Prerequisite

Behavior Guidelines

Tools

Transcribe File

Synthesize to File

Health Check

Prompt 示例

常见问题

使用场景

同类技能推荐

Humanizer

Nano Banana Pro

Openai Whisper

YouTube Watcher