Walkie-Talkie Mode

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk"

Visit Website

Introduction

# Walkie-Talkie Mode

This skill automates the voice-to-voice loop on WhatsApp using local transcription and local TTS.

## Workflow

1. **Incoming Audio**: When a user sends an audio/ogg/opus file: - Use `tools/transcribe_voice.sh` to get the text. - Process the text as a normal user prompt.

2. **Outgoing Response**: - Instead of a text reply, generate speech using `bin/sherpa-onnx-tts`. - Send the resulting `.ogg` file back to the user as a voice note.

## Triggers

- User sends an audio message. - User says "activa modo walkie-talkie" or "hablemos por voz".

## Constraints

- Use local tools only (ffmpeg, whisper-cpp, sherpa-onnx-tts). - Maintain a fast response time (RTF < 0.5). - Always reply with BOTH text (for clarity) and audio.

## Manual Execution (Internal)

To respond with voice manually: ```bash bin/sherpa-onnx-tts /tmp/reply.ogg "Tu mensaje aquí" ``` Then send `/tmp/reply.ogg` via `message` tool with `filePath`.

Back

More Products

Slack

Communication

Use when you need to control Slack from Clawdbot via the slack tool, including reacting to messages or pinning/unpinning items in Slack channels or DMs.

2026-02-207.8K

AgentMail

Communication

API-first email platform designed for AI agents. Create and manage dedicated email inboxes, send and receive emails programmatically, and handle email-based wor

2026-02-204.3K

Communication

OpenClaw skill for designing Telegram Bot API workflows and command-driven conversations using direct HTTPS requests (no SDKs).

2026-02-202.8K