ClawSkills logoClawSkills

Elevenlabs

Text-to-speech, sound effects, music generation, voice management, and quota checks via the ElevenLabs API. Use when generating audio with ElevenLabs or managin

Introduction

# ElevenLabs Skill

Core tools for interacting with the ElevenLabs API for sound generation, music, and voice management.

## Setup

See [SETUP.md](SETUP.md) for prerequisites and setup instructions.

## Models

| Model | ID | Use Case | |-------|----|----------| | **Eleven v3** | `eleven_v3` | ⭐ Best for expressive/creative audio. Supports **audio tags** (square brackets): `[laughs]`, `[sighs]`, `[whispers]`, `[excited]`, `[grumpy voice]`, `[clears throat]`, etc. Use for storytelling, characters, demos. | | Multilingual v2 | `eleven_multilingual_v2` | Stable multilingual. No audio tags. Good for straightforward narration. | | Turbo v2.5 | `eleven_turbo_v2_5` | Low-latency, good for non-English (German TTS). Required for realtime/conversational. | | Flash v2.5 | `eleven_flash_v2_5` | Fastest, lowest cost. |

### v3 Audio Tags (square brackets, NOT XML/SSML) ``` [laughs], [chuckles], [sighs], [clears throat], [whispers], [shouts] [excited], [sad], [angry], [warmly], [deadpan], [sarcastic] [grumpy voice], [philosophical], [whiny voice], [resigned] [laughs hard], [sighs deeply], [pause] ``` Tags can be placed anywhere in text. Combine freely. v3 understands emotional context deeply.

## Output Formats

All scripts support multiple output formats via `--format`:

| Format | Description | |--------|-------------| | `mp3_44100_128` | MP3, 44.1kHz, 128kbps (default) | | `mp3_44100_192` | MP3, 44.1kHz, 192kbps | | `mp3_44100_96` | MP3, 44.1kHz, 96kbps | | `mp3_44100_64` | MP3, 44.1kHz, 64kbps | | `mp3_44100_32` | MP3, 44.1kHz, 32kbps | | `mp3_24000_48` | MP3, 24kHz, 48kbps | | `mp3_22050_32` | MP3, 22.05kHz, 32kbps | | `opus_48000_192` | Opus, 48kHz, 192kbps ⭐ best for AirPlay | | `opus_48000_128` | Opus, 48kHz, 128kbps | | `opus_48000_96` | Opus, 48kHz, 96kbps | | `opus_48000_64` | Opus, 48kHz, 64kbps | | `opus_48000_32` | Opus, 48kHz, 32kbps | | `pcm_16000` | Raw PCM, 16kHz | | `pcm_22050` | Raw PCM, 22.05kHz | | `pcm_24000` | Raw PCM, 24kHz | | `alaw_8000` | A-law, 8kHz (telephony) |

## Tools

### 1. Speech (`speech.py`) Text-to-speech using ElevenLabs voices.

```bash # Basic usage python3 {baseDir}/scripts/speech.py "Hello world" -v <voice_id> -o output.mp3

# With format option python3 {baseDir}/scripts/speech.py "Hello world" -v <voice_id> -o output.pcm --format pcm_44100

# With voice settings python3 {baseDir}/scripts/speech.py "Hello" -v <voice_id> -o out.mp3 --stability 0.7 --similarity 0.8 ```

### 2. Sound Effects (`sfx.py`) Generate sound effects and short audio clips.

```bash # Generate a sound python3 {baseDir}/scripts/sfx.py "Cinematic boom" -o boom.mp3

# Generate a loop python3 {baseDir}/scripts/sfx.py "Lo-fi hip hop beat" --duration 10 --loop -o beat.mp3

# Different format python3 {baseDir}/scripts/sfx.py "Whoosh" -o whoosh.pcm --format pcm_44100 ```

### 3. Music Generation (`music.py`) Generate full musical compositions or instrumental tracks.

```bash # Generate instrumental intro python3 {baseDir}/scripts/music.py --prompt "Upbeat 6s news intro sting, instrumental" --length-ms 6000 -o intro.mp3

# Generate background bed python3 {baseDir}/scripts/music.py --prompt "Soft ambient synth pad" --length-ms 30000 -o bed.mp3

# High quality MP3 python3 {baseDir}/scripts/music.py --prompt "Jazz piano" --length-ms 10000 -o jazz.mp3 --output-format mp3_44100_192 ```

### 4. Voices (`voices.py`) List available voices and their IDs.

```bash # List voices python3 {baseDir}/scripts/voices.py

# JSON output python3 {baseDir}/scripts/voices.py --json ```

### 5. Voice Cloning (`voiceclone.py`) Create instant voice clones from audio samples.

**Security:** by default this script will only read files from: - `~/.openclaw/elevenlabs/voiceclone-samples/`

Copy your samples there (or pass `--sample-dir`). Reading files outside the sample directory is blocked.

```bash # Clone from audio files (put samples into ~/.openclaw/elevenlabs/voiceclone-samples) python3 {baseDir}/scripts/voiceclone.py --name "MyVoice" --files sample1.mp3 sample2.mp3

# Use a custom sample dir python3 {baseDir}/scripts/voiceclone.py --name "Andi" --sample-dir ./samples --files a.m4a b.m4a --language de --gender male

# With description and noise removal python3 {baseDir}/scripts/voiceclone.py --name "Andi" --files a.m4a b.m4a --description "German male" --denoise ```

### 6. Quota & Usage (`quota.py`) Check subscription quota and usage statistics.

```bash # Show current quota python3 {baseDir}/scripts/quota.py

# Include usage breakdown by voice python3 {baseDir}/scripts/quota.py --usage

# Last 7 days usage python3 {baseDir}/scripts/quota.py --usage --days 7

# JSON output python3 {baseDir}/scripts/quota.py --json ```

Output: ``` šŸ“Š ElevenLabs Quota ======================================= Plan: pro (active) — annual Characters: 66.6K / 500.0K (13.3%) [ā–ˆā–ˆā–ˆā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘ā–‘] Resets: 2026-02-18 (29 days) Voices: 22 / 160 (IVC: āœ“) Pro Voice: 0 / 1 (PVC: āœ“) ```

More Products