v1.0.0

Sag

Peter Steinberger

ElevenLabs text-to-speech with a macOS-style say UX.

Downloads: 5.9k
Stars: 7
Versions: 1
Updated: 2026-02-23

Install

npx clawhub@latest install sag

Documentation

sag

Use sag for ElevenLabs TTS with local playback.

API key (required)

  • ELEVENLABS_API_KEY (preferred)
  • SAG_API_KEY (also supported by the CLI)
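
A preflight check can catch a missing key before any audio is requested. The sketch below reports which of the two variables is set without printing its value. The helper name sag_key_source is hypothetical and not part of the sag CLI, and checking ELEVENLABS_API_KEY first is an assumption based on it being listed as preferred.

```shell
# Hypothetical helper (not part of sag): report which key variable is set.
# Assumption: ELEVENLABS_API_KEY is checked first because it is the preferred one.
sag_key_source() {
  if [ -n "${ELEVENLABS_API_KEY:-}" ]; then
    echo "ELEVENLABS_API_KEY"
  elif [ -n "${SAG_API_KEY:-}" ]; then
    echo "SAG_API_KEY"
  else
    # Neither variable is set: explain on stderr and signal failure.
    echo "no API key set: export ELEVENLABS_API_KEY or SAG_API_KEY" >&2
    return 1
  fi
}
```

Usage: run sag_key_source || exit 1 at the top of a script that calls sag.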

Quick start

  • sag "Hello there"
  • sag speak -v "Roger" "Hello"
  • sag voices
  • sag prompting (model-specific tips)

Model notes

  • Default: eleven_v3 (expressive)
  • Stable: eleven_multilingual_v2
  • Fast: eleven_flash_v2_5

Pronunciation + delivery rules

  • First fix: respell (e.g. "key-note"), add hyphens, adjust casing.
  • Numbers/units/URLs: --normalize auto (or off if it harms names).
  • Language bias: --lang en|de|fr|... to guide normalization.
  • v3: SSML <break> not supported; use [pause], [short pause], [long pause].
  • v2/v2.5: SSML <break time="1.5s" /> supported; <phoneme> not exposed in sag.
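
Because pause syntax differs between model families, text written for v2/v2.5 needs its breaks rewritten before it is sent to v3. The sketch below shows one way to do that. Both helper names are hypothetical, and mapping every break to [pause] regardless of its duration is a simplifying assumption.

```shell
# Hypothetical helper: print the pause syntax that fits a given model ID.
pause_markup() {
  case "$1" in
    eleven_v3)
      echo '[pause]' ;;
    eleven_multilingual_v2|eleven_flash_v2_5)
      echo '<break time="1.5s" />' ;;
    *)
      echo "unknown model: $1" >&2
      return 1 ;;
  esac
}

# Hypothetical helper: read text on stdin and replace each SSML <break ...>
# tag with [pause]. Assumption: the break duration is ignored.
ssml_to_v3() {
  sed -E 's/<break[^>]*>/[pause]/g'
}
```

Usage: printf '%s\n' 'Wait <break time="1.5s" /> now' | ssml_to_v3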

v3 audio tags (put at the start of a line)

  • [whispers], [shouts], [sings]
  • [laughs], [starts laughing], [sighs], [exhales]
  • [sarcastic], [curious], [excited], [crying], [mischievously]
  • Example: sag "[whispers] keep this quiet. [short pause] ok?"
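
Since tags belong at the start of a line, multi-line text can be tagged mechanically. The sketch below prefixes every non-empty line with one tag. The helper name tag_lines is hypothetical, and it assumes the tag is a plain word or phrase with no "/" or "&" in it.

```shell
# Hypothetical helper: prefix each non-empty stdin line with [TAG].
# Assumption: TAG contains no "/" or "&", which sed would treat specially.
tag_lines() {
  sed "s/^..*\$/[$1] &/"
}
```

Usage: printf 'keep this quiet\nok?\n' | tag_lines whispers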

Voice defaults

  • ELEVENLABS_VOICE_ID or SAG_VOICE_ID

Confirm voice + speaker before long output.
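
One way to support that confirmation step is to print the configured voice and the text length before synthesizing. This is a sketch only: both helper names are hypothetical, and checking ELEVENLABS_VOICE_ID before SAG_VOICE_ID is an assumption, since the order in which sag resolves them is not stated here.

```shell
# Hypothetical helper: print the configured default voice ID, or fail if none.
# Assumption: ELEVENLABS_VOICE_ID is checked before SAG_VOICE_ID.
default_voice() {
  if [ -n "${ELEVENLABS_VOICE_ID:-}" ]; then
    echo "$ELEVENLABS_VOICE_ID"
  elif [ -n "${SAG_VOICE_ID:-}" ]; then
    echo "$SAG_VOICE_ID"
  else
    return 1
  fi
}

# Hypothetical helper: summarize voice and text length for a human to confirm.
confirm_summary() {
  voice=$(default_voice) || voice="(unset)"
  echo "voice=$voice chars=${#1}"
}
```

Usage: confirm_summary "$long_text" prints a single line to review before running sag.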

Chat voice responses

When Peter asks for a "voice" reply (e.g., "crazy scientist voice", "explain in voice"), generate audio and send it:

Generate the audio file:

sag -v Clawd -o /tmp/voice-reply.mp3 "Your message here"

Then include this line in the reply:

MEDIA:/tmp/voice-reply.mp3
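
The two steps above can be wrapped in one function so the MEDIA line is only printed when synthesis succeeds. This is a minimal sketch: voice_reply is a hypothetical name, and the -v and -o flags are used exactly as in the command above.

```shell
# Hypothetical wrapper: voice_reply TEXT [VOICE] [OUTFILE]
# Synthesizes TEXT to OUTFILE with sag, then prints the MEDIA line.
voice_reply() {
  text=$1
  voice=${2:-Clawd}
  outfile=${3:-/tmp/voice-reply.mp3}
  # Stop without printing a MEDIA line if sag reports an error.
  sag -v "$voice" -o "$outfile" "$text" || return 1
  echo "MEDIA:$outfile"
}
```

Usage: voice_reply "[excited] It works!" prints MEDIA:/tmp/voice-reply.mp3 once the file is written.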

Voice character tips:

  • Crazy scientist: Use [excited] tags, dramatic pauses [short pause], vary intensity
  • Calm: Use [whispers] or slower pacing
  • Dramatic: Use [sings] or [shouts] sparingly
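
These tips can be applied as a small lookup from a style name to a leading tag. The sketch below is illustrative only: style_text and the style names are hypothetical, and each style uses just one of the tags suggested above.

```shell
# Hypothetical helper: style_text STYLE TEXT
# Prints TEXT with a leading v3 audio tag chosen from the tips above.
style_text() {
  case "$1" in
    crazy-scientist) printf '%s\n' "[excited] $2" ;;
    calm)            printf '%s\n' "[whispers] $2" ;;
    dramatic)        printf '%s\n' "[shouts] $2" ;;
    # Unknown styles pass the text through unchanged.
    *)               printf '%s\n' "$2" ;;
  esac
}
```

Usage: sag -v Clawd -o /tmp/voice-reply.mp3 "$(style_text calm "All systems are fine.")"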

Default voice for Clawd: lj2rcrvANS3gaWWnczSX (or just -v Clawd)
