jarvis-voice
jarvis-voice
Metallic AI voice persona with TTS and visual transcript styling.
Media ▲ 0 👁 367
kokoro-tts
kokoro-tts
Generate spoken audio from text using the local Kokoro TTS engine.
Media ▲ 0 👁 569
llmwhisperer
llmwhisperer
Extract text and layout from images and PDFs using LLMWhisperer
Media ▲ 0 👁 70
local-stt
local-stt
Local STT with selectable backends - Parakeet (best accuracy) or Whisper.
Media ▲ 0 👁 99
local-whisper
local-whisper
Local speech-to-text using OpenAI Whisper.
Media ▲ 0 👁 76
minimax-tts
minimax-tts
name: minimax-tts.
Media ▲ 0 👁 119
mlx-whisper
mlx-whisper
Local speech-to-text with MLX Whisper
Media ▲ 0 👁 69
moodcast
moodcast
Transform any text into emotionally expressive audio with ambient
Media ▲ 0 👁 87
openai-whisper
openai-whisper
Local speech-to-text with the Whisper CLI (no API key).
Media ▲ 0 👁 710
openai-whisper-api
openai-whisper-api
Transcribe audio via OpenAI Audio Transcriptions API
Media ▲ 0 👁 87
parakeet-mlx
parakeet-mlx
Local speech-to-text with Parakeet MLX (ASR) for Apple Silicon
Media ▲ 0 👁 97
parakeet-stt
parakeet-stt
>-.
Media ▲ 0 👁 53
phone-voice
phone-voice
Connect ElevenLabs Agents to your OpenClaw via phone with Twilio.
Media ▲ 0 👁 76
piper-tts
piper-tts
Local text-to-speech using Piper ONNX voices - fast, private, no cloud
Media ▲ 0 👁 754
plaud-unofficial
plaud-unofficial
Use when accessing Plaud voice recorder data
Media ▲ 0 👁 112
pocket-transcripts
pocket-transcripts
Read transcripts and summaries from Pocket AI
Media ▲ 0 👁 63
pocket-tts
pocket-tts
pocket-tts
Media ▲ 0 👁 74
qwen-tts
qwen-tts
Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice.
Media ▲ 0 👁 135
ringg-voice-agent
ringg-voice-agent
Integrate Ringg AI voice agents with OpenClaw
Media ▲ 0 👁 63
routstr-balance-management
routstr-balance-management
Manage Routstr balance by checking
Media ▲ 0 👁 48
sapi-tts
sapi-tts
Windows SAPI5 text-to-speech with Neural voices.
Media ▲ 0 👁 49
sound-fx
sound-fx
Generate short sound effects via ElevenLabs SFX (text-to-sound).
Media ▲ 0 👁 102
spaces
spaces
Voice-first social spaces where Moltbook agents hang out.
Media ▲ 0 👁 76
transcribe
transcribe
Transcribe audio files to text using local Whisper (Docker).
Media ▲ 0 👁 98