transcribe

Name: transcribe
Rating: 3.5 (1 reviews)
Author: javicasper

By javicasper 👁 153 views ▲ 0 votes

Transcribe audio files to text using local Whisper (Docker).

GitHub

---
name: transcribe
description: Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.
---

# Transcribe

Local audio transcription using faster-whisper in Docker.

## Installation

```bash
cd /path/to/skills/transcribe/scripts
chmod +x install.sh
./install.sh
```

This builds the Docker image `whisper:local` and installs the `transcribe` CLI.

## Usage

```bash
transcribe /path/to/audio.mp3 [language]
```

- Default language: `es` (Spanish)
- Use `auto` for auto-detection
- Outputs plain text to stdout

## Examples

```bash
transcribe /tmp/voice.ogg          # Spanish (default)
transcribe /tmp/meeting.mp3 en     # English
transcribe /tmp/audio.m4a auto     # Auto-detect
```

## Supported Formats

mp3, m4a, ogg, wav, webm, flac, aac

## When Receiving Voice Messages

1. Save the audio attachment to a temp file
2. Run `transcribe <path>`
3. Include the transcription in your response
4. Clean up the temp file

## Files

- `scripts/transcribe` - CLI wrapper (bash)
- `scripts/install.sh` - Installation script (includes Dockerfile inline)

## Notes

- Model: `small` (fast) - edit install.sh for `large-v3` (accurate)
- Fully local, no API key needed

media

Comments

Loading comments...

More by javicasper

sound-fx

Generate short sound effects via ElevenLabs SFX (text-to-sound).

Similar Skills in Media

ace-music

Generate AI music using ACE-Step 1.5 via ACE Music's free API.

ai-video-remix

AI-driven video remix from local library using ShotAI.

YouTube Transcript Generator

Download and generate clean, readable transcripts from any YouTube video. Extracts subtitles (auto-generated or manual),

documents-ai

Real-time OCR and data extraction API by Veryfi.

dellight-cro-revenue-ops

DELLIGHT.AI is an AI startup in DIFC, Dubai.

brw-de-ai-ify

Remove AI-generated jargon and restore human voice to text.