audio-transcribe

By aktheknight

# Audio Transcription Skill

Auto-transcribe voice messages using faster-whisper (local, no API key needed).

## Requirements

```bash
pip install faster-whisper
```

Model weights download automatically from the Hugging Face Hub on first use and are cached locally for later runs.

## Usage

### Transcribe a file

```bash
python3 /root/clawd/skills/audio-transcribe/scripts/transcribe.py /path/to/audio.ogg
```
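The contents of `scripts/transcribe.py` are not reproduced in this file. As a rough sketch of what a script like it could look like, the example below loads a model with the same `WhisperModel` arguments shown in this document and joins the segment texts into one transcript. The names `join_texts`, `transcribe`, and `main` are illustrative assumptions, not the actual script's functions.

```python
import sys


def join_texts(texts):
    """Join segment texts into one transcript, dropping empty pieces."""
    cleaned = (t.strip() for t in texts)
    return " ".join(t for t in cleaned if t)


def transcribe(path, model_name="small"):
    """Transcribe one audio file and return (text, detected_language)."""
    # Imported lazily so join_texts() is usable without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_name, device="cpu", compute_type="int8")
    # transcribe() returns a lazy generator of segments plus an info object;
    # the audio is only decoded as the generator is consumed.
    segments, info = model.transcribe(path)
    return join_texts(seg.text for seg in segments), info.language


def main(argv):
    """Hypothetical entry point: expects exactly one audio path argument."""
    if len(argv) != 2:
        print("usage: transcribe.py /path/to/audio.ogg", file=sys.stderr)
        return 2
    text, language = transcribe(argv[1])
    print(f"[{language}] {text}")
    return 0
```

To run this as a script, call `sys.exit(main(sys.argv))` under an `if __name__ == "__main__":` guard. Each segment also carries `start` and `end` timestamps in seconds, which this sketch ignores.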

### Change model (edit script)

Edit `transcribe.py` and change the model name in the `WhisperModel` call:
```python
model = WhisperModel('small', device='cpu', compute_type='int8')  # Options: tiny, base, small, medium, large-v3
```
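If editing the script for every model change is inconvenient, one option is to read the model name from the environment. The sketch below is only an illustration: the `WHISPER_MODEL` variable and the `resolve_model_name` and `load_model` helpers are hypothetical and are not part of the shipped script. The allowed names are limited to the options listed in the comment above.

```python
import os

# Model names listed as options in this document.
VALID_MODELS = ("tiny", "base", "small", "medium", "large-v3")


def resolve_model_name(env=None, default="small"):
    """Return the model name from WHISPER_MODEL (hypothetical), or the default."""
    env = os.environ if env is None else env
    name = env.get("WHISPER_MODEL", default).strip()
    if name not in VALID_MODELS:
        raise ValueError(
            f"unknown model {name!r}; expected one of {', '.join(VALID_MODELS)}"
        )
    return name


def load_model(name=None):
    """Load the chosen model with the same CPU/int8 settings used above."""
    # Imported lazily so resolve_model_name() works without faster-whisper.
    from faster_whisper import WhisperModel

    return WhisperModel(name or resolve_model_name(), device="cpu", compute_type="int8")
```

With this in place, a call such as `WHISPER_MODEL=tiny python3 transcribe.py audio.ogg` would select the `tiny` model without touching the source.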

## Models

| Model | Parameters | VRAM/RAM | Speed | Use Case |
|-------|------------|----------|-------|----------|
| tiny | 39 M | ~1 GB | ⚡⚡⚡ | Quick drafts |
| base | 74 M | ~1 GB | ⚡⚡ | Basic accuracy |
| **small** | **244 M** | **~2 GB** | **⚡** | **Recommended** |
| medium | 769 M | ~5 GB | 🐢 | Better accuracy |
| large-v3 | 1550 M | ~10 GB | 🐢🐢 | Best accuracy |
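As a worked example of reading the table, the sketch below picks the largest model whose approximate memory figure fits a given budget. The numbers are copied from the VRAM/RAM column and are rough guides rather than measured values; `pick_model` is a hypothetical helper, not part of the skill.

```python
# Approximate memory needs in GB, taken from the VRAM/RAM column above,
# ordered from the smallest model to the largest.
MODEL_MEMORY_GB = [
    ("tiny", 1),
    ("base", 1),
    ("small", 2),
    ("medium", 5),
    ("large-v3", 10),
]


def pick_model(available_gb):
    """Return the largest model whose approximate memory need fits the budget."""
    chosen = None
    for name, needed_gb in MODEL_MEMORY_GB:
        if needed_gb <= available_gb:
            chosen = name  # later entries are larger, so keep overwriting
    if chosen is None:
        raise ValueError(f"no model fits in {available_gb} GB")
    return chosen
```

For instance, `pick_model(2)` selects `small`, which matches the recommended default, while a 16 GB budget allows `large-v3`.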

## Integration

Clawdbot auto-transcribes incoming voice messages when this skill is enabled.

## Files

- `scripts/transcribe.py`: Main transcription script
- `SKILL.md`: This file