Media
audio-transcribe
Auto-transcribe voice messages using faster-whisper (local, no API key needed).
# Audio Transcription Skill
Auto-transcribe voice messages using faster-whisper (local, no API key needed).
## Requirements
```bash
pip install faster-whisper
```
Models download automatically on first use.
## Usage
### Transcribe a file
```bash
python3 /root/clawd/skills/audio-transcribe/scripts/transcribe.py /path/to/audio.ogg
```
### Change model (edit script)
Edit `transcribe.py` and change:
```python
model = WhisperModel('small', device='cpu', compute_type='int8') # Options: tiny, base, small, medium, large-v3
```
## Models
| Model | Size | VRAM/RAM | Speed | Use Case |
|-------|------|----------|-------|----------|
| tiny | 39 MB | ~1 GB | โกโกโก | Quick drafts |
| base | 74 MB | ~1 GB | โกโก | Basic accuracy |
| **small** | **244 MB** | **~2 GB** | **โก** | **Recommended** |
| medium | 769 MB | ~5 GB | ๐ข | Better accuracy |
| large-v3 | 1.5 GB | ~10 GB | ๐ข๐ข | Best accuracy |
## Integration
Clawdbot auto-transcribes incoming voice messages when this skill is enabled.
## Files
- `scripts/transcribe.py` โ Main transcription script
- `SKILL.md` โ This file
media
By
Comments
Sign in to leave a comment