Tools
Evolve
Self-improving OpenClaw plugin โ measures, analyzes, and proposes behavioral improvements across a bot network
Install
openclaw plugins install evolve
README
# Evolve
**Self-improving OpenClaw plugin.** Measures, analyzes, and systematically proposes behavioral improvements across a network of OpenClaw bot instances โ without retraining the underlying model.
```
Bot network โ measure โ analyze โ propose โ human approval โ Forge validation โ deploy
โ |
โโโโโโโโโโโโโโโโโ recursive improvement โโโโโโโโโโโโโโโโโโโโโโโโโโโโ
```
## What it does
- **Observes** every turn across all bots, annotating sessions with quality signals
- **Measures** bot performance: task resolution rate, maintenance overhead, reliability
- **Analyzes** patterns weekly, generates specific improvement proposals with evidence
- **Validates** approved proposals in a sandboxed Forge instance before production
- **Dashboards** โ web UI showing network health, scores, and proposal queue
## What it doesn't do
- Modify any bot without human approval
- Retrain or fine-tune the underlying model
- Operate outside its defined scope
- Touch SOUL.md or core identity files
## Repository structure
```
packages/
plugin/ OpenClaw plugin (TypeScript) โ runs in-process, annotates turns
analyzer/ Analysis engine (Python) โ measures, detects patterns, generates proposals
dashboard/ Web UI (served by plugin HTTP route)
scripts/ Deploy, forge-setup, launchd templates
config/ Network topology configuration
docs/ Architecture, getting started, configuration reference
```
## Quick start
```bash
# Install on a bot
openclaw plugins install evolve
# Configure (add to openclaw.json)
# See docs/getting-started.md
# Deploy across a network
bash scripts/deploy.sh --network my-network
```
## Roadmap
| Version | Status | Milestone |
|---------|--------|-----------|
| v0.1 | ๐ง In progress | Plugin skeleton, turn annotation, HTTP status endpoint |
| v0.2 | Planned | Metrics collection, daily measurement, basic scoreboard |
| v0.3 | Planned | Tier classification, session scoring |
| v0.4 | Planned | Analysis engine, proposals, Telegram alerts |
| v0.5 | Planned | Dashboard โ Overview + BotDetail |
| v0.6 | Planned | Forge integration, shadow mode, approval flow |
| v0.7 | Planned | Proposals page, rejection tracking |
| v0.8 | Planned | Multi-bot rollout |
| v0.9 | Planned | Drift detection, cross-bot correlation |
| v1.0 | Planned | Recursive layer, full network operational |
## Design
See `docs/architecture.md` for the full design, including the Tier 1/Tier 2
classification model, the Goodhart's Law mitigation strategy, and the Forge
validation pipeline.
## License
MIT
tools
Comments
Sign in to leave a comment