Learning Module¶
Always-on, zero-config active learning that discovers which tools and prompt sections actually help your agent.
The Problem¶
AI agent prompts grow over time, but nobody knows what's working:
- Prompt bloat — Tools, memories, and instructions accumulate without review
- No signal — You can't tell which prompt sections the model actually uses
- Token waste — Unused context burns tokens on every request
- Manual tuning — Prompt optimization is guesswork without data
OpenClaw Learning solves this automatically. From the first request, it traces which components the model references and builds statistical posteriors that reveal what helps vs. what hurts.
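Conceptually, the "statistical posteriors" above can be modeled as per-component success/failure counters under a Beta-Bernoulli model. The sketch below is a minimal illustration of that idea, assuming a uniform Beta(1, 1) prior; the `ArmPosterior` class and its fields are hypothetical and are not OpenClaw's actual data model or API.

```python
from dataclasses import dataclass

@dataclass
class ArmPosterior:
    """Beta-Bernoulli posterior over 'this prompt component helped'.

    Hypothetical illustration, not OpenClaw's internal representation.
    """
    successes: int = 0  # requests where the component was used and the task succeeded
    failures: int = 0   # requests where it was used and the task failed

    def record(self, helped: bool) -> None:
        if helped:
            self.successes += 1
        else:
            self.failures += 1

    @property
    def mean(self) -> float:
        # Posterior mean of Beta(1 + s, 1 + f) under a uniform prior
        return (1 + self.successes) / (2 + self.successes + self.failures)

# Trace outcomes for two prompt sections
tools = ArmPosterior()
for helped in [True, True, True, False]:
    tools.record(helped)

memories = ArmPosterior()
for helped in [False, False, True]:
    memories.record(helped)

print(round(tools.mean, 2), round(memories.mean, 2))  # → 0.67 0.4
```

As outcomes accumulate, the posterior means separate the components that help (here, `tools`) from those that hurt (`memories`), which is the signal the learning layer reports.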
Key Features¶
| Feature | Description |
|---|---|
| Always-on tracing | Captures arm outcomes from the first request |
| Thompson Sampling | Bayesian bandit that balances exploration and exploitation |
| Two-phase operation | Passive (observe only) or Active (optimize prompts) |
| Baseline A/B | Counterfactual evaluation with configurable baseline rate |
| Token savings | Measures actual token reduction vs. full-prompt baseline |
| Multiple interfaces | CLI, Gateway dashboard, REST API |
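The Thompson Sampling row above refers to a standard Bayesian bandit technique: draw one sample from each arm's posterior and keep the arm with the highest draw. The sketch below shows the generic algorithm under a Beta-Bernoulli model; the arm names and `(successes, failures)` counts are made-up example data, and this is not OpenClaw's actual implementation.

```python
import random

def thompson_pick(posteriors):
    """Sample a win rate from each arm's Beta posterior; return the argmax.

    Arms with few observations have wide posteriors, so they still win
    occasionally (exploration); well-measured strong arms win most of the
    time (exploitation).
    """
    best_arm, best_draw = None, -1.0
    for arm, (successes, failures) in posteriors.items():
        draw = random.betavariate(1 + successes, 1 + failures)
        if draw > best_draw:
            best_arm, best_draw = arm, draw
    return best_arm

random.seed(0)  # reproducible demo
posteriors = {
    "tool_descriptions": (40, 10),  # strong arm: ~80% observed win rate
    "old_memories": (5, 15),        # weak arm: rarely picked, but not never
    "style_guide": (1, 1),          # near-uniform posterior: explored often
}
picks = [thompson_pick(posteriors) for _ in range(1000)]
print(picks.count("tool_descriptions") > picks.count("old_memories"))  # → True
```

Over many requests, the strong arm dominates the selections while weak arms are still sampled often enough to detect if they improve, which is the exploration/exploitation balance the table describes.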
Quick Start¶
```shell
# Check learning layer status
openclaw learning status

# Open the live dashboard
openclaw learning dashboard

# Export posteriors and traces
openclaw learning export --format json
```
Architecture¶
Next Steps¶
- Installation — Verify and configure
- Quick Start — 8-step walkthrough
- Core Concepts — Arms, posteriors, phases
- CLI Reference — All commands documented
- Thompson Sampling — How the bandit works