Roadmap

Where buildlog is heading

buildlog is an agentic learning loop for AI coding tools. It captures what your agent gets right and wrong, selects which rules to surface with Thompson Sampling, and measures whether they reduce mistakes. v0.23 ships the core loop end-to-end. What follows is about making it sharper, faster, and cross-project.

Recently Shipped

Feature Version What it does
Bragi v3 v0.22 27 LLM prose detection rules from ACL/COLING, PNAS, GPTZero research
ensure_session() v0.23 buildlog_commit() no longer blocks without manual experiment start. New users can install and commit immediately.
Gauntlet credit system v0.21 Rule citations in gauntlet findings drive posterior updates. Solves credit assignment.
Posterior history v0.21 Every gauntlet credit snapshots alpha/beta/mean per rule. Query convergence over time.
Category-aware RMR v0.21 Repeat detection uses rules_consulted overlap, not just description similarity.
Multi-agent render v0.18 Same rules render to CLAUDE.md, .cursor/rules/, copilot-instructions.md, .windsurf/rules/, .continue/rules/, settings.json.
Global SQLite v0.11 Single database at ~/.buildlog/buildlog.db. WAL mode. Project isolation via SHA-256.
Ambient emission protocol v0.18 Fire-and-forget JSON artifacts to ~/.buildlog/emissions/pending/ for downstream systems.

In Progress

Dashboard visualization fixes (#243)

Four charts need work: rule labels should show human-readable text instead of domain:hash, RMR chart needs a minimum-count filter, error class chart conflates severity with category, rule growth chart is uninformative. All presentation-layer fixes in the marimo notebook.

Session decoupling Phase 2-3 (#237)

Phase 1 shipped in v0.23 (ensure_session()). Remaining:

  • Phase 2: Harden session-start hooks — log failures instead of swallowing them
  • Phase 3: Session-independent RMR — compute directly from mistakes table with time windows, no session context required

Planned

Track-and-Stop (#236)

Optimal stopping for rule convergence. Rules that have converged (posterior CI width below threshold) stop consuming gauntlet citations. Based on Garivier & Kaufmann (2016). Frees exploration budget for uncertain rules. Includes schema migration (v8), convergence indicators in the dashboard, and integration with the gauntlet credit system.

Cross-project convergence

Detect rules independently rediscovered across projects. Track cross-project salience. When the same pattern surfaces in three repos, that's a signal. Builds on the global SQLite backend and the emission protocol.

Emergent rule graphs

Cluster embeddings into concept nodes. Derive edges from co-occurrence and bandit correlation. Contextual bandits with embedding-space context vectors (LinUCB). The embedding persistence layer is already shipped via qortex; this is the graph structure on top.

L-MVA: Learned Minimal Viable Agent (qortex#96)

Outer bandit that discovers the minimum token budget maintaining quality, per task, per domain. Inner bandit picks rules under that budget. Both update from the same reward signal.

  • #243 — Dashboard visualization bugs (4 charts)
  • #237 — Session decoupling (Phase 2-3)
  • #236 — Track-and-Stop optimal stopping
  • #100 — sqlite-vec + emergent rule graphs
  • #87 — qortex knowledge graph integration
  • qortex#96 — L-MVA (BudgetLearner)
  • qortex#63 — Production deployment (triple interface)