Files
Christian Nennemann 4f8e2a9962 feat: add run replay for archetype effectiveness analysis
- archeflow-decision.sh records decision points during runs
- archeflow-replay.sh: timeline, whatif, compare commands
- What-if replay with adjustable archetype weights
- /af-replay skill for interactive use
- Tests in archeflow-replay.bats
2026-04-06 21:43:29 +02:00

156 lines
7.8 KiB
Markdown

# ArcheFlow — Status Log
## 2026-04-06: Run replay (v0.9.0)
- `lib/archeflow-decision.sh` — append `decision.point` (phase, archetype, input, decision, confidence).
- `lib/archeflow-replay.sh``timeline` / `whatif` (weighted archetypes, threshold) / `compare`; optional `--json`.
- Skill `af-replay`, plugin bump, DAG renders `decision.point`, `tests/archeflow-replay.bats`.
## 2026-04-04: Triple Release Sprint (v0.4 → v0.6)
### What happened
Three ArcheFlow PDCA cycles in one session, each using ArcheFlow's own orchestration to develop itself (dogfooding). Each cycle: Explorer→Creator→Maker→Guardian+Skeptic+Sage→fixes→merge→push.
### v0.4.0 — Gap Fixes (8 commits, 541 lines, 15 files)
- Unified feedback routing tables across 3 skills (canonical 8-row version)
- Confidence gate with concrete bash parsing, 3 branches (pause/upgrade/mini-Explorer)
- `archeflow-rollback.sh` — post-merge auto-revert with `--mainline 1`
- Test-first validation gate in Do phase (word-boundary patterns)
- Memory injection audit trail (`--audit` flag, `audit-check` command)
- Review fixes: safe jq `--arg`, confidence fallback→0.0, pattern hardening
### v0.5.0 — Infrastructure (8 commits, 483 lines, 12 files)
- Lib script validation at run initialization (0a)
- Hook points documentation (`docs/hooks.md` + config template with 6 events)
- Phase rollback via `--to <phase>` in rollback script
- Per-workflow model assignment configuration
- Cross-run finding regression detection
- Check-phase fleshed out with parallel reviewer spawning protocol
- Review fixes: mutual exclusivity guard, jq --arg everywhere, table-row grep
### v0.6.0 — Quality Polish (5 commits, 253 lines, 13 files)
- Attention-filters expanded from 39-line stub to full skill (prompt templates, token budgets, cycle-back rules, verification checklist)
- Explorer skip heuristic in plan-phase skill
- Agent persona normalization (4 agents: examples, model comments, isolation note)
- Runnable quickstart example (`examples/runnable-quickstart.md`)
- CHANGELOG completed with missing v0.4.0 entry + roadmap version history
### v0.7.0 — Superpowers-Inspired + Strategy Abstraction (8 commits, 485 lines, 20 files)
- Context isolation protocol (attention-filters + all 7 agents)
- Structured status tokens: DONE/DONE_WITH_CONCERNS/NEEDS_CONTEXT/BLOCKED
- Evidence-gated verification: banned phrases, evidence markers, downgrade-to-INFO
- Plan granularity constraint: 2-5 min tasks with file:line + code block + verify
- Strategy abstraction: `pdca` (cyclic) vs `pipeline` (linear) vs `auto` (selected by task)
- README: experimental status + interdisciplinary framing (psychology + process eng + software eng)
- Review fixes: fast→pipeline auto-select, merge guard, evidence check completeness
### Key numbers
| Metric | v0.3 → v0.7 delta |
|--------|-------------------|
| Commits this session | 29 |
| Lines added | ~1,762 |
| Files touched | 30+ |
| Lib scripts | 8 → 9 (archeflow-rollback.sh) |
| Skills | 24 (all fleshed out, no stubs remain) |
| Review cycles | 4 (v0.4: full, v0.5: full, v0.6: fast, v0.7: Guardian-only) |
| Review findings fixed | 15 |
### What to do next
1. **End-to-end dogfood** — run `af-run` on a real task (not ArcheFlow itself) to test both strategies
2. **Hook execution runtime** — config documents 6 hook events but no runner yet
3. **Pipeline strategy testing** — exercise the `--strategy pipeline` path on a bug fix
4. **Publish** — tag v0.7.0, consider claude.com/plugins marketplace listing
5. **GitHub Action** — automated PR review (roadmap item, low effort)
## 2026-04-03: Major Feature Sprint (v0.1 → v0.3)
### What happened
Single-session sprint that took ArcheFlow from 9 skills + 2 scripts to **24 skills + 8 scripts**. Driven by dogfooding: we wrote a short story ("Der Huster", Giesing Gschichten) using ArcheFlow to orchestrate the creative writing process, and every gap we hit became a feature.
### Commits (chronological)
```
1753e69 feat: process logging with DAG-based event sourcing
b6df3d1 feat: automated PDCA loop, domain adapters, cost tracking, DAG renderer
19f8f76 feat: memory, convergence, colette bridge, templates, progress, effectiveness, git integration
6bd2c93 feat: archeflow-init.sh template gallery script
ef995fd feat: archeflow-git.sh for per-phase commits and rollback
ee5dfa7 feat: multi-project orchestration with dependency DAG and shared budget
9faea1d feat: progress and effectiveness scoring scripts
9e22ff5 docs: rewrite README, CHANGELOG, skill index, roadmap
9bf64fc fix: input validation for event emitter + test report (42/42 pass)
```
### What's production-ready
- All 8 lib scripts pass validation (42/42 tests, see docs/test-report-2026-04-03.md)
- README fully rewritten with all 24 skills documented
- CHANGELOG covers v0.1 → v0.3
- Plugin manifest updated to v0.3.0
- Event emitter has input validation (JSON + parent format)
### New features by category
**Core Orchestration:**
- `archeflow:run` — single-command PDCA with --start-from, --dry-run
- `archeflow:act-phase` — structured review→fix pipeline
- `archeflow:artifact-routing` — inter-phase artifact protocol
**Process Intelligence:**
- `archeflow:process-log` — event-sourced JSONL with DAG parents
- `archeflow:memory` — cross-run learning from recurring findings
- `archeflow:effectiveness` — per-archetype signal-to-noise scoring
- `archeflow:progress` — live progress.md during runs
- `archeflow:convergence` — oscillation detection + early termination
**Integration:**
- `archeflow:colette-bridge` — auto-inject voice profiles, personas, characters
- `archeflow:git-integration` — branch-per-run, commit-per-phase, rollback
- `archeflow:multi-project` — cross-repo orchestration with dependency DAG
**Configuration:**
- `archeflow:domains` — writing/code/research domain adapters
- `archeflow:cost-tracking` — budget enforcement + model tier recommendations
- `archeflow:templates` — workflow gallery with init/save/share
### Dogfood project: Giesing Gschichten
- Voice profile, persona, series config in Colette
- First story "Der Huster" (~6000 words) — full PDCA cycle
- All process artifacts: research, outline, reviews, event log, process report
- In `book.giesing-gschichten/` (parent repo) + `writing.colette/` (Colette repo)
### Done (late session)
- **Global hook** — `.claude/settings.json` with SessionStart hook, CLAUDE.md updated with ArcheFlow section
- **Template bundles** — 4 bundles shipped: writing-short-story, backend-feature, security-review, quick-fix (22 files, +936 lines)
- **Multi-project examples** — giesing + fullstack examples + examples/README.md
- **Default config** — `.archeflow/config.yaml` with all documented options
- **Production hardening** — 42/42 script tests pass, input validation on event emitter
### What to do next
1. **Write story #2** with live event logging (true dogfood of `archeflow:run` + `archeflow:progress`)
2. **Colette integration test** — test colette-bridge with actual `colette write` commands
3. **Multi-project run** — test cross-repo orchestration using `examples/multi-project-giesing.yaml`
4. **Publish** — consider making the repo public for others to use
5. **MCP server** — ArcheFlow as MCP tool for Cursor/Windsurf (future)
### Architecture snapshot
```
archeflow/
├── agents/ 7 archetype personas (md)
├── skills/ 24 behavioral skills (md)
├── lib/ 8 bash utilities (sh)
├── examples/ workflows, teams, archetypes
├── hooks/ session-start auto-activation
├── docs/ roadmap, test report
├── CHANGELOG.md v0.1 → v0.3
└── README.md full reference
```
### Key numbers
| Metric | Value |
|--------|-------|
| Skills | 24 |
| Agents | 7 |
| Lib scripts | 8 |
| Total lines added | ~7,600 |
| Tests passed | 42/42 |
| Version | 0.3.0 |