Commit Graph

50 Commits

Author SHA1 Message Date
f9afca0c58 refactor: trim act-phase skill from 371 to 140 lines
Remove duplicated routing tables, verbose JSON event examples,
writing/prose domain template (belongs in domains/colette-bridge),
--start-from section (belongs in run skill), and redundant checklist.
Consolidate three Agent() templates into one compact template.
Preserve all routing rules, decision logic, and feedback format.
2026-04-06 20:39:34 +02:00
e19ff0acc3 refactor: refocus ArcheFlow as workspace orchestrator, not feature-dev competitor
- README: lead with af-sprint (parallel multi-project), af-review (post-impl quality)
- Sprint skill: L/XL code tasks use feature-dev style (explore→plan→impl→self-review)
  instead of PDCA. Reserve PDCA for writing/research domains.
- Session start: route to af-sprint/af-review/af-run based on task type
- Explicitly state: for single-feature dev, use feature-dev plugin instead
2026-04-04 18:44:18 +02:00
1bf1376a80 feat: implement archeflow-review.sh for Guardian-only diff review
Standalone bash script that extracts git diffs for af-review without
PDCA orchestration. Supports --branch, --commit, and uncommitted modes.
Reports stats (files/lines changed) to stderr, diff to stdout.
2026-04-04 18:39:06 +02:00
6309614bfa feat: add sprint runner and review-only skills 2026-04-04 18:21:19 +02:00
6854e858a4 fix: address v0.7.0 review findings
- Auto-select: fast workflow now maps to pipeline strategy (was falling through to pdca)
- Evidence validation: check for missing evidence markers, not just banned phrases
- Remove sed-based artifact mutation (avoids table row corruption), track downgrades in events only
- Pipeline verify: explicit merge guard prevents merging before tests/re-review pass
2026-04-04 09:36:05 +02:00
44f0896e3c docs: update CHANGELOG and version for v0.7.0 2026-04-04 09:36:05 +02:00
29762a8464 feat: add strategy abstraction with pdca and pipeline strategies 2026-04-04 09:36:05 +02:00
a6dcd2c956 feat: add plan granularity constraint to plan-phase and creator 2026-04-04 09:36:05 +02:00
516fe11710 feat: add evidence-gated verification to check phase and reviewers 2026-04-04 09:36:05 +02:00
f10e853d8e feat: add structured status tokens to all agents and run skill 2026-04-04 09:36:05 +02:00
eabf13b9b0 feat: add context isolation protocol to attention-filters and all agents 2026-04-04 09:36:05 +02:00
57e95ba151 docs: add v0.4.0 changelog, update to v0.6.0 2026-04-04 08:51:19 +02:00
3c7d336c93 feat: add Explorer skip heuristic to plan-phase skill 2026-04-04 08:51:19 +02:00
12575b5a47 feat: expand attention-filters from stub to full skill 2026-04-04 08:51:19 +02:00
362fb9ada9 fix: address v0.5.0 review findings
- Add --to/--test-cmd mutual exclusivity guard in rollback script
- Convert all jq string interpolation to --arg (cmd_extract, cmd_inject, cmd_forget)
- Fix CRITICAL/WARNING grep to match table rows only (not prose)
- Add thorough+cycle-1 guard to fast-path bash snippet in check-phase
- Clarify prev_run_id selection comment (tail -1 = most recent non-current)
2026-04-04 08:44:16 +02:00
c3f5df8161 docs: update CHANGELOG and version for v0.5.0 2026-04-04 08:44:16 +02:00
c5174e88eb feat: flesh out check-phase with parallel reviewer spawning protocol 2026-04-04 08:44:16 +02:00
5e2117c9be feat: add cross-run finding regression detection 2026-04-04 08:44:16 +02:00
30ddc6a2c4 feat: add per-workflow model assignment configuration 2026-04-04 08:44:16 +02:00
008315b0c4 feat: add lib script validation at run initialization 2026-04-04 08:44:16 +02:00
d9ec148bb3 fix: address review findings (rollback mainline, audit flag, confidence gate, test pattern, jq args)
- git revert uses --mainline 1 for merge commits
- inject dispatch passes all args so --audit flag is reachable
- confidence gate defaults to 0.0 (triggers gate) instead of 0.7 (bypasses)
- test-first grep uses word-boundary patterns to avoid false positives
- jq uses --arg instead of string interpolation for safe filtering
2026-04-04 08:30:43 +02:00
dd82944529 feat: add memory injection audit trail for effectiveness tracking 2026-04-04 08:30:31 +02:00
8af9db2c12 feat: add test-first validation gate in Do phase 2026-04-04 08:30:31 +02:00
7f99d52a09 feat: add archeflow-rollback.sh for post-merge test failure auto-revert 2026-04-04 08:30:31 +02:00
34f101c166 feat: specify confidence gate parsing and mini-Explorer spawning in run skill 2026-04-04 08:30:31 +02:00
960aba5faa feat: clarify worktree merge flow with explicit git commands in run skill 2026-04-04 08:30:31 +02:00
2247e52ae4 fix: unify feedback routing tables across orchestration, act-phase, artifact-routing 2026-04-04 08:30:31 +02:00
6bc5e48357 feat: flesh out do-phase skill from stub to full execution protocol 2026-04-03 12:25:29 +02:00
6b0a9b7b90 feat: add presence skill — visible value, not noise
Defines how ArcheFlow communicates: compact status lines per phase,
show outcomes not mechanics, silence for clean passes. One-line
activation indicator at session start.
2026-04-03 12:18:18 +02:00
9e22ff5822 docs: rewrite README, add CHANGELOG, update skill index and roadmap
- README: complete rewrite with all 24 skills, 8 scripts, architecture overview
- CHANGELOG: v0.1.0 → v0.2.0 → v0.3.0 with full feature history
- using-archeflow: updated skill reference to all 24 skills in 6 categories
- plugin.json: version bump to 0.3.0
- roadmap: updated with v0.3.0 features
2026-04-03 12:05:53 +02:00
ee5dfa70b8 feat: add multi-project orchestration with dependency DAG and shared budget 2026-04-03 11:41:06 +02:00
19f8f76232 feat: add memory, convergence, colette bridge, templates, progress, effectiveness, git integration
- skills/memory: cross-run learning from recurring findings + lib/archeflow-memory.sh
- skills/convergence: oscillation detection + early termination in multi-cycle runs
- skills/colette-bridge: auto-inject voice profiles, personas, characters from colette.yaml
- skills/templates: workflow/team/archetype gallery with init/save/share
- skills/progress: live .archeflow/progress.md during runs
- skills/effectiveness: per-archetype signal-to-noise + cost efficiency scoring
- skills/git-integration: auto-branch per run, commit per phase, rollback support
2026-04-03 11:40:04 +02:00
b6df3d19fd feat: add automated PDCA loop, domain adapters, cost tracking, DAG renderer
- skills/run: automated PDCA execution loop with --start-from, --dry-run
- skills/artifact-routing: inter-phase artifact protocol with context injection
- skills/act-phase: structured review→fix pipeline with cycle feedback
- skills/domains: domain adapter system (writing, code, research)
- skills/cost-tracking: per-agent cost estimation, budget enforcement
- lib/archeflow-dag.sh: ASCII DAG renderer from JSONL events
- lib/archeflow-report.sh: updated with DAG section, cycle diff, --dag/--summary flags
2026-04-03 11:20:14 +02:00
1753e69a9f feat: add process logging with DAG-based event sourcing
Event-sourced orchestration logging: JSONL events with parent
relationships form a DAG for causal reconstruction of agent flows.
Includes bash event emitter (jq-based) and markdown report generator.
2026-04-03 11:06:02 +02:00
8dec44d199 chore: consolidate plugin for sharing
- Add .gitignore (ignore .archeflow/ runtime state)
- Move custom archetype examples from .archeflow/ to examples/
- Remove onboarding skill (covered by using-archeflow + README)
- Remove internal planning doc
- Clean roadmap (remove tool.archeflow references)
- Simplify session-start hook config
- Polish README for external users
2026-04-03 07:29:53 +02:00
5eefa309cb fix: redesign adaptation rules per Guardian review — resolve race conditions
Guardian review (Sprint 1) found critical issues in adaptation rules:
- A2 couldn't skip already-spawned parallel reviewers (race condition)
- A1 escalated then A2 skipped the escalated reviewers (self-defeating)
- A3 "Pause" conflicted with parallel agent execution
- A1's "thorough" override was unreachable dead code

Fixes:
- Reorder rules: A3 (after Plan) → A2 (after Guardian) → A1 (after Check)
- Guardian now runs first in Check phase, A2 evaluated before spawning others
- A2 doesn't apply to escalated workflows or first cycle of thorough
- A3 task-understanding pause runs before Do phase (no cancellation needed)
- Added explicit evaluation order diagram
2026-04-03 06:25:27 +02:00
8755d68dc9 fix: address Guardian review findings — sync Creator agent, wire hooks, add cost table
- Sync agents/creator.md output format with plan-phase skill (3-axis confidence,
  alternatives considered, mini-reflect) — fixes adaptation rule A3 dependency
- Wire hook points into orchestration Act phase (pre-merge, post-merge)
- Add cost-per-agent-tier table to autonomous-mode for budget scheduling
- Add team preset loading reference to orchestration Step 0
2026-04-03 06:23:47 +02:00
761d64b821 feat: DX, extensibility, and adaptive intelligence — completes 25-feature backlog
Sprint 5 — Developer Experience (D1-D5):
- Progress indicators with emoji phase markers during orchestration
- Dry-run mode: preview workflow without executing
- /archeflow:status and /archeflow:history commands
- Onboarding skill for first-time users

Sprint 6 — Extensibility (E1-E4, A4-A6):
- Archetype composition: combine 2 archetypes into super-reviewer
- Team presets: save common team configs in .archeflow/teams/
- Hook points: pre-plan, post-check, pre-merge, post-merge custom validation
- Workflow template library: api-design, migration, dep-upgrade, docs, hotfix
- Reviewer profiles: per-project config of which reviewers run
- Explorer cache: skip redundant research on recently-explored code
- Learning from history: track archetype usefulness, recommend profile changes
2026-04-03 06:20:44 +02:00
83e09b70f2 feat: workflow intelligence, quality loop, completion promises, parallel teams
Sprint 1 — Workflow Intelligence (A1-A3):
- Conditional escalation: fast→standard on 2+ CRITICALs
- Guardian fast-path: skip remaining reviewers on clean pass
- Confidence-triggered escalation: pause/upgrade/probe on low scores

Sprint 2 — Quality Loop (B1-B2, B5-B6):
- Maker self-review checklist before submitting to Check phase
- Proposal diff ("What Changed") on cycle 2+ revisions
- Convergence detection: escalate to user if same finding persists 2 cycles
- Cross-archetype dedup: merge duplicate findings from different reviewers

Sprint 3 — Completion & Verification (B3-B4):
- Completion promise: user-defined done criteria checked in Act phase
- Post-merge verification: run tests on main, auto-revert on failure

Sprint 4 — Parallel & Scale (C1-C4):
- Parallel team orchestration: 2-3 independent teams with merge gate
- Task dependency graph in autonomous queue format
- Auto-resume on interruption via .archeflow/state.json
- Budget-aware scheduling with automatic workflow downgrade
2026-04-03 06:17:53 +02:00
5139f1ad89 feat: Ralph Loop integration — mini-reflect, alternatives, structured confidence
- Mini-Reflect for fast workflow: Creator must restate task, list assumptions,
  name highest-damage risk before proposing (catches misunderstandings early)
- Alternatives Considered section: Creator must evaluate 2+ approaches with
  rejection rationale before committing to one (prevents tunnel vision)
- Structured confidence scoring: 3-axis table (task understanding, solution
  completeness, risk coverage) replaces bare 0.0-1.0 number. Low scores
  trigger targeted action (clarify, upgrade workflow, or research)
- Mini-Reflect fallback for skipped tasks: quick reflection even when
  ArcheFlow doesn't activate (non-trivial single-file changes)
2026-04-03 06:08:01 +02:00
d08dc657d1 feat: core improvements — feedback loop, attention filters, shadow heuristics, metrics, auto-activation
- Cross-cycle feedback protocol with structured finding format, routing, and resolution tracking
- Attention filter enforcement: explicit context include/exclude per archetype
- Shadow detection: quantitative checklists with concrete thresholds
- Orchestration metrics: per-phase timing, agent count, findings summary
- Autonomous mode wiring: checkpoint protocol, session log, stop conditions
- Auto-activation: SessionStart hook fires ArcheFlow for implementation tasks without user config
- Emoji avatars for all 7 archetypes
- Standardized finding format across all reviewers for cross-cycle tracking
- Persisted implementation plan in docs/
2026-04-03 06:02:10 +02:00
ebe943a67e refactor: realistic shadows for Claude behavior
- Creator: Perfectionist → Over-Architect (Claude over-designs, doesn't endlessly revise)
- Maker: Cowboy → Rogue (same behavior, better name)
- Trickster: Saboteur → False Alarm (Claude floods with noise, doesn't sabotage)
2026-04-02 18:50:40 +00:00
60cdedeec1 chore: remove memory skill — duplicates native Claude Code CLAUDE.md and MEMORY.md 2026-04-02 18:38:29 +00:00
cccaf86995 refactor: replace 3-Sets diagnostic with focused attention filters and memory
The 3-Sets framework doesn't transfer well to agents — all three
dimensions are fully visible and controllable config, not hidden
human psychology. Removed the branding, kept the practical bits:

- attention-filters: what context each archetype receives (token savings)
- memory: persistent learnings across orchestrations (project knowledge)
2026-04-02 18:36:10 +00:00
ed821097de feat: add 3-Sets agent diagnostic and attention filters
New skill: agent-diagnostic — applies the 3-Sets framework
(Tool-Set, Skill-Set, Mind-Set) to agent orchestration:

- Pre-orchestration diagnostic: check each agent's configuration
  across three dimensions, fix the weakest set first
- Chain principle: weakest set caps output (Opus + bad prompt = waste)
- Alignment principle: modest aligned agents beat excellent misaligned ones
- Attention filters: each archetype reads only relevant artifacts
- Post-orchestration learning: extract learnings to persistent memory
  structured by the three sets

Based on the 3-Sets Method diagnostic framework.
2026-04-02 18:32:18 +00:00
f7f3c84b41 refactor: trim phase skills to reduce token waste
- check-phase: 831 → 204 words (remove protocol duplication with agents)
- plan-phase: 521 → 143 words (keep output formats, drop behavioral text)
- do-phase: 408 → 113 words (keep output format + critical commit rule)
- Total plugin: 9830 → 8269 words (~2000 tokens saved)
2026-04-02 18:24:50 +00:00
87183fc2a4 refactor: one shadow per archetype, trim bootstrap skill
- Consolidate to single shadow per archetype (fold best bits from
  dropped shadows into the remaining one)
- Trim bootstrap skill from 515 to 254 words (~50% token reduction)
- Remove redundant shadow table from bootstrap (already in archetype table)
2026-04-02 18:22:58 +00:00
5cc3d67718 feat: add virtues and second shadows to all archetypes
Each archetype now has the full Jungian triad:
- Virtue: the unique contribution (what makes it worth including)
- Shadow 1: primary dysfunction (strength pushed too far)
- Shadow 2: complementary dysfunction (different failure mode)

Virtues: Contextual Clarity, Decisive Framing, Execution Discipline,
Threat Intuition, Assumption Surfacing, Adversarial Creativity,
Maintainability Judgment.

New shadows: Catalog Fetish, Over-Architect, Scope Creep, Gatekeeper,
Whataboutist, Scope Escape, Philosopher.
2026-04-02 18:18:29 +00:00
21becd8b58 chore: remove ArcheHelix branding, use plain PDCA language
The archetypes and shadows are distinctive enough — no need for
a fancy name on top of the standard PDCA cycle terminology.
2026-04-02 18:09:28 +00:00
a6fa708f8b feat: ArcheFlow — multi-agent orchestration plugin for Claude Code
Zero-dependency Claude Code plugin using Jungian archetypes as
behavioral protocols for multi-agent orchestration.

- 7 archetypes (Explorer, Creator, Maker, Guardian, Skeptic, Trickster, Sage)
- ArcheHelix: rising PDCA quality spiral with feedback loops
- Shadow detection: automatic dysfunction recognition and correction
- 3 built-in workflows (fast, standard, thorough)
- Autonomous mode: unattended overnight sessions with full visibility
- Custom archetypes and workflows via markdown/YAML
- SessionStart hook for automatic bootstrap
- Examples for feature implementation and security review
2026-04-02 16:37:44 +00:00