Files
Christian Nennemann d94688ca1b refactor: trim 11 secondary ArcheFlow skills from 3340 to 952 lines
Remove verbose YAML examples, bash pseudo-code, tutorial prose, and
motivational content from configuration/integration skills while
preserving all operational protocols, reference tables, and rules.

Skills trimmed: domains, colette-bridge, multi-project, cost-tracking,
git-integration, custom-archetypes, workflow-design, templates,
autonomous-mode, progress, presence.
2026-04-06 20:48:50 +02:00

95 lines
3.0 KiB
Markdown

---
name: cost-tracking
description: |
Cost aggregation, budget enforcement, and model selection for ArcheFlow orchestrations.
Tracks per-agent and per-run token costs, enforces budgets, and recommends the cheapest
model that meets quality requirements per archetype and domain.
<example>User: "How much did that orchestration cost?"</example>
<example>Automatically active when budget is configured</example>
---
# Cost Tracking -- Budget-Aware Orchestration
Tracks costs per agent and per run, enforces budgets, and selects cost-optimal models.
## Model Pricing
| Model | Input ($/M tok) | Output ($/M tok) |
|-------|----------------:|-----------------:|
| claude-opus-4-6 | 15.00 | 75.00 |
| claude-sonnet-4-6 | 3.00 | 15.00 |
| claude-haiku-4-5 | 0.80 | 4.00 |
**Prompt caching:** 90% discount on cached input tokens. Structure system prompts for cache hits.
**Batches API:** 50% discount. Use for non-time-sensitive bulk ops.
## Cost Calculation
```
cost = (input - cache_read) * input_price/1M
+ cache_read * input_price * 0.10/1M
+ output * output_price/1M
```
If exact tokens unavailable, estimate: `tokens ~= chars / 4`. Mark with `cost_estimated: true`.
## Default Model Assignments
| Archetype | Code | Writing |
|-----------|------|---------|
| Explorer | haiku | haiku |
| Creator | sonnet | sonnet |
| Maker | sonnet | **sonnet** |
| Guardian | haiku | haiku |
| Skeptic | haiku | haiku |
| Sage | sonnet | **sonnet** |
| Trickster | haiku | haiku |
Opus is user-opt-in only (team preset `model_overrides`).
**Resolution order:** team preset override > domain override > archetype default.
## Pre-Agent Cost Estimates
| Archetype | Typical Input | Typical Output |
|-----------|-------------:|---------------:|
| Explorer | 8k | 4k |
| Creator | 12k | 6k |
| Maker | 15k | 12k |
| Guardian | 10k | 3k |
| Skeptic | 8k | 3k |
| Sage | 12k | 4k |
| Trickster | 8k | 4k |
After 10+ runs, use actual averages from `metrics.jsonl` instead.
## Budget Configuration
```yaml
budget:
per_run_usd: 10.00
per_agent_usd: 3.00
daily_usd: 50.00
warn_at_percent: 75
```
Team preset budget overrides global config. No budget = unlimited (costs still tracked).
## Budget Enforcement
**Pre-agent:** Estimate cost. If > remaining budget: stop (autonomous) or warn (attended).
**Post-agent:** Update total. Warn at threshold. Stop if budget exceeded.
## Cost Optimization
1. **Prompt caching:** Stable content first (archetype instructions, voice profiles). Saves 30-50% on input.
2. **Guardian fast-path (A2):** 0 issues = skip remaining reviewers. Saves $0.30-0.80/cycle.
3. **Explorer cache:** Reuse recent research. Saves $0.02-0.05/hit.
4. **Batches API:** For autonomous/overnight review passes (50% discount).
5. **Early termination:** Clean Guardian + clean Maker self-review = skip remaining cycles.
## Daily Cost Tracking
Ledger at `.archeflow/costs/<YYYY-MM-DD>.jsonl`. One line per run with cost, tokens, models, domain. Daily budget enforcement reads this before starting new runs.