Run pipeline, write Post 08, commit untracked files

Pipeline:
- Extract ideas for 38 new drafts → 462 ideas total
- Convergence analysis: 132 cross-org convergent ideas (33% rate)
- Fetch authors for 102 drafts → 709 authors (up from 403)
- Refresh gap analysis: 12 gaps across full 474-draft corpus
- Update verified counts with new totals

Post 08:
- Complete rewrite of "Agents Building the Agent Analysis" (2,953 words)
- Covers 3 phases: writing team → review cycle → fix cycle
- Meta-irony table mapping team coordination to IETF gap names
- Specific examples from dev journal (SQL injection, consent conflation, ideas mismatch)

Untracked files committed:
- scripts/: backfill-wg-names, classify-unrated, compare-classifiers, download-relevant-text, run-webui
- src/ietf_analyzer/classifier.py: two-stage Ollama classifier
- src/webui/: analytics (GDPR-compliant), auth, obsidian_export
- tests/test_obsidian_export.py (10 tests)
- data/reports/: wg-analysis, generated draft for gap #37

Housekeeping:
- .gitignore: exclude LaTeX artifacts, stale DBs, analytics.db

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@@ -12,7 +12,7 @@
 | drafts | 434 | Up from 361 after 2026-03-07 fetch |
 | ratings | 434 | 1:1 with drafts |
 | authors | 557 | Unique persons from Datatracker |
-| ideas | 419 | See "Ideas Count History" below |
+| ideas | 462 | Re-extracted 2026-03-08, see "Ideas Count History" below |
 | gaps | 11 | Not 12 -- see gap list below |
 | embeddings | 434 | 1:1 with drafts |
 | draft_authors | 1,057 | Draft-author links |
@@ -79,24 +79,25 @@ Blog posts reference 12 gaps with different names (e.g., "Agent Resource Exhaust

 ## Ideas Count History

-The database currently contains **419 ideas** across **377 drafts**. This is the third different count encountered:
+The database currently contains **462 ideas** across **415 drafts**. This is the fourth count encountered:

 | Source | Count | Date | Likely Explanation |
 |--------|-------|------|-------------------|
 | Blog post 5 filename | 1,262 | ~2026-03-03 | Pre-expansion dataset (260 drafts), before dedup |
 | Blog post 5 text / master stats | 1,780 | ~2026-03-05 | Post-expansion (361 drafts), before dedup |
-| Current database | 419 | 2026-03-08 | After `dedup_ideas` run (0.85 threshold) or re-extraction with different params |
+| Previous database | 419 | 2026-03-08 | After `dedup_ideas` run (0.85 threshold) or re-extraction with different params |
+| Current database | 462 | 2026-03-08 | After re-extraction for 38 drafts missing ideas (474 total drafts, 59 still without ideas) |

 ### Ideas by Type (current DB)

 | Type | Count |
 |------|-------|
-| protocol | 96 |
-| architecture | 95 |
-| extension | 79 |
-| mechanism | 68 |
-| requirement | 42 |
-| pattern | 35 |
+| architecture | 107 |
+| protocol | 106 |
+| extension | 84 |
+| mechanism | 74 |
+| requirement | 47 |
+| pattern | 40 |
 | framework | 3 |
 | format | 1 |

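The totals in this hunk can be cross-checked against the breakdown tables. The following is a minimal Python sketch that uses only numbers appearing in this diff; the variable names are illustrative and not part of the repository. It checks that the per-type counts sum to the reported 419 and 462 ideas, and that the per-draft distribution in the next hunk reproduces 462 ideas across 415 drafts, with 59 of 474 drafts still without ideas.

```python
# Per-type idea counts before and after the re-extraction, copied from the diff.
old_by_type = {"protocol": 96, "architecture": 95, "extension": 79,
               "mechanism": 68, "requirement": 42, "pattern": 35,
               "framework": 3, "format": 1}
new_by_type = {"architecture": 107, "protocol": 106, "extension": 84,
               "mechanism": 74, "requirement": 47, "pattern": 40,
               "framework": 3, "format": 1}

# New ideas-per-draft distribution: {ideas per draft: number of drafts}.
new_per_draft = {1: 370, 2: 43, 3: 2}
drafts_without_ideas = 59

old_total = sum(old_by_type.values())
new_total = sum(new_by_type.values())

# Weight each bucket by its ideas-per-draft value to recover the idea total.
ideas_from_distribution = sum(k * n for k, n in new_per_draft.items())
drafts_with_ideas = sum(new_per_draft.values())
corpus_size = drafts_with_ideas + drafts_without_ideas

# Share of idea-bearing drafts that carry exactly one idea.
single_idea_share = new_per_draft[1] / drafts_with_ideas

print(old_total, new_total)
print(ideas_from_distribution, drafts_with_ideas, corpus_size)
print(f"{single_idea_share:.0%}")
```

All three views agree with the figures quoted in the diff, including the 89% single-idea share, so the new counts are at least internally consistent.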
@@ -104,14 +105,30 @@ The database currently contains **419 ideas** across **377 drafts**. This is the

 | Ideas/Draft | Drafts |
 |-------------|--------|
-| 1 | 337 |
-| 2 | 38 |
+| 1 | 370 |
+| 2 | 43 |
 | 3 | 2 |
-| 0 (no ideas) | 57 |
+| 0 (no ideas) | 59 |

 The near-uniform 1-idea-per-draft (89% of drafts with ideas) suggests either aggressive dedup or a re-extraction with constrained output. The original pipeline extracted 1-4 ideas per draft, so the 1,780 figure likely reflects pre-dedup counts.

 Excluding false positives: 365 ideas across 326 drafts.
+### Convergence Analysis (2026-03-08)
+
+Cross-organization idea convergence analysis (threshold: 0.75 SequenceMatcher similarity):
+
+| Metric | Value |
+|--------|-------|
+| Total ideas | 462 |
+| Unique clusters | 398 |
+| Cross-org convergent ideas | 132 |
+| Convergence rate | 33% |
+
+Top convergent ideas by organization count:
+- **Fully Adaptive Routing Ethernet for AI** — 14 orgs (Baidu, Broadcom, China Mobile, etc.)
+- **AI Agent Protocol Framework** — 7 orgs, 3 drafts
+- **Natural Language Protocol for Agent Comm** — 7 orgs
+- **LISP-based geospatial intelligence network** — 6 orgs
+- **MCP-Based Network Management Plane** — 4 orgs (Deutsche Telekom, Huawei, Orange, Telefonica)
+
 ## Actions Taken (2026-03-08)

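The convergence section added in this hunk names a 0.75 `SequenceMatcher` similarity threshold but does not show the clustering procedure. The sketch below is one way such an analysis could work, under stated assumptions: titles are compared case-insensitively with `difflib.SequenceMatcher.ratio()`, clusters are built greedily against the first title seen, and a cluster counts as convergent when it spans at least two organizations. The function names and the sample data are hypothetical, not taken from the repository or its database.

```python
from difflib import SequenceMatcher

THRESHOLD = 0.75  # similarity cutoff quoted in the convergence section


def similar(a: str, b: str) -> bool:
    """Return True when two titles meet the similarity threshold."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= THRESHOLD


def cluster_ideas(ideas):
    """Greedily group (title, org) pairs around the first title seen."""
    clusters = []  # each entry: {"title": str, "orgs": set, "size": int}
    for title, org in ideas:
        for cluster in clusters:
            if similar(title, cluster["title"]):
                cluster["orgs"].add(org)
                cluster["size"] += 1
                break
        else:
            # No existing cluster matched, so this title starts a new one.
            clusters.append({"title": title, "orgs": {org}, "size": 1})
    return clusters


def convergence_rate(clusters):
    """Count clusters spanning two or more orgs and their share of all clusters."""
    convergent = [c for c in clusters if len(c["orgs"]) >= 2]
    return len(convergent), len(convergent) / len(clusters)


# Hypothetical sample data for illustration only.
sample = [
    ("AI Agent Protocol Framework", "Org A"),
    ("AI Agent Protocol Frameworks", "Org B"),
    ("MCP-Based Network Management Plane", "Org C"),
    ("Natural Language Protocol for Agent Comm", "Org A"),
]
clusters = cluster_ideas(sample)
count, rate = convergence_rate(clusters)
print(len(clusters), count, f"{rate:.0%}")
```

On the reported figures, 132 / 398 is about 33.2% while 132 / 462 is about 28.6%, so the 33% rate appears to be computed over unique clusters rather than over total ideas. Greedy clustering is order-dependent, so a real run might produce slightly different cluster counts than a pairwise or transitive grouping.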