ietf-draft-analyzer/data/reports/blog-series/data/00-master-stats.md
Christian Nennemann e7527ad68e Fix remaining critical, high, and medium issues from 4-perspective review
Critical fixes:
- Fix rating clamp range 1-10 → 1-5 (actual scale)
- Add `ietf ideas convergence` command (SequenceMatcher at 0.75 threshold)
- Fix "628 cross-org ideas" → 130 (verified from current DB) across 8 files

Security fixes:
- Sanitize FTS5 query input (strip special chars + boolean operators)
- Add rate limiting (10 req/min/IP) on Claude-calling endpoints
- Change <path:name> → <string:name> on draft routes

Codebase fixes:
- Add Database context manager (__enter__/__exit__)
- Wire false_positive filtering into queries (exclude by default in web UI)
- Fix Post 3 arithmetic ("~300" → "~409" distinct proposals)

Content & licensing:
- Add MIT LICENSE file
- Add IPR/FRAND notes (BCP 79, RFC 8179) to Posts 03 and 07
- Qualify "4:1 safety ratio" with monthly variation in 6 remaining files
- Add "Data as of March 2026" freeze-date headers to all 10 blog posts
- Hedge causal language in Post 04

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-08 12:47:47 +01:00


Master Statistics — Updated 2026-03-03 (Full 361-Draft Corpus)

All numbers below reflect the complete 361-draft dataset after the pipeline run on 101 new drafts.

Core Numbers

| Stat | Value | Notes |
|---|---|---|
| Total drafts | 361 | up from 260 after keyword expansion |
| Total authors | 557 | up from 403 |
| Total organizations | 230 | up from 184 |
| Total ideas (raw) | 1,780 | up from 1,262 (~4.9/draft avg) |
| Unique idea clusters | 1,467 | after fuzzy dedup |
| Cross-org ideas (2+ orgs) | 130 | LEAD METRIC. 36% of unique clusters (current 419-idea extraction); earlier 1,780-idea run yielded 628 |
| Total gaps | 12 | 3 critical, 6 high, 3 medium |
| Total embeddings | 361 | all drafts embedded |
| WG-adopted drafts | 36 (10.0%) | 18 WGs |
| Individual drafts | 325 (90.0%) | |
| RFC cross-references | 4,231 | 2,443 RFC + 698 draft + 1,090 BCP |
| Avg novelty | 3.32 | 1-5 scale |
| Avg maturity | 2.96 | |
| Avg relevance | 3.84 | |

Growth Curve (Monthly Submissions)

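| Month | Drafts | Cumulative |
|---|---|---|
| 2024-01 | 3 | 3 |
| 2024-02 | 1 | 4 |
| 2024-04 | 1 | 5 |
| 2024-09 | 2 | 7 |
| 2024-10 | 1 | 8 |
| 2024-12 | 1 | 9 |
| 2025-01 | 4 | 13 |
| 2025-04 | 5 | 18 |
| 2025-05 | 2 | 20 |
| 2025-06 | 5 | 25 |
| 2025-07 | 5 | 30 |
| 2025-08 | 8 | 38 |
| 2025-09 | 17 | 55 |
| 2025-10 | 67 | 122 |
| 2025-11 | 61 | 183 |
| 2025-12 | 16 | 199 |
| 2026-01 | 54 | 253 |
| 2026-02 | 86 | 339 |
| 2026-03 | 22 | 361 |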

Peak: 86 drafts in Feb 2026. Growth from ~2/mo (mid-2024) to 86/mo is a roughly 43x increase in monthly submissions.
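The cumulative column and the 43x figure can be re-derived from the monthly counts. The sketch below is illustrative only: the `monthly` dict is transcribed from the growth table, and `baseline_per_month = 2` is an assumption standing in for the "~2/mo" mid-2024 estimate.

```python
from itertools import accumulate

# Monthly draft submissions, transcribed from the growth table.
monthly = {
    "2024-01": 3, "2024-02": 1, "2024-04": 1, "2024-09": 2,
    "2024-10": 1, "2024-12": 1, "2025-01": 4, "2025-04": 5,
    "2025-05": 2, "2025-06": 5, "2025-07": 5, "2025-08": 8,
    "2025-09": 17, "2025-10": 67, "2025-11": 61, "2025-12": 16,
    "2026-01": 54, "2026-02": 86, "2026-03": 22,
}

# Running total per month; the last value should equal the corpus size.
cumulative = dict(zip(monthly, accumulate(monthly.values())))

# Month with the most submissions, and its multiple of the baseline.
peak_month = max(monthly, key=monthly.get)
baseline_per_month = 2  # assumption: the "~2/mo" mid-2024 estimate
growth_factor = monthly[peak_month] / baseline_per_month

print(cumulative["2026-03"], peak_month, growth_factor)
```

Because the baseline is an estimate, the growth factor is only as precise as that estimate; a baseline of 1/mo or 3/mo would change the multiple considerably.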

Category Distribution (Full 361 Drafts)

| Category | Count | % |
|---|---|---|
| A2A protocols | 136 | 37.7% |
| Agent identity/auth | 121 | 33.5% |
| Autonomous netops | 98 | 27.1% |
| ML traffic mgmt | 74 | 20.5% |
| AI safety/alignment | 45 | 12.5% |
| Human-agent interaction | 30 | 8.3% |

Note: drafts can have multiple categories.

Safety Ratio

  • Safety drafts: 45 (12.5% of corpus)
  • Capability drafts (any non-safety category): 351
  • Ratio: ~8:1 capability-to-safety
  • Up from the original 4:1 (260 drafts): the gap widened because keyword expansion brought in more ML infrastructure drafts, only some of which have safety elements
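The arithmetic behind the ratio can be checked with a few lines. This is a minimal sketch using only the figures listed above; the variable names are illustrative and not taken from the project's code.

```python
TOTAL_DRAFTS = 361
safety_drafts = 45        # drafts tagged AI safety/alignment
capability_drafts = 351   # drafts with any non-safety category

# Share of the corpus that carries the safety category.
safety_share = safety_drafts / TOTAL_DRAFTS

# Capability drafts per safety draft.
capability_to_safety = capability_drafts / safety_drafts

print(f"safety share: {safety_share:.1%}")
print(f"capability-to-safety: {capability_to_safety:.1f}:1")
```

The exact quotient is 7.8:1, which the bullet list rounds to ~8:1. Since drafts can carry several categories, the two counts overlap and do not sum to 361.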

Keyword Expansion Impact (Original 260 vs New 101)

| Category | Original 260 | New 101 | Total |
|---|---|---|---|
| Data formats/interop | 102 | 43 | 145 |
| A2A protocols | 92 | 28 | 120 |
| Agent identity/auth | 98 | 10 | 108 |
| Autonomous netops | 60 | 33 | 93 |
| Policy/governance | 60 | 31 | 91 |
| ML traffic mgmt | 23 | 50 | 73 |
| Agent discovery/reg | 57 | 8 | 65 |
| AI safety/alignment | 36 | 8 | 44 |
| Model serving/inference | 13 | 29 | 42 |
| Human-agent interaction | 22 | 8 | 30 |

Key finding: "ML traffic mgmt" and "Model serving/inference" surged with the new keywords: both categories roughly tripled (23 → 73 and 13 → 42), while no other category doubled. The "inference" and "generative" keywords opened up the ML infrastructure community.
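The per-category growth can be recomputed from the table. The following sketch is illustrative: `counts` is transcribed from the table, and the "more than doubled" cut-off of 2x is simply the wording of the key finding expressed as a number.

```python
# (original 260, new 101) draft counts per category, from the table.
counts = {
    "Data formats/interop": (102, 43),
    "A2A protocols": (92, 28),
    "Agent identity/auth": (98, 10),
    "Autonomous netops": (60, 33),
    "Policy/governance": (60, 31),
    "ML traffic mgmt": (23, 50),
    "Agent discovery/reg": (57, 8),
    "AI safety/alignment": (36, 8),
    "Model serving/inference": (13, 29),
    "Human-agent interaction": (22, 8),
}

# Growth factor = total after expansion / count in the original corpus.
growth = {name: (orig + new) / orig for name, (orig, new) in counts.items()}

# Categories whose total is more than twice the original count.
more_than_doubled = sorted(name for name, g in growth.items() if g > 2)

for name in more_than_doubled:
    print(f"{name}: {growth[name]:.1f}x")
```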

Geopolitical Split

| Region | Drafts | Authors |
|---|---|---|
| Chinese-affiliated | 152 | 218 |
| Western-affiliated | 94 | 81 |
| Other/Unclassified | 158 | 221 |

Chinese-affiliated orgs contribute ~42% of the 361 drafts from ~39% of the 557 authors. Western-affiliated orgs contribute ~26% of drafts from ~15% of authors.
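The percentages above use the corpus totals (361 drafts, 557 authors) as denominators. A small sketch, with illustrative variable names, reproduces them and also shows that the per-region draft counts add up to more than the corpus size, which suggests (this is an inference, not something the data states) that co-authored drafts are counted once per region involved.

```python
TOTAL_DRAFTS, TOTAL_AUTHORS = 361, 557

# (drafts, authors) per region, transcribed from the table.
regions = {
    "Chinese-affiliated": (152, 218),
    "Western-affiliated": (94, 81),
    "Other/Unclassified": (158, 221),
}

# Share of all drafts and of all authors for each region.
shares = {
    name: (drafts / TOTAL_DRAFTS, authors / TOTAL_AUTHORS)
    for name, (drafts, authors) in regions.items()
}

# Sum of the draft column; a value above 361 implies overlap between rows.
draft_rows_total = sum(drafts for drafts, _ in regions.values())

for name, (draft_share, author_share) in shares.items():
    print(f"{name}: {draft_share:.0%} of drafts, {author_share:.0%} of authors")
print("sum of draft column:", draft_rows_total)
```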

Idea Taxonomy (current: 419 ideas / 361 unique clusters / 130 cross-org; earlier run: 1,780 raw / 1,467 unique / 628 cross-org)

| Type | Count | % |
|---|---|---|
| mechanism | 663 | 37.2% |
| architecture | 280 | 15.7% |
| pattern | 251 | 14.1% |
| protocol | 228 | 12.8% |
| requirement | 171 | 9.6% |
| extension | 168 | 9.4% |
| framework | 9 | 0.5% |
| other | 10 | 0.6% |

Note: the counts in this table sum to 1,780, so the breakdown reflects the earlier raw extraction rather than the current 419-idea run.

IMPORTANT: Use 130 cross-org ideas as the lead metric (from the current 419-idea extraction at 0.75 SequenceMatcher threshold). An earlier pipeline run with 1,780 raw ideas yielded 628 cross-org convergent ideas; the convergence rate (~36%) is consistent. The raw count is a pipeline artifact (~4.9/draft avg). See Post 5 data package for details.
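To make the "0.75 SequenceMatcher threshold" concrete, here is a minimal sketch of how cross-org convergence could be counted with Python's `difflib.SequenceMatcher`. Only the threshold and the "2+ orgs" rule come from this document. The sample ideas, the organization names, and the greedy clustering strategy are assumptions for illustration, and the project's actual `ietf ideas convergence` command may work differently.

```python
from difflib import SequenceMatcher

THRESHOLD = 0.75  # similarity threshold named in the text

# Hypothetical (idea title, organization) pairs, for illustration only.
ideas = [
    ("agent capability discovery", "Org A"),
    ("agent capability discovery protocol", "Org B"),
    ("human override channel", "Org C"),
    ("human override channels", "Org C"),
]

def similar(a, b):
    """True when two titles meet the similarity threshold."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= THRESHOLD

# Greedy clustering (an assumption): each idea joins the first cluster
# whose representative title is similar enough, else starts a new one.
clusters = []
for title, org in ideas:
    for cluster in clusters:
        if similar(title, cluster["title"]):
            cluster["orgs"].add(org)
            break
    else:
        clusters.append({"title": title, "orgs": {org}})

# A cluster is "cross-org" when two or more organizations contribute to it.
cross_org = [c for c in clusters if len(c["orgs"]) >= 2]

print(len(clusters), "clusters,", len(cross_org), "cross-org")
```

In this toy input the two "human override" titles merge into one cluster, but both come from the same organization, so that cluster does not count as cross-org.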

Top Organizations

| Org | Drafts | Authors | Composite Score |
|---|---|---|---|
| Huawei (all entities) | 57+ | 28+ | 3.1 |
| China Mobile | 35 | 24 | 3.2 |
| China Telecom | 23 | 22 | 3.0 |
| China Unicom | 22 | 22 | 3.0 |
| Cisco (all entities) | 25 | 19 | 3.4 |
| Tsinghua University | 16 | 13 | 3.5 |
| Telefonica | 13 | 2 | 3.2 |
| ZTE Corporation | 10 | 10 | 3.0 |
| Google | 10 | 4 | 3.3 |
| Five9 | 10 | 1 | 3.8 |
| Ericsson | 9 | 4 | 3.6 |

Quality Leaders (Composite >= 3.5, min 3 drafts)

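| Org | Drafts | Composite |
|---|---|---|
| Aiiva.org | 3 | 4.42 |
| AWS | 3 | 4.38 |
| Mozilla | 4 | 3.81 |
| Zhongguancun Lab | 6 | 3.81 |
| Five9 | 10 | 3.75 |
| Bitwave | 6 | 3.75 |
| Siemens | 5 | 3.75 |
| Inria | 4 | 3.70 |
| Ericsson | 9 | 3.59 |
| Nokia | 5 | 3.58 |
| Beijing Univ P&T | 4 | 3.57 |
| Tsinghua | 16 | 3.53 |
| Cisco Systems | 17 | 3.50 |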

WG Adoption

| WG | Drafts | Focus |
|---|---|---|
| lamps | 6 | PKI/certificates |
| lake | 5 | EDHOC/lightweight crypto |
| tls | 3 | TLS extensions |
| emu | 3 | EAP methods |
| sshm | 2 | SSH maintenance |
| httpbis | 2 | HTTP extensions |
| anima | 2 | Bootstrapping |
| aipref | 2 | AI preferences |
| ace | 2 | Auth for constrained envs |

19 of 36 WG drafts (53%) are in security/crypto WGs. Only 2 are in an agent-specific WG (aipref).
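The 53% figure can be reproduced from the WG table. In the sketch below, the choice of which WGs count as "security/crypto" is an assumption: the set was picked because it is the one that yields the stated total of 19, and the document does not list it explicitly.

```python
WG_ADOPTED_TOTAL = 36  # WG-adopted drafts across all 18 WGs

# Drafts per WG, transcribed from the table (9 of the 18 WGs).
wg_drafts = {
    "lamps": 6, "lake": 5, "tls": 3, "emu": 3, "sshm": 2,
    "httpbis": 2, "anima": 2, "aipref": 2, "ace": 2,
}

# Assumption: WGs treated as security/crypto; reproduces the stated 19.
security_wgs = {"lamps", "lake", "tls", "emu", "ace"}

security_drafts = sum(wg_drafts[wg] for wg in security_wgs)
security_share = security_drafts / WG_ADOPTED_TOTAL

# Adopted drafts that belong to WGs not shown in the table.
unlisted_drafts = WG_ADOPTED_TOTAL - sum(wg_drafts.values())

print(f"{security_drafts} of {WG_ADOPTED_TOTAL} ({security_share:.0%})")
print("drafts in unlisted WGs:", unlisted_drafts)
```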

Top 10 Highest-Scored Drafts

| Draft | Title | Composite |
|---|---|---|
| draft-aylward-daap-v2 | Distributed AI Accountability Protocol v2 | 4.75 |
| draft-ietf-lake-app-profiles | EDHOC Application Profiles | 4.75 |
| draft-cowles-volt | Verifiable Operations Ledger and Trace | 4.75 |
| draft-goswami-agentic-jwt | Secure Intent Protocol for Agents | 4.50 |
| draft-chang-agent-token-efficient | Token-efficient Data Layer for Agents | 4.50 |
| draft-birkholz-verifiable-agent-conversations | Verifiable Agent Conversations | 4.50 |
| draft-guy-bary-stamp-protocol | Secure Task-bound Agent Message Proof | 4.50 |
| draft-drake-email-tpm-attestation | Hardware Attestation for Email | 4.50 |
| draft-ietf-tls-ecdhe-mlkem | Post-quantum Hybrid Key Agreement | 4.50 |
| draft-ietf-hpke-hpke | Hybrid Public Key Encryption | 4.50 |

Updated Gap List (12 gaps, refreshed)

Critical (3)

  1. Agent Behavior Verification — No mechanisms to verify agents actually behave according to declared policies
  2. Cross-Domain Agent Liability — When agents cause harm across organizational boundaries, who's responsible?
  3. Human Override Protocols — No standardized emergency override protocols for autonomous agents

High (6)

  1. Agent Resource Exhaustion Protection — No mechanisms to prevent agents from consuming excessive resources
  2. Agent-Generated Data Provenance — Insufficient tracking of data origins as info flows between agents
  3. Agent Capability Degradation Handling — No approach for detecting when agent capabilities degrade
  4. Multi-Agent Coordination Deadlocks — Insufficient attention to preventing deadlock in multi-agent systems
  5. Agent Privacy Preservation — Agents process sensitive data without adequate privacy protections
  6. Agent Firmware/Model Update Security — Insufficient focus on secure update mechanisms

Medium (3)

  1. Real-time Agent Debugging — Missing protocols for debugging agents in production
  2. Cross-Protocol Agent Migration — No mechanisms for migrating agent state between protocols
  3. Agent Energy Consumption Optimization — Missing standards for energy-aware agent operation

Most Referenced RFCs (Foundation Standards)

| RFC | Cited By | Subject |
|---|---|---|
| RFC 2119 | 285 drafts | Key words (MUST, SHALL, etc.) |
| RFC 8174 | 237 drafts | Key words update |
| RFC 8446 | 42 drafts | TLS 1.3 |
| RFC 6749 | 36 drafts | OAuth 2.0 |
| RFC 9110 | 34 drafts | HTTP Semantics |
| RFC 8126 | 26 drafts | IANA Guidelines |
| RFC 8259 | 26 drafts | JSON |
| RFC 5280 | 22 drafts | X.509 PKI |
| RFC 7519 | 22 drafts | JWT |
| RFC 9052 | 20 drafts | CBOR Object Signing and Encryption (COSE) |