Public Datasets & Transparency

Fully Open, Blinded Datasets — Scored by GPT + Grok + Gemini Ensemble. Download anything.

50 series (43 distinct societies)
41,519 node-year records
3–5 AI scorers per series (ensemble)
5 CE – 2026 time span covered

33 datasets fully validated (Stress-column pass) · 8 further datasets undergoing quality review

Validation results

75–90%: ensemble hindcast accuracy across tested societies
r = −0.958: entropy–health correlation in validated datasets
50: historical series across 43 societies, 5 CE–2026
r = 0.78: Seshat cross-validation, Latium/Rome (Apr 2026)

Methodology — how the scores are produced:
1. Blinding. Each node is scored using only information available in that historical moment — the scorer assesses function at that year without reference to what came next. Scorers are trained on historical text that may implicitly contain outcome information; this limitation is acknowledged openly in the methodology documentation.
2. Triple-AI scoring. GPT-4, Grok, and Gemini each independently assign 1–10 values for Coherence, Capacity, Stress, and Abstraction per node per time-step.
3. Ensemble average. The scores from all scorers in the ensemble (three to five per series) are averaged per node and time-step to produce the final CAMS matrix.
4. Validation. Ensemble reliability (r > 0.7 across models) and inter-rater agreement are documented in the repository: DATASET_VALIDATION_SUMMARY.md, CAMS_Validation_Formulation.md.
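
The averaging and agreement steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual pipeline: the scorer labels, the function names, and all score values are hypothetical, and Pearson r is assumed as the agreement measure behind the "r > 0.7 across models" figure.

```python
from itertools import combinations
from math import sqrt
from statistics import mean

METRICS = ("Coherence", "Capacity", "Stress", "Abstraction")

# Hypothetical 1-10 scores from three scorers for one node-year (not project data).
scores = {
    "scorer_a": {"Coherence": 7, "Capacity": 6, "Stress": 4, "Abstraction": 5},
    "scorer_b": {"Coherence": 8, "Capacity": 6, "Stress": 5, "Abstraction": 5},
    "scorer_c": {"Coherence": 6, "Capacity": 6, "Stress": 3, "Abstraction": 5},
}


def ensemble_mean(scores_by_scorer):
    """Average each metric across scorers (step 3)."""
    return {m: mean(s[m] for s in scores_by_scorer.values()) for m in METRICS}


def pearson_r(xs, ys):
    """Plain Pearson correlation between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def pairwise_agreement(series_by_scorer):
    """Pearson r for every pair of scorers over the same run of node-years (step 4)."""
    return {
        (a, b): pearson_r(series_by_scorer[a], series_by_scorer[b])
        for a, b in combinations(sorted(series_by_scorer), 2)
    }


# Hypothetical Stress scores for one node over four time-steps, per scorer.
stress_series = {
    "scorer_a": [3, 4, 6, 8],
    "scorer_b": [2, 4, 7, 8],
    "scorer_c": [4, 4, 5, 9],
}

final = ensemble_mean(scores)
agreement = pairwise_agreement(stress_series)
passes = all(r > 0.7 for r in agreement.values())
```

Here `passes` checks every scorer pair against the 0.7 threshold; whether the project applies the threshold per pair, per node, or on pooled scores is not stated in this page.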
Formal validation of the ensemble approach: The Wright Thesis, which holds that the standard deviation of the ensemble mean falls as 1/√N (equivalently, its variance falls as 1/N) so that SNR improves by √5 for a five-scorer ensemble, has been confirmed against CAMS Germany and USA data (1880–2026). Crucially, inter-rater disagreement is structured and historiographically meaningful, not random noise.
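
The 1/√N scaling can be illustrated with a small simulation. This is a sketch under idealized assumptions that are not taken from the validation report: scorer errors are modelled as independent Gaussian noise with equal variance, and the true score, noise level, and function name are hypothetical. Structured (correlated) disagreement of the kind described above would reduce the gain below √N.

```python
import random
from math import sqrt
from statistics import pstdev


def simulate_noise_reduction(n_scorers=5, sigma=1.0, trials=20000, seed=0):
    """Compare the spread of one scorer's value with the spread of the ensemble mean."""
    rng = random.Random(seed)
    true_score = 5.0  # hypothetical underlying value on the 1-10 scale
    single, averaged = [], []
    for _ in range(trials):
        # Each scorer reports the true score plus independent Gaussian error.
        draws = [true_score + rng.gauss(0.0, sigma) for _ in range(n_scorers)]
        single.append(draws[0])
        averaged.append(sum(draws) / n_scorers)
    return pstdev(single), pstdev(averaged)


sd_single, sd_mean = simulate_noise_reduction()
snr_gain = sd_single / sd_mean  # approaches sqrt(5) for five independent scorers
```

With independent errors, `snr_gain` lands close to `sqrt(5)`, about 2.236; the variance ratio is the square of that, about 5.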
Ensemble Validation Report →

All datasets are in the public GitHub repository under data/cleaned/. No login required.

Browse all datasets on GitHub · Full datasets index (DATASETS_INDEX.md)

19 nations now have matched ensemble mean + envelope pairs. Added May–June 2026: Colombia (1875–2026 annual), USA (1900–2026 5-yr), Türkiye (1875–2026 5-yr), India (1875–2026 5-yr), Japan (1875–2026 5-yr), and Australia envelope. Earlier additions: Argentina, Canada, Chile, France, UK, Germany, Norway, Poland, Russia, Sweden, Iran, China, Thailand. New series in data/nations/; earlier series in cleaned_datasets/.
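
As a rough sketch of what an envelope row carries alongside its ensemble mean, the snippet below computes the inter-rater SD per metric and a Node Value range for one node-year. Several details are assumptions rather than documented behaviour: that the SD is the sample standard deviation, that V_range equals V_max minus V_min over the individual scorers' Node Values, and that Node Value uses the formula C + K + (A/2) − S from the column reference. The input scores are hypothetical.

```python
from statistics import stdev

METRICS = ("Coherence", "Capacity", "Stress", "Abstraction")


def node_value(c, k, s, a):
    """Node Value as given in the column reference: C + K + (A/2) - S."""
    return c + k + a / 2 - s


def envelope(scorer_rows):
    """scorer_rows: one (C, K, S, A) tuple per scorer for a single node-year."""
    # Sample SD across scorers for each metric (assumed; could also be population SD).
    sd = {name: stdev(col) for name, col in zip(METRICS, zip(*scorer_rows))}
    values = [node_value(*row) for row in scorer_rows]
    return {
        "sd": sd,
        "V_min": min(values),
        "V_max": max(values),
        "V_range": max(values) - min(values),
    }


# Hypothetical scores from a five-scorer ensemble (not project data).
rows = [(7, 6, 4, 5), (8, 6, 5, 5), (6, 6, 3, 5), (7, 7, 4, 6), (7, 5, 4, 4)]
env = envelope(rows)
```

For these inputs the scorers' Node Values span 10.0 to 13.0, so `env["V_range"]` is 3.0 even though three of the five scorers agree on the same Node Value.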

Americas

Colombia — Ensemble Mean 1875–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Colombia — Envelope 1875–2026 · inter-rater SD per node per year · with V_range
United States — Ensemble Mean 1900–2026 · 5-Agent Ensemble · 5-year snapshots · with Node Value & Bond Strength
United States — Envelope 1900–2026 · inter-rater SD per node per year · with V_range
Argentina — Ensemble Mean 1950–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Argentina — Envelope 1950–2026 · inter-rater SD per node per year
Argentina 1950–2026 · 5-Agent Ensemble · raw scores (C, K, S, A)
Canada — Ensemble Mean 1850–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Canada — Envelope 1850–2026 · inter-rater SD per node per year
Chile — Ensemble Mean 1875–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Chile — Envelope 1875–2026 · inter-rater SD per node per year · with V_range

Europe

France — Ensemble Mean 1850–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
France — Envelope 1850–2026 · inter-rater SD per node per year · with V_range
United Kingdom — Ensemble Mean 1880–2026 · CAMNations5 · with Node Value & Bond Strength
United Kingdom — Envelope 1880–2026 · inter-rater SD per node per year · with V_range
Germany — Ensemble Mean 1880–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Germany — Envelope 1880–2026 · inter-rater SD per node per year · with V_range
Norway — Ensemble Mean 1880–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Norway — Envelope 1880–2026 · inter-rater SD per node per year · with V_range
Poland — Ensemble Mean 1875–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Poland — Envelope 1875–2026 · inter-rater SD per node per year · with V_range
Russia — Ensemble Mean 1800–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Russia — Envelope 1800–2026 · inter-rater SD per node per year · with V_range
Sweden — Ensemble Mean 1880–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Sweden — Envelope 1850–2026 · inter-rater SD per node per year · with V_range
Türkiye — Ensemble Mean 1875–2026 · 5-Agent Ensemble · 5-year snapshots · with Node Value & Bond Strength
Türkiye — Envelope 1875–2026 · inter-rater SD per node per year · with V_range

Middle East

Iran — Ensemble Mean 1875–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Iran — Envelope 1875–2026 · inter-rater SD per node per year · with V_range

Asia-Pacific

Australia — Ensemble Mean 1875–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Australia — Envelope 1875–2026 · inter-rater SD per node per year · with V_range
India — Ensemble Mean 1875–2026 · 5-Agent Ensemble · 5-year snapshots · with Node Value & Bond Strength
India — Envelope 1875–2026 · inter-rater SD per node per year · with V_range
Japan — Ensemble Mean 1875–2026 · 5-Agent Ensemble · 5-year snapshots · with Node Value & Bond Strength
Japan — Envelope 1875–2026 · inter-rater SD per node per year · with V_range
China — Ensemble Mean 1800–2025 · CAMNations5 · with Node Value & Bond Strength
China — Envelope 1800–2025 · inter-rater SD per node per year · with V_range
China — Block2 Envelope 1800–2025 · CAMNations5 Block2 · SD + V_range · V_min · V_max
China — Envelope (1850–2026) 1850–2026 · inter-rater SD · extended series · with V_range
China — Extended Ensemble 1850–2026 · 5-Agent Ensemble · extended series
Thailand — Ensemble Mean 1850–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Thailand — Envelope 1850–2025 · inter-rater SD per node per year · with V_range

Ancient & Pre-Modern

Latium Vetus — Ensemble Mean 460–2010 · Multi-Agent Ensemble · Roman institutional history
Latium Vetus — Envelope 460–2010 · inter-rater SD per node per year · with V_range

22 new single-scorer and ensemble datasets added May 2026, now powering the CAMS National Diagnostic (32 societies). Canonical 8-node format throughout.

⚠ Pending integration: These datasets are not yet available in the CAMS Explorer, CAMS Interpreter, Zeitgeist Detector, or other dynamic services. They will be added to all live tools when resources allow.

Western Europe

France 1785–2024 · Single-scorer · C, K, S, A, V, BS
Netherlands 1750–2024 · Single-scorer · C, K, S, A, V, BS
Denmark 1752–2025 · Single-scorer · C, K, S, A, V, BS
Norway 1881–2025 · GEM scorer · C, K, S, A, V, BS

East Asia & Pacific

Japan 1850–2025 · Single-scorer · C, K, S, A, V, BS
Hong Kong 1900–2015 · Single-scorer · C, K, S, A, V, BS
Singapore 1935–2025 · Single-scorer · C, K, S, A, V, BS
Thailand 1850–2025 · Single-scorer · C, K, S, A, V, BS
Indonesia 1941–2025 · Single-scorer · C, K, S, A, V, BS

South Asia

India 1950–2024 · Single-scorer · C, K, S, A, V, BS
Pakistan 1947–2025 · Single-scorer · C, K, S, A, V, BS

Middle East & North Africa

Saudi Arabia 1918–2025 · Single-scorer · C, K, S, A, V, BS
UAE 1970–2026 · GEM scorer · C, K, S, A, V, BS
Iran — Ensemble Mean 5-Agent Ensemble Mean · C, K, S, A, V, BS
Iran — Envelope Inter-rater SD per node per year
Iran — Single-scorer (no BS) 1900–2025 · C, K, S, A, V · bond strength unavailable
Iraq 1900–2025 · Single-scorer · C, K, S, A, V, BS
Israel 1946–2025 · Single-scorer · C, K, S, A, V, BS
Lebanon 1943–2025 · Single-scorer · C, K, S, A, V, BS
Syria 1893–2024 · Single-scorer · C, K, S, A, V, BS

Eurasia

Ukraine 1980–2025 · Single-scorer · C, K, S, A, V, BS

Americas & Africa

Venezuela 1970–2025 · GEM scorer · C, K, S, A, V, BS
Brazil 1880–2025 · Grok scorer · C, K, S, A, V, BS
South Africa 1880–2025 · GEM scorer · C, K, S, A, V, BS

Every dataset follows the same column structure — directly loadable into the CAMS Dashboard, Python scripts, or any data tool.

Column          Type          Description
Society         string        Society or organisation name
Year            integer       Year of observation
Node            string        One of the 8 institutional nodes (Lore, Archive, Helm, Stewards, Shield, Craft, Hands, Flow)
Coherence       float 1–10    Internal alignment of the node
Capacity        float 1–10    Resources and effectiveness
Stress          float 1–10    Accumulated pressure and dysfunction
Abstraction     float 1–10    Symbolic sophistication and long-range planning
Node Value      float         Derived: C + K + (A/2) − S
Bond Strength   float 0–1     Derived: coupling with adjacent node
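
A quick way to sanity-check a downloaded file against this schema is sketched below. It reads an inline sample instead of a real file, so the exact CSV header spellings (for example "Node Value" with a space), the society name "Exampleland", and all values are assumptions for illustration only; actual files in the repository may name columns differently.

```python
import csv
import io

NODES = {"Lore", "Archive", "Helm", "Stewards", "Shield", "Craft", "Hands", "Flow"}
SCORE_COLUMNS = ("Coherence", "Capacity", "Stress", "Abstraction")

# Hypothetical sample in the documented column order (not project data).
SAMPLE = """Society,Year,Node,Coherence,Capacity,Stress,Abstraction,Node Value,Bond Strength
Exampleland,1900,Helm,6.0,5.5,4.0,5.0,10.0,0.62
Exampleland,1900,Flow,5.0,6.0,6.5,4.0,6.5,0.48
"""


def check_row(row, tol=1e-6):
    """Return a list of schema problems for one record (empty list means OK)."""
    problems = []
    c, k, s, a = (float(row[name]) for name in SCORE_COLUMNS)
    if row["Node"] not in NODES:
        problems.append("unknown node")
    if not all(1 <= x <= 10 for x in (c, k, s, a)):
        problems.append("score outside 1-10")
    # Recompute the derived column: C + K + (A/2) - S.
    if abs(float(row["Node Value"]) - (c + k + a / 2 - s)) > tol:
        problems.append("Node Value mismatch")
    if not 0 <= float(row["Bond Strength"]) <= 1:
        problems.append("Bond Strength outside 0-1")
    return problems


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
report = {(r["Society"], int(r["Year"]), r["Node"]): check_row(r) for r in rows}
```

To run the same check on a downloaded dataset, replace `io.StringIO(SAMPLE)` with an opened file handle and adjust the header names to match the file.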

→ See the model in action — Live Applications & Dashboards

→ Core framework — The CAMS Model

Repository: github.com/KaliBond/wintermute
Open science. Contributions and new datasets welcome. CC0 licence.

Let the numbers do the talking.