Public Datasets & Transparency
Fully Open, Blinded Datasets — Scored by GPT + Grok + Gemini Ensemble. Download anything.
Series: 50 (43 distinct societies)
AI scorers per series (ensemble): 3–5
Time span covered: 5 CE–2026
33 datasets fully validated (Stress-column pass) · 8 further datasets undergoing quality review
Validation results
Ensemble hindcast accuracy across tested societies: 75–90%
Entropy–health correlation in validated datasets: r = −0.958
Historical series: 50 across 43 societies, 5 CE–2026
Seshat cross-validation, Latium/Rome (Apr 2026): r = 0.78
Methodology — how the scores are produced:
1. Blinding. Each node is scored using only information available at that historical moment: the scorer assesses function in that year without reference to what came next. Because the scorers are trained on historical text that may implicitly contain outcome information, the blinding is imperfect; this limitation is acknowledged openly in the methodology documentation.
2. Triple-AI scoring. GPT-4, Grok, and Gemini each independently assign 1–10 values for Coherence, Capacity, Stress, and Abstraction per node per time-step.
3. Ensemble average. The three scores are averaged to produce the final CAMS matrix.
4. Validation. Ensemble reliability (r > 0.7 across models) and inter-rater agreement are documented in the repository in DATASET_VALIDATION_SUMMARY.md and CAMS_Validation_Formulation.md.
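The averaging and reliability steps above can be sketched in a few lines of Python. This is a minimal illustration, not the project's actual pipeline: the scorer names, the score values, and the helper names `ensemble_row` and `pearson_r` are all hypothetical, and the only figures taken from this page are the four scored dimensions and the r > 0.7 reliability threshold.

```python
from statistics import mean, stdev

DIMENSIONS = ("Coherence", "Capacity", "Stress", "Abstraction")

# Hypothetical scores for one node in one year: scorer -> (C, K, S, A).
# Names and values are illustrative only, not taken from any CAMS file.
scores = {
    "gpt": (6.0, 7.0, 4.0, 5.0),
    "grok": (5.0, 6.0, 5.0, 5.0),
    "gemini": (7.0, 8.0, 3.0, 5.0),
}

def ensemble_row(scores_by_scorer):
    """Return the ensemble mean and inter-rater sample SD per dimension."""
    columns = zip(*scores_by_scorer.values())  # one tuple per dimension
    return {
        dim: {"mean": mean(col), "sd": stdev(col)}
        for dim, col in zip(DIMENSIONS, columns)
    }

def pearson_r(x, y):
    """Pearson correlation of two equal-length series (undefined if either is constant)."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5

row = ensemble_row(scores)
print(row["Coherence"])

# Hypothetical Stress series from two scorers for the same node over four time-steps.
gpt_series = [4.0, 5.0, 6.0, 7.0]
grok_series = [3.5, 5.0, 5.5, 7.5]
print(pearson_r(gpt_series, grok_series) > 0.7)  # reliability threshold from step 4
```

The per-dimension SD computed here corresponds in spirit to the "inter-rater SD per node per year" reported in the envelope files, though the exact estimator used there (sample or population SD) is not stated on this page.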
Formal validation of the ensemble approach: the Wright Thesis (that averaging N scorers shrinks the standard deviation of the ensemble mean by exactly 1/√N, so that SNR improves by √5 for a five-scorer ensemble) has been confirmed against CAMS Germany and USA data (1880–2026). Crucially, inter-rater disagreement is structured and historiographically meaningful, not random noise.
Ensemble Validation Report →
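The 1/√N scaling cited above can be illustrated with a small simulation. This sketch assumes the idealised case of independent scorers with equal-variance Gaussian noise, which is an assumption of the illustration and not a description of the validation report's method. Under that assumption the standard deviation of the ensemble mean is σ/√N (the variance itself scales as 1/N), so a five-scorer ensemble should be roughly √5 ≈ 2.24 times less noisy than a single scorer. The function name and parameters are hypothetical.

```python
import random
from statistics import pstdev

def simulate_sd_ratio(n_scorers=5, trials=20000, noise_sd=1.0, seed=0):
    """Ratio of single-scorer noise SD to ensemble-mean noise SD.

    Assumes independent, zero-mean Gaussian scorer noise with equal variance.
    """
    rng = random.Random(seed)
    single = [rng.gauss(0.0, noise_sd) for _ in range(trials)]
    ensemble = [
        sum(rng.gauss(0.0, noise_sd) for _ in range(n_scorers)) / n_scorers
        for _ in range(trials)
    ]
    return pstdev(single) / pstdev(ensemble)

ratio = simulate_sd_ratio()
print(f"SD ratio for 5 scorers: {ratio:.3f} (theory: {5 ** 0.5:.3f})")
```

Real scorer disagreement that is structured rather than random would not be independent noise, so measured ratios on actual data can depart from this idealised value.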
All datasets are in the public GitHub repository under data/cleaned/. No login required.
Browse all datasets on GitHub
Full datasets index (DATASETS_INDEX.md)
New Datasets — May 2026 · cam5 Extension Series
19 nations now have matched ensemble mean + envelope pairs. Added May–June 2026: Colombia (1875–2026 annual), USA (1900–2026 5-yr), Türkiye (1875–2026 5-yr), India (1875–2026 5-yr), Japan (1875–2026 5-yr), and Australia envelope. Earlier additions: Argentina, Canada, Chile, France, UK, Germany, Norway, Poland, Russia, Sweden, Iran, China, Thailand. New series in data/nations/; earlier series in cleaned_datasets/.
Americas
Colombia — Ensemble Mean
1875–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Colombia — Envelope
1875–2026 · inter-rater SD per node per year · with V_range
United States — Ensemble Mean
1900–2026 · 5-Agent Ensemble · 5-year snapshots · with Node Value & Bond Strength
United States — Envelope
1900–2026 · inter-rater SD per node per year · with V_range
Argentina — Ensemble Mean
1950–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Argentina — Envelope
1950–2026 · inter-rater SD per node per year
Argentina
1950–2026 · 5-Agent Ensemble · raw scores (C, K, S, A)
Canada — Ensemble Mean
1850–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Canada — Envelope
1850–2026 · inter-rater SD per node per year
Chile — Ensemble Mean
1875–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Chile — Envelope
1875–2026 · inter-rater SD per node per year · with V_range
Europe
France — Ensemble Mean
1850–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
France — Envelope
1850–2026 · inter-rater SD per node per year · with V_range
United Kingdom — Ensemble Mean
1880–2026 · CAMNations5 · with Node Value & Bond Strength
United Kingdom — Envelope
1880–2026 · inter-rater SD per node per year · with V_range
Germany — Ensemble Mean
1880–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Germany — Envelope
1880–2026 · inter-rater SD per node per year · with V_range
Norway — Ensemble Mean
1880–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Norway — Envelope
1880–2026 · inter-rater SD per node per year · with V_range
Poland — Ensemble Mean
1875–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Poland — Envelope
1875–2026 · inter-rater SD per node per year · with V_range
Russia — Ensemble Mean
1800–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Russia — Envelope
1800–2026 · inter-rater SD per node per year · with V_range
Sweden — Ensemble Mean
1880–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Sweden — Envelope
1850–2026 · inter-rater SD per node per year · with V_range
Türkiye — Ensemble Mean
1875–2026 · 5-Agent Ensemble · 5-year snapshots · with Node Value & Bond Strength
Türkiye — Envelope
1875–2026 · inter-rater SD per node per year · with V_range
Middle East
Iran — Ensemble Mean
1875–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Iran — Envelope
1875–2026 · inter-rater SD per node per year · with V_range
Asia-Pacific
Australia — Ensemble Mean
1875–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Australia — Envelope
1875–2026 · inter-rater SD per node per year · with V_range
India — Ensemble Mean
1875–2026 · 5-Agent Ensemble · 5-year snapshots · with Node Value & Bond Strength
India — Envelope
1875–2026 · inter-rater SD per node per year · with V_range
Japan — Ensemble Mean
1875–2026 · 5-Agent Ensemble · 5-year snapshots · with Node Value & Bond Strength
Japan — Envelope
1875–2026 · inter-rater SD per node per year · with V_range
China — Ensemble Mean
1800–2025 · CAMNations5 · with Node Value & Bond Strength
China — Envelope
1800–2025 · inter-rater SD per node per year · with V_range
China — Block2 Envelope
1800–2025 · CAMNations5 Block2 · SD + V_range · V_min · V_max
China — Envelope (1850–2026)
1850–2026 · inter-rater SD · extended series · with V_range
China — Extended Ensemble
1850–2026 · 5-Agent Ensemble · extended series
Thailand — Ensemble Mean
1850–2026 · 5-Agent Ensemble · with Node Value & Bond Strength
Thailand — Envelope
1850–2025 · inter-rater SD per node per year · with V_range
Ancient & Pre-Modern
Latium Vetus — Ensemble Mean
460–2010 · Multi-Agent Ensemble · Roman institutional history
Latium Vetus — Envelope
460–2010 · inter-rater SD per node per year · with V_range
New Datasets — May 2026 · 22 Societies
22 new single-scorer and ensemble datasets added May 2026, now powering the CAMS National Diagnostic (32 societies). Canonical 8-node format throughout.
⚠ Pending integration: These datasets are not yet available in the CAMS Explorer, CAMS Interpreter, Zeitgeist Detector, or other dynamic services. They will be added to all live tools when resources allow.
Western Europe
France
1785–2024 · Single-scorer · C, K, S, A, V, BS
Netherlands
1750–2024 · Single-scorer · C, K, S, A, V, BS
Denmark
1752–2025 · Single-scorer · C, K, S, A, V, BS
Norway
1881–2025 · GEM scorer · C, K, S, A, V, BS
East Asia & Pacific
Japan
1850–2025 · Single-scorer · C, K, S, A, V, BS
Hong Kong
1900–2015 · Single-scorer · C, K, S, A, V, BS
Singapore
1935–2025 · Single-scorer · C, K, S, A, V, BS
Thailand
1850–2025 · Single-scorer · C, K, S, A, V, BS
Indonesia
1941–2025 · Single-scorer · C, K, S, A, V, BS
South Asia
India
1950–2024 · Single-scorer · C, K, S, A, V, BS
Pakistan
1947–2025 · Single-scorer · C, K, S, A, V, BS
Middle East & North Africa
Saudi Arabia
1918–2025 · Single-scorer · C, K, S, A, V, BS
UAE
1970–2026 · GEM scorer · C, K, S, A, V, BS
Iran — Ensemble Mean
5-Agent Ensemble Mean · C, K, S, A, V, BS
Iran — Envelope
Inter-rater SD per node per year
Iran — Single-scorer (no BS)
1900–2025 · C, K, S, A, V · bond strength unavailable
Iraq
1900–2025 · Single-scorer · C, K, S, A, V, BS
Israel
1946–2025 · Single-scorer · C, K, S, A, V, BS
Lebanon
1943–2025 · Single-scorer · C, K, S, A, V, BS
Syria
1893–2024 · Single-scorer · C, K, S, A, V, BS
Eurasia
Ukraine
1980–2025 · Single-scorer · C, K, S, A, V, BS
Americas & Africa
Venezuela
1970–2025 · GEM scorer · C, K, S, A, V, BS
Brazil
1880–2025 · Grok scorer · C, K, S, A, V, BS
South Africa
1880–2025 · GEM scorer · C, K, S, A, V, BS
All Datasets — 76 files · 62,871 records
| Society | Years | Records | Family | Region | Download |
CSV Schema
Every dataset follows the same column structure — directly loadable into the CAMS Dashboard, Python scripts, or any data tool.
| Column | Type | Description |
| Society | string | Society or organisation name |
| Year | integer | Year of observation |
| Node | string | One of the 8 institutional nodes (Lore, Archive, Helm, Stewards, Shield, Craft, Hands, Flow) |
| Coherence | float 1–10 | Internal alignment of the node |
| Capacity | float 1–10 | Resources and effectiveness |
| Stress | float 1–10 | Accumulated pressure and dysfunction |
| Abstraction | float 1–10 | Symbolic sophistication and long-range planning |
| Node Value | float | Derived: C + K + (A/2) − S |
| Bond Strength | float 0–1 | Derived: coupling with adjacent node |
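As a quick sanity check on a downloaded file, the schema above can be validated and the Node Value column recomputed from the raw scores. The sketch below is illustrative only: it assumes comma-separated files whose header names match the column names in the table exactly (the actual files may use abbreviations such as C, K, S, A), and the sample rows, the society name "Exampleland", and the helper names are invented for the example. Bond Strength is not recomputed because its formula is not given here.

```python
import csv
import io

# Hypothetical two-row sample in the documented column layout (not real CAMS data).
SAMPLE_CSV = """Society,Year,Node,Coherence,Capacity,Stress,Abstraction,Node Value,Bond Strength
Exampleland,1900,Helm,6.0,7.0,4.0,5.0,11.5,0.62
Exampleland,1900,Flow,5.0,6.0,5.0,4.0,8.0,0.55
"""

NODES = {"Lore", "Archive", "Helm", "Stewards", "Shield", "Craft", "Hands", "Flow"}
SCORE_COLUMNS = ("Coherence", "Capacity", "Stress", "Abstraction")

def node_value(c, k, s, a):
    """Node Value as documented in the schema: C + K + (A/2) - S."""
    return c + k + a / 2 - s

def check_rows(csv_text, tolerance=0.01):
    """Return a list of human-readable problems found in the CSV text."""
    problems = []
    reader = csv.DictReader(io.StringIO(csv_text))
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        if row["Node"] not in NODES:
            problems.append(f"line {line_no}: unknown node {row['Node']!r}")
        c, k, s, a = (float(row[name]) for name in SCORE_COLUMNS)
        if not all(1.0 <= v <= 10.0 for v in (c, k, s, a)):
            problems.append(f"line {line_no}: score outside the 1-10 range")
        if abs(node_value(c, k, s, a) - float(row["Node Value"])) > tolerance:
            problems.append(f"line {line_no}: Node Value does not match the formula")
    return problems

print(check_rows(SAMPLE_CSV))
```

To check a real file, read it with `open(path, newline="")` and pass its contents to `check_rows`, adjusting the header names if the file uses different ones.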