Where the programme stands — what has been built, what the evidence shows, and what we are still earning. Refreshed roughly monthly.
Last updated: 22 June 2026
This is already a substantial body of work. A complete framework — a documented ontology in which any system is read as three coupled capacities — integration, interaction and bonding, each with a level and an orientation — and a working engine that implements it: it scores from each system's own stated-and-enacted purpose and cross-checks structure against that meaning at every step, so the model is built to audit itself rather than confirm its assumptions. Behind it sits a real case base — a corpus of 200+ historical and contemporary cases held for leakage-free analysis, spanning civilisations and states, organisations (among them a globally systemic bank), and the major economic crises of the last century; a growing set modelled in depth as multi-tick trajectories, fifteen full structural reads, and a 600-plus-check test battery — plus a standing, sealed register of dated forecasts on live systems. Most of this is engine-supported today; none of it is yet validated against data the model has not seen.
Produced on cases built from the historical record — real evidence that the mechanism is coherent and carries information:
A different kind of evidence, added June 2026 — not whether the model is right about the future, but whether its readings are reproducible and its categories sound. We handed the written method to independent scorers who had no contact with us, or with each other, and checked whether they arrived where the method says they should.
The same instrument has been run across very different systems, each read held to the same sealed discipline — a spread that matters as much as any single result:
One limit holds across the record, stated once rather than case by case: these are consistency checks and blind-localisation results on cases built from the historical record — genuinely informative, but not a positive result on data the model has never seen. The historical cases also carry the ordinary cautions (sources written later; the risk of a clean arc drawn over a messy record), and where a reading agrees with the secondary literature that is a consistency check, not independent confirmation. Closing that gap is what the forward register below is for.
A sealed register of dated forecasts on live systems runs since June 2026 — with controls (systems predicted to stay sound, so a doom-bias would be caught) and tamper-evident timestamps. Predictions are scored as their windows close; the first close in 2027–2028, and results will be shown here whichever way they land.
We are explicit about the limits of what the reports surface today. The current read leans most fully on integration and correction; the model defines more than the report yet shows, and the additions are named and sequenced, not vague. A deeper diagnostic layer will surface what the engine already computes but the headline omits — per-strand topology and opacity, friction read as a three-part vector rather than one number, and life-cycle phase. And three genuinely new measurements are in development: a system's external interaction (how it actually couples with and responds to its environment — the dimension our environmental read is still missing), its output (the real, self-report-proof effect it imposes on the world — the check that exposes a system looking strong while quietly extracting), and its legacy (what it transmits to whatever comes next). These extend the instrument beneath the four-pillar headline, not on top of it, and each is admitted only if it carries information the current measures cannot. The output measure is also exactly what ESG and SDG “impact” reporting reaches for and rarely captures — a system's real, conduct-grounded effect on the world; when it lands, it becomes the rigorous core of the reporting offering described on the main site.
A mechanism only counts if it emerges in the engine without being coded in, under sealed pre-registration: claims, thresholds and falsifiers are hashed and recorded before any run, scored exactly as sealed, with misses kept permanently. The order is simulate first, then matched-shock historical tests (condition coded blind from pre-shock sources), then live forward calls — with conduct, not words, as the standard of evidence. Findings enter canon only through a formal ratification sitting. And the apparatus is distrusted by default: five broken instruments were caught before producing a single false result, and two of our own over-claims were killed by fresh-configuration tests.
One thing, stated plainly because the honesty is part of the method: a positive out-of-sample result. Two leakage-free out-of-sample tests have been run and came back null — and we treat that as information, not embarrassment. They taught us two things we now build on. First, that no single early-warning parameter works: what carries signal is the arc — the trajectory across several coupled readings over time — not any lone metric. Second, where the instrument's lane actually lies: systems read in depth and in motion, not sparse aggregates read from a distance. Both lessons fed directly into the current design. Convergence with the framework is real evidence, but it is not yet validation against the world; the mechanism continues to be strengthened, and phase-dependent results are held provisional until that review closes. None of this is hidden — it is how a serious programme builds trust, and it is exactly the gap the forward register above is designed to close.